#13 A Simple Probabilistic Approach to Learning from Positive and Unlabeled Examples

D. Zhang and W. S. Lee.
In Proceedings of the 5th Annual UK Workshop on Computational Intelligence (UKCI), pages 83–87, Sept. 2005.









Estimating Pr[P|x] and Pr[U|x]


Estimating b

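b is estimated in the same way as in [4]. The estimate is chosen using a performance measure other than the F-score, which was one of the main contributions of [4] (this task has no negative examples, so the F-score cannot be used to evaluate performance).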
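As a rough illustration of choosing b without negative examples, the sketch below scores each candidate b with r^2 / Pr[f(X)=1], which is my recollection of the criterion proposed in [4]. Here r is the recall on validation positives and Pr[f(X)=1] is the fraction of the whole validation set predicted positive. The names `pu_score`, `select_b`, and `predict` are hypothetical, and the form of the decision rule `predict(x, b)` is an assumption, not the paper's exact classifier.

```python
from typing import Callable, List, Sequence, Tuple


def pu_score(pred_pos: Sequence[int], pred_all: Sequence[int]) -> float:
    """Estimate r^2 / Pr[f(X)=1] from 0/1 predictions.

    pred_pos: predictions on the validation positives (used to estimate recall r).
    pred_all: predictions on the whole validation set (positives + unlabeled).
    """
    if len(pred_pos) == 0 or len(pred_all) == 0:
        raise ValueError("both prediction lists must be non-empty")
    recall = sum(pred_pos) / len(pred_pos)
    frac_predicted_pos = sum(pred_all) / len(pred_all)
    if frac_predicted_pos == 0:
        return 0.0  # nothing predicted positive, so recall is 0 as well
    return recall ** 2 / frac_predicted_pos


def select_b(
    candidates: Sequence[float],
    predict: Callable[[float, float], int],  # hypothetical rule: (example, b) -> 0/1
    val_pos: Sequence[float],
    val_unl: Sequence[float],
) -> Tuple[float, float]:
    """Return the candidate b with the highest score, and that score."""
    best_b, best_score = candidates[0], float("-inf")
    for b in candidates:
        pred_pos: List[int] = [predict(x, b) for x in val_pos]
        pred_all = pred_pos + [predict(x, b) for x in val_unl]
        score = pu_score(pred_pos, pred_all)
        if score > best_score:
            best_b, best_score = b, score
    return best_b, best_score


# Toy usage: each example is a single score and b acts as a threshold (assumption).
toy_predict = lambda x, b: 1 if x > b else 0
best_b, best_score = select_b(
    [0.0, 0.5, 0.75],
    toy_predict,
    val_pos=[0.9, 0.8, 0.7, 0.4],
    val_unl=[0.85, 0.3, 0.2, 0.1, 0.6, 0.05],
)
```

The criterion rewards high recall while penalising classifiers that label too much of the data positive, and neither quantity needs labelled negatives.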


[1] Denis, F., Gilleron, R. and Tommasi, M. (2002). "Text classification from positive and unlabeled examples." IPMU, 2002.
[2] Bing Liu, Yang Dai, Xiaoli Li, Wee Sun Lee and Philip Yu. "Building Text Classifiers Using Positive and Unlabeled Examples." Proceedings of the Third IEEE International Conference on Data Mining (ICDM-03), Melbourne, Florida, November 19-22, 2003.
[3] Liu, Bing and Lee, Wee Sun and Yu, Philip S. and Li, Xiaoli (2002). "Partially Supervised Classification of Text Documents." In Proc. 19th Intl. Conf. on Machine Learning.
[4] Lee, W. S. & Liu, B. "Learning with Positive and Unlabeled Examples Using Weighted Logistic Regression." In Proceedings of the Twentieth International Conference on Machine Learning (ICML), 2003.
[5] T. Joachims (1997). "A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization." In D. H. Fisher (ed.), Proceedings of ICML-97, 14th International Conference on Machine Learning, pp. 143-151, Nashville, US. Morgan Kaufmann Publishers, San Francisco, US.
