#8 Large Scale Semi-Supervised Learning

論文100本ノック

J. Weston. Proceedings of NATO Advanced Study Institute on Mining Massive Data Sets for Security, IOS Press.videolectureのvideoとかプレゼンの資料とか。 Large-Scale Semi-Supervised Learning - VideoLectures.NET Large Scale Semi-Supervised Le…

2010-01-07

#7 Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions

論文100本ノック ICML

Zhu, X., Ghahramani, Z., & Lafferty, J. (2003a). ICML-03, 20th International Conference on Machine Learning. The bulk of the harmonic functions section of the tutorial is devoted to this paper. It directly addresses many aspects of the har…

2010-01-06

#6 Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization

論文100本ノック

tutorialの資料があった。Amazonのカスタマーレビューのところにある☆がいくつかを当てるような問題にSSLを適用した、というもの。グラフベースの手法。この論文のmain contributionは3つあって教師ありでやられていたことを半教師あり学習に拡張グラフを…

2010-01-05

#5 Learning from Labeled and Unlabeled Data using Graph Mincuts

論文100本ノック ICML 半教師あり学習

プレゼンの資料がここに置いてあった。SSL(Semi-Supervised Learning)でグラフ理論を使ったものには Mincut Discrete Markov Random Fields and Harmonic Functions Mainfold Regularization Graph Kernels from the Spectum of Laplacian などなどがある(そ…

2010-01-04

#4 Learning with Positive and Unlabeled Examples Using Weighted Logistic Regression

論文100本ノック ICML 半教師あり学習

Lee, W. S. & Liu, B. In Proceedings of the Twentieth International Conference on Machine Learning (ICML (2003).この論文のmainのcontributionは2つ。出力値が(ただの実数ではなく)確率で返ってくるので、最尤法が使える。そして凸なので最適化が容易…

2010-01-03

#3 Building Text Classifiers Using Positive and Unlabeled Examples

論文100本ノック機械学習半教師あり学習

Bing Liu, Yang Dai, Xiaoli Li, Wee Sun Lee and and Philip Yu. Proceedings of the Third IEEE International Conference on Data Mining (ICDM-03), Melbourne, Florida, November 19-22, 2003.この論文も正例とラベルなしデータからの学習に関する論文…

2010-01-02

#2 Partially Supervised Classification of Text Documents

論文100本ノック ICML 半教師あり学習

Liu, Bing and Lee, Wee Sun and Yu, Philip S. and Li, Xiaoli (2002). In Proc. 19th Intl. Conf. on Machine Learning.これも考えている問題は、少数のラベルありドキュメントと大量のラベルなし(この場合はmixed documentsって書いてあるが)文章で文章分…

2010-01-01

#1 Text Classification from Labeled and Unlabeled Documents using EM

論文100本ノック機械学習半教師あり学習

Kamal Nigam, Andrew McCallum, Sebastian Thrun and Tom Mitchell. Machine Learning, 39(2/3). pp. 103-134. 2000.少数のラベルありドキュメントと大量のラベルなし文章で文章分類。学習器は主にNaive Bayes(以下NBと書く)を利用している。最初はラベル付…