Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Unsupervised word sense disambiguation rivaling supervised methods
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Weakly-supervised relation classification for information extraction
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Semi-supervised conditional random fields for improved sequence segmentation and labeling
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Graph-Based Semisupervised Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Semi-supervised model adaptation for statistical machine translation
Machine Translation
Transfer learning from multiple source domains via consensus regularization
Proceedings of the 17th ACM conference on Information and knowledge management
Word sense disambiguation: A survey
ACM Computing Surveys (CSUR)
The bootstrapping of the Yarowsky algorithm in real corpora
Information Processing and Management: an International Journal
Extracting structured information from user queries with semi-supervised conditional random fields
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Improving text categorization bootstrapping via unsupervised learning
ACM Transactions on Speech and Language Processing (TSLP)
Graph-based analysis of semantic drift in Espresso-like bootstrapping algorithms
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
PUTOP: turning predominant senses into a topic model for word sense disambiguation
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
On the use of virtual evidence in conditional random fields
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data
The Journal of Machine Learning Research
Multi-view regression via canonical correlation analysis
COLT'07 Proceedings of the 20th annual conference on Learning theory
Semi-supervised ranking for document retrieval
Computer Speech and Language
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Image segmentation by iterated region merging with localized graph cuts
Pattern Recognition
HITS-based seed selection and stop list construction for bootstrapping
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
An iterative semi-supervised approach to software fault prediction
Proceedings of the 7th International Conference on Predictive Models in Software Engineering
Mining inter-entity semantic relations using improved transductive learning
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Analysis and improvement of minimally supervised machine learning for relation extraction
NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
Semi-supervised linear discriminant analysis using moment constraints
PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Software defect prediction using semi-supervised learning with dimension reduction
Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
Bootstrapping via graph propagation
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Extracting and modeling durations for habits and events from Twitter
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Semi-supervised linear discriminant analysis through moment-constraint parameter estimation
Pattern Recognition Letters
Hi-index | 0.00 |
Many problems in computational linguistics are well suited for bootstrapping (semisupervised learning) techniques. The Yarowsky algorithm is a well-known bootstrapping algorithm, but it is not mathematically well understood. This article analyzes it as optimizing an objective function. More specifically, a number of variants of the Yarowsky algorithm (though not the original algorithm itself) are shown to optimize either likelihood or a closely related objective function K.