Active learning with committees for text categorization

Authors:
Ray Liere;Prasad Tadepalli
Affiliations:
Department of Computer Science, Oregon State University, Corvallis, OR;Department of Computer Science, Oregon State University, Corvallis, OR
Venue:
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Year:
1997

Citing 14
Cited 40

TCS: a shell for content-based text categorization

Proceedings of the sixth conference on Artificial intelligence applications
Redundant noisy attributes, attribute errors, and linear-threshold learning using winnow

COLT '91 Proceedings of the fourth annual workshop on Computational learning theory
Query by committee

COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
Automated learning of decision rules for text categorization

ACM Transactions on Information Systems (TOIS)
A sequential algorithm for training text classifiers

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Improving Generalization with Active Learning

Machine Learning - Special issue on structured connectionist systems
Bagging predictors

Machine Learning
Selective Sampling Using the Query by Committee Algorithm

Machine Learning
Effective Text Retrieval Based on Combining Evidence from the Corpus and Users

IEEE Expert: Intelligent Systems and Their Applications
Queries and Concept Learning

Machine Learning
Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm

Machine Learning
Queries and Concept Learning

Machine Learning
Information, Prediction, and Query by Committee

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Representation and Learning in Information Retrieval

Representation and Learning in Information Retrieval

Active learning for hierarchical wrapper induction

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Content-based book recommending using learning for text categorization

DL '00 Proceedings of the fifth ACM conference on Digital libraries
Active learning using adaptive resampling

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Improving learning by choosing examples intelligently in two natural language tasks

Learning language in logic
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
Active learning in neural networks

New learning paradigms in soft computing
Managing Semantic Content for the Web

IEEE Internet Computing
Query by committee, linear separation and random walks

Theoretical Computer Science
Improving Classification Accuracy of Large Test Sets Using the Ordered Classification Algorithm

IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
Dynamic Models of Expert Groups to Recommend Web Documents

ECDL '01 Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries
Interactive deduplication using active learning

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Diverse ensembles for active learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Rule writing or annotation: cost-efficient resource usage for base noun phrase chunking

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Word sense disambiguation by learning from unlabeled data

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Coaxing confidences from an old friend: probabilistic classifications from transformation rule lists

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Large-scale text categorization by batch mode active learning

Proceedings of the 15th international conference on World Wide Web
Learning the unified kernel machines for classification

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
An active approach to spoken language processing

ACM Transactions on Speech and Language Processing (TSLP)
Improving classification performance using unlabeled data: Naive Bayesian case

Knowledge-Based Systems
Active sampling for multiple output identification

Machine Learning
Selective generation of training examples in active meta-learning

International Journal of Hybrid Intelligent Systems - HIS 2007
Semisupervised SVM batch mode active learning with applications to image retrieval

ACM Transactions on Information Systems (TOIS)
Active Learning Strategies for Multi-Label Text Classification

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
A machine learning approach to sentiment analysis in multilingual Web texts

Information Retrieval
MMR-based active machine learning for bio named entity recognition

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Active algorithm selection

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Acquiring word-meaning mappings for natural language interfaces

Journal of Artificial Intelligence Research
Active learning with multiple views

Journal of Artificial Intelligence Research
Integrative Windowing

Journal of Artificial Intelligence Research
Ranking web documents with dynamic evaluation by expert groups

CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering
Batch mode active learning based multi-view text classification

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Active learning using on-line algorithms

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Fuzzy semi-supervised support vector machines

MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Improving control-knowledge acquisition for planning by active learning

ECML'06 Proceedings of the 17th European conference on Machine Learning
A semi-naive bayesian learning method for utilizing unlabeled data

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Active sampling for multiple output identification

COLT'06 Proceedings of the 19th annual conference on Learning Theory
Combining Uncertainty Sampling methods for supporting the generation of meta-examples

Information Sciences: an International Journal
EAGLE: efficient active learning of link specifications using genetic programming

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
A neuro-fuzzy immune inspired classifier for task-oriented texts

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many real-world domains, supervised learning requires a large number of training examples. In this paper, we describe an active learning method that uses a committee of learners to reduce the number of training examples required for learning. Our approach is similar to the Query by Committee framework, where disagreement among the committee members on the predicted label for the input part of the example is used to signal the need for knowing the actual value of the label. Our experiments are conducted in the text categorization domain, which is characterized by a large number of features, many of which are irrelevant. We report here on experiments using a committee of Winnow-based learners and demonstrate that this approach can reduce the number of labeled training examples required over that used by a single Winnow learner by 1-2 orders of magnitude.