Editorial: Classifying text streams by keywords using classifier ensemble

Authors:
Baoguo Yang;Yang Zhang;Xue Li
Affiliations:
College of Information Engineering, Northwest A&F University, China;College of Information Engineering, Northwest A&F University, China;School of Information Technology and Electrical Engineering, The University of Queensland, Australia
Venue:
Data & Knowledge Engineering
Year:
2011

Citing 42
Cited 2

Mining time-changing data streams

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A streaming ensemble algorithm (SEA) for large-scale classification

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
Learning Approaches for Detecting and Tracking News Events

IEEE Intelligent Systems
Partially Supervised Classification of Text Documents

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Refining Initial Points for K-Means Clustering

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Detecting Concept Drift with Support Vector Machines

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Ensemble Methods in Machine Learning

MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
PEBL: positive example based learning for Web page classification using SVM

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
On-line new event detection, clustering, and tracking (information retrieval, internet)

On-line new event detection, clustering, and tracking (information retrieval, internet)
Mining Relevant Text from Unlabelled Documents

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Dynamic Weighted Majority: A New Ensemble Method for Tracking Concept Drift

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Building Text Classifiers Using Positive and Unlabeled Examples

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Mining concept-drifting data streams using ensemble classifiers

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
RCV1: A New Benchmark Collection for Text Categorization Research

The Journal of Machine Learning Research
Systematic data selection to mine concept-drifting data streams

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Dynamic Classifier Selection for Effective Mining from Noisy Data Streams

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Decision Tree Evolution Using Limited Number of Labeled Data Items from Drifting Data Streams

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Investigations on event evolution in TDT

NAACLstudent '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Proceedings of the HLT-NAACL 2003 student research workshop - Volume 3
Semantic similarity methods in wordNet and their application to information retrieval on the web

Proceedings of the 7th annual ACM international workshop on Web information and data management
Text Classification without Negative Examples Revisit

IEEE Transactions on Knowledge and Data Engineering
Handling Local Concept Drift with Dynamic Integration of Classifiers: Domain of Antibiotic Resistance in Nosocomial Infections

CBMS '06 Proceedings of the 19th IEEE Symposium on Computer-Based Medical Systems
An automatic construction and organization strategy for ensemble learning on data streams

ACM SIGMOD Record
Clustering-training for Data Stream Mining

ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
Learning drifting concepts: Example selection vs. example weighting

Intelligent Data Analysis
An active learning system for mining time-changing data streams

Intelligent Data Analysis
The weighted majority algorithm

SFCS '89 Proceedings of the 30th Annual Symposium on Foundations of Computer Science
Learning to Classify Documents with Only a Small Positive Training Set

ECML '07 Proceedings of the 18th European conference on Machine Learning
Active Learning from Data Streams

ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Text classification from unlabeled documents with bootstrapping and feature projection techniques

Information Processing and Management: an International Journal
One-Class Classification of Text Streams with Concept Drift

ICDMW '08 Proceedings of the 2008 IEEE International Conference on Data Mining Workshops
Building a Text Classifier by a Keyword and Unlabeled Documents

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Text classification by labeling words

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
OcVFDT: one-class very fast decision tree for one-class classification of data streams

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
Learning to classify texts using positive and unlabeled data

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Document clustering using synthetic cluster prototypes

Data & Knowledge Engineering
SyMSS: A syntax-based measure for short-text semantic similarity

Data & Knowledge Engineering
Autopedia: automatic domain-independent Wikipedia article generation

Proceedings of the 20th international conference companion on World wide web
Unsupervised extraction of keywords from news archives

LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
One-class learning and concept summarization for data streams

Knowledge and Information Systems - Special Issue on Data Warehousing and Knowledge Discovery from Sensors and Streams
Learning from positive and unlabeled examples with different data distributions

ECML'05 Proceedings of the 16th European conference on Machine Learning

Identification of trends from patents using self-organizing maps

Expert Systems with Applications: An International Journal
Analyzing multilingual knowledge innovation in patents

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional approaches for text data stream classification usually require the manual labeling of a number of documents, which is an expensive and time consuming process. In this paper, to overcome this limitation, we propose to classify text streams by keywords without labeled documents so as to reduce the burden of labeling manually. We build our base text classifiers with the help of keywords and unlabeled documents to classify text streams, and utilize classifier ensemble algorithms to cope with concept drifting in text data streams. Experimental results demonstrate that the proposed method can build good classifiers by keywords without manual labeling, and when the ensemble based algorithm is used, the concept drift in the streams can be well detected and adapted, which performs better than the single window algorithm.