Classifying news stories using memory based reasoning
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
An example-based mapping method for text categorization and retrieval
ACM Transactions on Information Systems (TOIS)
The nature of statistical learning theory
The nature of statistical learning theory
Feature selection, perceptron learning, and a usability case study for text categorization
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval
Analyzing the effectiveness and applicability of co-training
Proceedings of the ninth international conference on Information and knowledge management
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Constrained K-means Clustering with Background Knowledge
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Combining clustering and co-training to enhance text classification using unlabelled data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Semi-supervised model-based document clustering: A comparative study
Machine Learning
Using clustering to enhance text classification
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Co-clustering based classification for out-of-domain documents
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A multiview approach for intelligent data analysis based on data operators
Information Sciences: an International Journal
Classification techniques with minimal labelling effort and application to medical reports
International Journal of Data Mining and Bioinformatics
A Semi-supervised Topic-Driven Approach for Clustering Textual Answers to Survey Questions
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Semi-supervised Text Classification Using RBF Networks
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Cluster based symbolic representation and feature selection for text classification
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Dissimilarity based feature selection for text classification: a cluster based approach
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
A subspace decision cluster classifier for text classification
Expert Systems with Applications: An International Journal
Clustering and categorization of Brazilian portuguese legal documents
PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
An ensemble of decision cluster crotches for classification of high dimensional data
Knowledge-Based Systems
Online semi-supervised discriminative dictionary learning for sparse representation
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
The impact of semi-supervised clustering on text classification
Proceedings of the 17th Panhellenic Conference on Informatics
Improving semi-supervised text classification by using wikipedia knowledge
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Hi-index | 0.00 |
Semi-supervised learning methods construct classifiersusing both labeled and unlabeled training data samples.While unlabeled data samples can help to improve theaccuracy of trained models to certain extent, existingmethods still face difficulties when labeled data is notsufficient and biased against the underlying datadistribution. In this paper, we present a clustering basedclassification (CBC) approach. Using this approach,training data, including both the labeled and unlabeleddata, is first clustered with the guidance of the labeleddata. Some of unlabeled data samples are then labeledbased on the clusters obtained. Discriminative classifierscan subsequently be trained with the expanded labeleddataset. The effectiveness of the proposed method isjustified analytically. Our experimental resultsdemonstrated that CBC outperforms existing algorithmswhen the size of labeled dataset is very small.