Elements of information theory
Elements of information theory
An evaluation of phrasal and clustered representations on a text categorization task
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
A sequential algorithm for training text classifiers
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
On the exponential value of labeled samples
Pattern Recognition Letters
Combining classifiers in text categorization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Context-sensitive learning methods for text categorization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Bayesian classification (AutoClass): theory and results
Advances in knowledge discovery and data mining
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Learning to extract symbolic knowledge from the World Wide Web
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Improving the mean field approximation via the use of mixture distributions
Learning in graphical models
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Machine Learning
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality
Data Mining and Knowledge Discovery
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Preventing "Overfitting" of Cross-Validation Data
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Hierarchically Classifying Documents Using Very Few Words
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Improving Text Classification by Shrinkage in a Hierarchy of Classes
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Employing EM and Pool-Based Active Learning for Text Classification
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Document classification using a finite mixture model
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A new metric-based approach to model selection
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Active learning with committees for text categorization
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Syskill & webert: Identifying interesting web sites
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
The psychology of multimedia databases
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Analyzing the effectiveness and applicability of co-training
Proceedings of the ninth international conference on Information and knowledge management
Text categorization for multi-page documents: a hybrid naive Bayes HMM approach
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Using LSI for text classification in the presence of background text
Proceedings of the tenth international conference on Information and knowledge management
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Topic-oriented collaborative crawling
Proceedings of the eleventh international conference on Information and knowledge management
Automating the Construction of Internet Portals with Machine Learning
Information Retrieval
Hidden Markov Models for Text Categorization in Multi-Page Documents
Journal of Intelligent Information Systems
ITtalks: A Case Study in the Semantic Web and DAML+OIL
IEEE Intelligent Systems
Webmining: learning from the world wide web
Computational Statistics & Data Analysis - Nonlinear methods and data mining
Text classification using ESC-based stochastic decision lists
Information Processing and Management: an International Journal
ECML '00 Proceedings of the 11th European Conference on Machine Learning
Text Categorization Using Transductive Boosting
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Support Vector Machines for Polycategorical Classification
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Learning Classification with Both Labeled and Unlabeled Data
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Reliable Classifications with Machine Learning
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Towards Self-Exploring Discriminating Features
MLDM '01 Proceedings of the Second International Workshop on Machine Learning and Data Mining in Pattern Recognition
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
Boosting Mixture Models for Semi-supervised Learning
ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Semi-supervised Learning in Medical Image Database
PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
Text Categorization Using Adaptive Context Trees
CICLing '01 Proceedings of the Second International Conference on Computational Linguistics and Intelligent Text Processing
Augmenting Supervised Neural Classifier Training Using a Corpus of Unlabeled Data
KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Modeling Information in Textual Data Combining Labeled and Unlabeled Data
Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
Theoretical Computer Science
A parallel learning algorithm for text classification
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
PEBL: positive example based learning for Web page classification using SVM
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Exploiting unlabeled data in ensemble methods
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Combining clustering and co-training to enhance text classification using unlabelled data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Single-shot detection of multiple categories of text using parametric mixture models
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
B-EM: a classifier incorporating bootstrap with EM approach for data mining
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Topic analysis using a finite mixture model
Information Processing and Management: an International Journal
A Comparison of Word- and Sense-Based Text Categorization Using Several Classification Algorithms
Journal of Intelligent Information Systems
The Journal of Machine Learning Research
Facial expression recognition from video sequences: temporal and static modeling
Computer Vision and Image Understanding - Special issue on Face recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Tractable Group Detection on Large Link Data Sets
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Building Text Classifiers Using Positive and Unlabeled Examples
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
CBC: Clustering Based Text Classification Requiring Minimal Labeled Data
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Exploiting Unlabeled Data for Improving Accuracy of Predictive Data Mining
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Privacy-preserving Distributed Clustering using Generative Models
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Gene functional classification by semi-supervised learning from heterogeneous data
Proceedings of the 2003 ACM symposium on Applied computing
Recognizing the relations between Web pages using artificial neural network
Proceedings of the 2003 ACM symposium on Applied computing
PEBL: Web Page Classification without Negative Examples
IEEE Transactions on Knowledge and Data Engineering
Domain-Specific Web Search with Keyword Spices
IEEE Transactions on Knowledge and Data Engineering
Generative model-based clustering of directional data
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Cross-training: learning probabilistic mappings between topics
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
XRules: an effective structural classifier for XML data
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Text classification from positive and unlabeled documents
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Exploitation of Unlabeled Sequences in Hidden Markov Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
Segmentation Given Partial Grouping Constraints
IEEE Transactions on Pattern Analysis and Machine Intelligence
Evolutionary semi-supervised fuzzy clustering
Pattern Recognition Letters
A new differential LSI space-based probabilistic document classifier
Information Processing Letters
Effect of term distributions on centroid-based text categorization
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Informatics and computer science intelligent systems applications
Semi-supervised learning for facial expression recognition
MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Machine learning in low-level microarray analysis
ACM SIGKDD Explorations Newsletter
Liveclassifier: creating hierarchical text classifiers through web corpora
Proceedings of the 13th international conference on World Wide Web
Websights: who owns streaming media?
IEEE Spectrum
Semantic video classification and feature subset selection under context and concept uncertainty
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
A GA-based neural network weight optimization technique for semi-supervised classifier learning
Design and application of hybrid intelligent systems
Dominant meanings classification model for web information
Design and application of hybrid intelligent systems
Word translation disambiguation using bilingual bootstrapping
Computational Linguistics
Information Processing and Management: an International Journal
Editorial: special issue on learning from imbalanced data sets
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Semi-Supervised Learning on Riemannian Manifolds
Machine Learning
Semantic video classification by integrating unlabeled samples for classifier training
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Learning to Decode Cognitive States from Brain Images
Machine Learning
Dealing with different distributions in learning from
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
A probabilistic framework for semi-supervised clustering
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Leveraging the margin more carefully
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Active learning using pre-clustering
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
A hierarchical graphical model for record linkage
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
A selective sampling approach to active feature selection
Artificial Intelligence
Semisupervised learning from different information sources
Knowledge and Information Systems
Toward Integrating Feature Selection Algorithms for Classification and Clustering
IEEE Transactions on Knowledge and Data Engineering
DIGIMIMIR: A Tool for Rapid Situation Analysis of Helpdesk and Support Email
LISA '04 Proceedings of the 18th USENIX conference on System administration
Text Classification without Labeled Negative Documents
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
An analysis of the relative hardness of Reuters-21578 subsets: Research Articles
Journal of the American Society for Information Science and Technology
The infocious web search engine: improving web searching through linguistic analysis
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
HLT '01 Proceedings of the first international conference on Human language technology research
Applying co-training methods to statistical parsing
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Weakly supervised natural language learning without redundant views
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Word sense disambiguation by learning from unlabeled data
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
A probabilistic model for retrospective news event detection
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Mining images on semantics via statistical learning
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Price prediction and insurance for online auctions
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
A hybrid unsupervised approach for document clustering
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
A multinomial clustering model for fast simulation of computer architecture designs
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Analyzing Gene Expression Time-Courses
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Constrained EM for parallel text alignment
Natural Language Engineering
Unsupervised named-entity extraction from the web: an experimental study
Artificial Intelligence
Tri-Training: Exploiting Unlabeled Data Using Three Classifiers
IEEE Transactions on Knowledge and Data Engineering
Open Set Face Recognition Using Transduction
IEEE Transactions on Pattern Analysis and Machine Intelligence
Mining officially unrecognized side effects of drugs by combining web search and machine learning
Proceedings of the 14th ACM international conference on Information and knowledge management
Taxonomies by the numbers: building high-performance taxonomies
Proceedings of the 14th ACM international conference on Information and knowledge management
A novel approach for privacy-preserving video sharing
Proceedings of the 14th ACM international conference on Information and knowledge management
Effects of web document evolution on genre classification
Proceedings of the 14th ACM international conference on Information and knowledge management
Automated rich presentation of a semantic topic
Proceedings of the 13th annual ACM international conference on Multimedia
Semi-supervised learning with an imperfect supervisor
Knowledge and Information Systems
Logistic regression with an auxiliary data source
ICML '05 Proceedings of the 22nd international conference on Machine learning
A model for handling approximate, noisy or incomplete labeling in text classification
ICML '05 Proceedings of the 22nd international conference on Machine learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
Text Classification without Negative Examples Revisit
IEEE Transactions on Knowledge and Data Engineering
A Framework for Semi-Supervised Learning Based on Subjective and Objective Clustering Criteria
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A Discriminative Learning Framework with Pairwise Constraints for Video Object Classification
IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Clustering with Partial Supervision
Data Mining and Knowledge Discovery
Topic analysis using a finite mixture model
EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Two-dimensional clustering for text categorization
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
A differential LSI method for document classification
AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
Poisson naive Bayes for text classification with feature weighting
AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Training a naive bayes classifier via the EM algorithm with a class distribution constraint
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Bootstrapping coreference classifiers with multiple machine learning algorithms
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Mixture Modeling with Pairwise, Instance-Level Class Constraints
Neural Computation
2005 Special issue: A new classifier based on information theoretic learning with unlabeled data
Neural Networks - 2005 Special issue: IJCNN 2005
POLYPHONET: an advanced social network extraction system from the web
Proceedings of the 15th international conference on World Wide Web
Digital Content Recommender on the Internet
IEEE Intelligent Systems
Semi-supervised outlier detection
Proceedings of the 2006 ACM symposium on Applied computing
Nonstationary kernel combination
ICML '06 Proceedings of the 23rd international conference on Machine learning
Active learning via transductive experimental design
ICML '06 Proceedings of the 23rd international conference on Machine learning
A probabilistic model for approximate identity matching
dg.o '06 Proceedings of the 2006 international conference on Digital government research
Blocking objectionable web content by leveraging multiple information sources
ACM SIGKDD Explorations Newsletter
Text mining for product attribute extraction
ACM SIGKDD Explorations Newsletter
Enhancing relevance feedback in image retrieval using unlabeled data
ACM Transactions on Information Systems (TOIS)
Semi-supervised time series classification
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Incorporating large unlabeled data to enhance EM classification
Journal of Intelligent Information Systems
Privacy leakage in multi-relational databases: a semi-supervised learning perspective
The VLDB Journal — The International Journal on Very Large Data Bases
Learning from positive and unlabeled examples
Theoretical Computer Science - Algorithmic learning theory (ALT 2000)
Semi-supervised model-based document clustering: A comparative study
Machine Learning
Is linguistic information relevant for the classification of legal texts?
ICAIL '05 Proceedings of the 10th international conference on Artificial intelligence and law
Concept learning and transplantation for dynamic image databases
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Some Effective Techniques for Naive Bayes Text Classification
IEEE Transactions on Knowledge and Data Engineering
An active approach to spoken language processing
ACM Transactions on Speech and Language Processing (TSLP)
Automatic video annotation by semi-supervised learning with kernel density estimation
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Topic evolution and social interactions: how authors effect research
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Web-based text classification in the absence of manually labeled training documents
Journal of the American Society for Information Science and Technology
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Clustering with Bregman Divergences
The Journal of Machine Learning Research
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data
The Journal of Machine Learning Research
Active EM to reduce noise in activity recognition
Proceedings of the 12th international conference on Intelligent user interfaces
Statistical machine translation with word- and sentence-aligned parallel corpora
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
The sentimental factor: improving review classification via human-provided information
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Experiments in parallel-text based grammar induction
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Semi-supervised conditional random fields for improved sequence segmentation and labeling
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Semi-supervised training for statistical word alignment
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Time period identification of events in text
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A backoff model for bootstrapping resources for non-English languages
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Syntax-based semi-supervised named entity tagging
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
A semi-supervised regression model for mixed numerical and categorical variables
Pattern Recognition
A hybrid generative/discriminative approach to text classification with additional information
Information Processing and Management: an International Journal - Special issue: AIRS2005: Information retrieval research in Asia
Improving classification performance using unlabeled data: Naive Bayesian case
Knowledge-Based Systems
Text classification: A least square support vector machine approach
Applied Soft Computing
Inference and evaluation of the multinomial mixture model for text clustering
Information Processing and Management: an International Journal
Bayesian analysis of finite mixtures of multinomial and negative-multinomial distributions
Computational Statistics & Data Analysis
Semi-supervised single-label text categorization using centroid-based classifiers
Proceedings of the 2007 ACM symposium on Applied computing
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples
The Journal of Machine Learning Research
HOTOS'05 Proceedings of the 10th conference on Hot Topics in Operating Systems - Volume 10
Knowledge and Information Systems
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Two-view feature generation model for semi-supervised learning
Proceedings of the 24th international conference on Machine learning
Self-taught learning: transfer learning from unlabeled data
Proceedings of the 24th international conference on Machine learning
On the relation between multi-instance learning and semi-supervised learning
Proceedings of the 24th international conference on Machine learning
Journal of Visual Communication and Image Representation
Co-clustering based classification for out-of-domain documents
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Iterative cross-training: An algorithm for web page categorization
Intelligent Data Analysis
Intelligent Data Analysis
Extending boosting for large scale spoken language understanding
Machine Learning
Semisupervised Regression with Cotraining-Style Algorithms
IEEE Transactions on Knowledge and Data Engineering
Artificial Intelligence in Medicine
POLYPHONET: An advanced social network extraction system from the Web
Web Semantics: Science, Services and Agents on the World Wide Web
A clustering framework based on subjective and objective validity criteria
ACM Transactions on Knowledge Discovery from Data (TKDD)
An incremental cluster-based approach to spam filtering
Expert Systems with Applications: An International Journal
Label Propagation through Linear Neighborhoods
IEEE Transactions on Knowledge and Data Engineering
Model-driven formative evaluation of exploratory search: A study under a sensemaking framework
Information Processing and Management: an International Journal
Sensitive webpage classification for content advertising
Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising
Feature synthesized EM algorithm for image retrieval
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Part-of-speech tagging of modern hebrew text
Natural Language Engineering
Citation data clustering for author name disambiguation
Proceedings of the 2nd international conference on Scalable information systems
Semi-supervised learning integrated with classifier combination for word sense disambiguation
Computer Speech and Language
Proceedings of the 17th international conference on World Wide Web
Can chinese web pages be classified with english data source?
Proceedings of the 17th international conference on World Wide Web
MATH'07 Proceedings of the 12th WSEAS International Conference on Applied Mathematics
Semisupervised learning from dissimilarity data
Computational Statistics & Data Analysis
A two-step classification approach to unsupervised record linkage
AusDM '07 Proceedings of the sixth Australasian conference on Data mining and analytics - Volume 70
Extending boosting for large scale spoken language understanding
Machine Learning
Exploring hedge identification in biomedical literature
Journal of Biomedical Informatics
The asymptotics of semi-supervised learning in discriminative probabilistic models
Proceedings of the 25th international conference on Machine learning
On multi-view active learning and the combination with semi-supervised learning
Proceedings of the 25th international conference on Machine learning
A boosting algorithm for learning bipartite ranking functions with partially labeled data
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Topic-bridged PLSA for cross-domain text classification
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
An efficient spatial semi-supervised learning algorithm
International Journal of Parallel, Emergent and Distributed Systems
Structured entity identification and document categorization: two tasks with one joint model
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Journal of Computational Methods in Sciences and Engineering - Computational and Mathematical Methods for Science and Engineering Conference 2002 - CMMSE-2002
Multinomial mixture model with feature selection for text clustering
Knowledge-Based Systems
Discrete data clustering using finite mixture models
Pattern Recognition
Training the Hidden Vector State Model from Un-annotated Corpus
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Transductive Learning from Relational Data
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Learning to Classify Documents with Only a Small Positive Training Set
ECML '07 Proceedings of the 18th European conference on Machine Learning
Analyzing Co-training Style Algorithms
ECML '07 Proceedings of the 18th European conference on Machine Learning
Improving Automatic Image Annotation Based on Word Co-occurrence
Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Comparing Non-parametric Ensemble Methods for Document Clustering
NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
A Fault Prediction Model with Limited Fault Data to Improve Test Process
PROFES '08 Proceedings of the 9th international conference on Product-Focused Software Process Improvement
Iterative Reinforcement Cross-Domain Text Classification
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Watch, Listen & Learn: Co-training on Captioned Images and Videos
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
A Web-Based Self-training Approach for Authorship Attribution
GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Identifying web spam with user behavior analysis
AIRWeb '08 Proceedings of the 4th international workshop on Adversarial information retrieval on the web
Improving supervised learning performance by using fuzzy clustering method to select training data
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Fuzzy theory and technology with applications
Journal of Computer and System Sciences
Cross-Domain Knowledge Transfer Using Semi-supervised Classification
AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
A Learning Scheme for Recognizing Sub-classes from Model Trained on Aggregate Classes
SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
An Ontology-Based Sentiment Classification Methodology for Online Consumer Reviews
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Classification techniques with minimal labelling effort and application to medical reports
International Journal of Data Mining and Bioinformatics
Semi-supervised kernel density estimation for video annotation
Computer Vision and Image Understanding
Graph-based semi-supervised learning with multiple labels
Journal of Visual Communication and Image Representation
A multi-view approach to semi-supervised document classification with incremental Naive Bayes
Computers & Mathematics with Applications
Employee turnover: a novel prediction solution with effective feature selection
CEA'09 Proceedings of the 3rd WSEAS international conference on Computer engineering and applications
Semi-Supervised Learning to Classify Evaluative Expressions from Labeled and Unlabeled Texts
IEICE - Transactions on Information and Systems
Effects of Term Distributions on Binary Classification
IEICE - Transactions on Information and Systems
Using the Web as corpus for self-training text categorization
Information Retrieval
A sentence level probabilistic model for evolutionary theme pattern mining from news corpora
Proceedings of the 2009 ACM symposium on Applied Computing
Semi-supervised document retrieval
Information Processing and Management: an International Journal
Kernel-Based Transductive Learning with Nearest Neighbors
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Latent Variable Models for Causal Knowledge Acquisition
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
A Comparative Study of Utilizing Topic Models for Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Investigating Learning Approaches for Blog Post Opinion Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Building a Text Classifier by a Keyword and Unlabeled Documents
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
A statistical approach to crosslingual natural language tasks
Journal of Algorithms
A survey on sentiment detection of reviews
Expert Systems with Applications: An International Journal
Artificial neural network reduction through oracle learning
Intelligent Data Analysis
Employee turnover: a novel prediction solution with effective feature selection
WSEAS Transactions on Information Science and Applications
Analysis of the effect of Headline News in financial market through text categorisation
International Journal of Computer Applications in Technology
Information theoretic regularization for semi-supervised boosting
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient Learning from Few Labeled Examples
ISNN '09 Proceedings of the 6th International Symposium on Neural Networks on Advances in Neural Networks
Multi-view Semi-supervised Learning: An Approach to Obtain Different Views from Text Datasets
Proceedings of the 2005 conference on Advances in Logic Based Intelligent Systems: Selected Papers of LAPTEC 2005
Challenges and Research Directions for Adaptive Biometric Recognition Systems
ICB '09 Proceedings of the Third International Conference on Advances in Biometrics
Expectation maximization enhancement with evolutionstrategy for stochastic ontology mapping
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Semi-supervised fuzzy clustering: A kernel-based approach
Knowledge-Based Systems
Hybrid Hierarchical Classifiers for Hyperspectral Data Analysis
MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
A general procedure for learning mixtures of independent component analyzers
Pattern Recognition
Using LDA to detect semantically incoherent documents
CoNLL '08 Proceedings of the Twelfth Conference on Computational Natural Language Learning
Mining the web for reciprocal relationships
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Methods for domain-independent information extraction from the web: an experimental comparison
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Text classification by labeling words
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Towards modeling threaded discussions using induced ontology knowledge
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Extracting knowledge about users' activities from raw workstation contents
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Multi-conditional learning: generative/discriminative training for clustering and classification
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Homotopy-based semi-supervised Hidden Markov Models for sequence labeling
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Text data acquisition for domain-specific language models
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
FeatureEng '05 Proceedings of the ACL Workshop on Feature Engineering for Machine Learning in Natural Language Processing
A distributed approach to enabling privacy-preserving model-based classifier training
Knowledge and Information Systems
Maximum entropy modeling in sparse semantic tagging
HLT-SRWS '04 Proceedings of the Student Research Workshop at HLT-NAACL 2004
Selecting relevant text subsets from web-data for building topic specific language models
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Building a Text Classifier by a Keyword and Wikipedia Knowledge
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Semi-supervised Text Classification Using RBF Networks
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Subspace Regularization: A New Semi-supervised Learning Method
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
A hybrid generative/discriminative approach to semi-supervised classifier design
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Semi-supervised sequence modeling with syntactic topic models
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
A probabilistic classification approach for lexical textual entailment
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Word sense disambiguation with semi-supervised learning
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Transferring naive bayes classifiers for text classification
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Semi-supervised learning with very few labeled training examples
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Humans perform semi-supervised classification too
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
On discriminative semi-supervised classification
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Importance of semantic representation: dataless classification
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Text categorization with knowledge transfer from heterogeneous data sources
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Semi-supervised learning for blog classification
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Learning and inference with constraints
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Automatic Arabic document categorization based on the Naïve Bayes algorithm
Semitic '04 Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages
Learning from labeled and unlabeled data: an empirical study across techniques and domains
Journal of Artificial Intelligence Research
A machine learning approach to building domain-specific search engines
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Detection of cognitive states from fMRI data using machine learning techniques
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Semi-supervised learning for multi-component data classification
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Constructing diverse classifier ensembles using artificial training examples
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Evaluating classifiers by means of test data with noisy labels
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Semi-supervised learning with explicit misclassification modeling
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning to classify texts using positive and unlabeled data
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Integrating background knowledge into text classification
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Rank Aggregation Based Text Feature Selection
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
A Software System for Topic Extraction and Document Classification
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Learning domain-specific information extraction patterns from the Web
IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Semi-supervised regression with co-training
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Unsupervised named-entity extraction from the Web: An experimental study
Artificial Intelligence
A risk minimization framework for domain adaptation
Proceedings of the 18th ACM conference on Information and knowledge management
Combining labeled and unlabeled data with word-class distribution learning
Proceedings of the 18th ACM conference on Information and knowledge management
IEEE Transactions on Information Technology in Biomedicine - Special section on computational intelligence in medical systems
SCTWC: An online semi-supervised clustering approach to topical web crawlers
Applied Soft Computing
Learning mixture models via component-wise parameter smoothing
Computational Statistics & Data Analysis
Exponential family hybrid semi-supervised learning
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Domain adaptation via transfer component analysis
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Co-training for cross-lingual sentiment classification
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
A framework of feature selection methods for text categorization
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Semi-supervised cause identification from aviation safety reports
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Fully Automatic Text Categorization by Exploiting WordNet
AIRS '09 Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology
Using Nearest Neighbor Information to Improve Cross-Language Text Classification
MICAI '09 Proceedings of the 8th Mexican International Conference on Artificial Intelligence
Query Selection via Weighted Entropy in Graph-Based Semi-supervised Classification
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Populating the Semantic Web by Macro-reading Internet Text
ISWC '09 Proceedings of the 8th International Semantic Web Conference
New Labeling Strategy for Semi-supervised Document Categorization
KSEM '09 Proceedings of the 3rd International Conference on Knowledge Science, Engineering and Management
On the use of virtual evidence in conditional random fields
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
A speed-up algorithm for Poisson propagation
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Using weak supervision in learning Gaussian mixture models
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Domain kernels for text categorization
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
A discriminative model for semi-supervised learning
Journal of the ACM (JACM)
Semi-supervised learning in knowledge discovery
Fuzzy Sets and Systems
Learning to recognize video-based spatiotemporal events
IEEE Transactions on Intelligent Transportation Systems
Diagnosis of recurrent faults using log files
CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Learning to integrate web taxonomies
Web Semantics: Science, Services and Agents on the World Wide Web
Information and Software Technology
Analyzing knowledge communities using foreground and background clusters
ACM Transactions on Knowledge Discovery from Data (TKDD)
Learning with unlabeled data and its application to image retrieval
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
A semi-supervised clustering algorithm for data exploration
IFSA'03 Proceedings of the 10th international fuzzy systems association World Congress conference on Fuzzy sets and systems
Image retrieval using mixture models and EM algorithm
SCIA'03 Proceedings of the 13th Scandinavian conference on Image analysis
Scaling up semi-supervised learning: an efficient and effective LLGC variant
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Estimation of class membership probabilities in the document classification
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Experiments on kernel tree support vector machines for text categorization
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Supervised and unsupervised learning algorithms for thai web pages identification
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Encoding classifications into lightweight ontologies
Journal on data semantics VIII
Taking advantage of the web for text classification with imbalanced classes
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Best-match method used in co-training algorithm
PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
Synopsis information extraction in documents through probabilistic text classifiers
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Resource-bounded fraud detection
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
A refinement framework for cross language text categorization
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Person name disambiguation in web pages using social network, compound words and latent topics
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Comparing LDA with pLSI as a dimensionality reduction method in document clustering
LKR'08 Proceedings of the 3rd international conference on Large-scale knowledge resources: construction and application
Semi-supervised document classification with a mislabeling error model
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Expert Systems with Applications: An International Journal
A text categorization method based on local document frequency
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Multiple-view multiple-learner active learning
Pattern Recognition
Language models learning for domain-specific natural language user interaction
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Joint learning of labels and distance metric
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
Coarse-to-fine boundary location with a SOM-like method
IEEE Transactions on Neural Networks
Semi-supervised learning based on nearest neighbor rule and cut edges
Knowledge-Based Systems
A classification algorithm based on local cluster centers with a few labeled training examples
Knowledge-Based Systems
Document clustering via dirichlet process mixture model with feature selection
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Semi-supervised sequence classification using abstraction augmented Markov models
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Distributional similarity vs. PU learning for entity set expansion
ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Expert Systems with Applications: An International Journal
Negative training data can be harmful to text classification
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
A robust semi-supervised classification method for transfer learning
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Learning naïve bayes transfer classifier throughclass-wise test distribution estimation
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Semi-supervised Bayesian ARTMAP
Applied Intelligence
Grouping product features using semi-supervised learning with soft-constraints
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Improving co-training with agreement-based sampling
RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
A brief survey on sequence classification
ACM SIGKDD Explorations Newsletter
Ant based semi-supervised classification
ANTS'10 Proceedings of the 7th international conference on Swarm intelligence
A novel initialization method for semi-supervised clustering
KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
Weakly supervised classification of objects in images using soft random forests
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Semi-supervised abstraction-augmented string kernel for multi-level bio-relation extraction
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Learning aspect models with partially labeled data
Pattern Recognition Letters
Human-computer collaborative object recognition for intelligent support
PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
A misleading attack against semi-supervised learning for intrusion detection
MDAI'10 Proceedings of the 7th international conference on Modeling decisions for artificial intelligence
Predicting consumer sentiments from online text
Decision Support Systems
On the selection of tags for tag clouds
Proceedings of the fourth ACM international conference on Web search and data mining
Clustering product features for opinion mining
Proceedings of the fourth ACM international conference on Web search and data mining
Semi-supervised multi-class Adaboost by exploiting unlabeled data
Expert Systems with Applications: An International Journal
Mining concept-drifting data streams containing labeled and unlabeled instances
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part I
Which clustering do you want? inducing your ideal clustering with minimal feedback
Journal of Artificial Intelligence Research
Combining committee-based semi-supervised learning and active learning
Journal of Computer Science and Technology
A refinement approach to handling model misfit in semi-supervised learning
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Text classification on a grid environment
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Bootstrapping SVM active learning by incorporating unlabelled images for image retrieval
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Identity matching using personal and social identity features
Information Systems Frontiers
Mixed-membership naive Bayes models
Data Mining and Knowledge Discovery
Modeling reciprocity in social interactions with probabilistic latent space models
Natural Language Engineering
A hierarchical Naïve Bayes model for approximate identity matching
Decision Support Systems
Reverse spatial and textual k nearest neighbor search
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Software defect detection with rocus
Journal of Computer Science and Technology
Text segmentation: A topic modeling perspective
Information Processing and Management: an International Journal
Applying machine learning in accounting research
Expert Systems with Applications: An International Journal
A new co-training-style random forest for computer aided diagnosis
Journal of Intelligent Information Systems
An alternative approach for statistical single-label document classification of newspaper articles
Journal of Information Science
Joint bilingual sentiment classification with unlabeled parallel corpora
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Unsupervised decomposition of a document into authorial components
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Text mining techniques for leveraging positively labeled data
BioNLP '11 Proceedings of BioNLP 2011 Workshop
On theme location discovery for travelogue services
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Integrating hierarchical feature selection and classifier training for multi-label image annotation
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Recommending ephemeral items at web scale
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Journal of Biomedical Informatics
The unsymmetrical-style co-training
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Instance selection in semi-supervised learning
Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Filling the gap: semi-supervised learning for opinion detection across domains
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Handling missing data in software effort prediction with naive Bayes and EM algorithm
Proceedings of the 7th International Conference on Predictive Models in Software Engineering
An iterative semi-supervised approach to software fault prediction
Proceedings of the 7th International Conference on Predictive Models in Software Engineering
Global/local hybrid learning of mixture-of-experts from labeled and unlabeled data
HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part I
Fuzzy semi-supervised support vector machines
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Learning from partially annotated sequences
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Multiview semi-supervised learning for ranking multilingual documents
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Semi-SAD: applying semi-supervised learning to shilling attack detection
Proceedings of the fifth ACM conference on Recommender systems
Can irrelevant data help semi-supervised learning, why and how?
Proceedings of the 20th ACM international conference on Information and knowledge management
Automatic Moderation of Online Discussion Sites
International Journal of Electronic Commerce
Finding audio-visual events in informal social gatherings
ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Continuation methods for mixing heterogeneous sources
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Dynamic categorization of clinical research eligibility criteria by hierarchical clustering
Journal of Biomedical Informatics
Bilingual co-training for sentiment classification of chinese product reviews
Computational Linguistics
Multi-view EM algorithm for finite mixture models
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Tri-training and data editing based semi-supervised clustering algorithm
MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
Transductive learning for text classification using explicit knowledge models
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Revisiting fisher kernels for document similarities
ECML'06 Proceedings of the 17th European conference on Machine Learning
A semi-naive bayesian learning method for utilizing unlabeled data
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Encoding classifications into lightweight ontologies
ESWC'06 Proceedings of the 3rd European conference on The Semantic Web: research and applications
Using weighted nearest neighbor to benefit from unlabeled data
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Comparison of documents classification techniques to classify medical reports
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Markov blankets and meta-heuristics search: sentiment extraction from unstructured texts
WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
A classifier design based on combining multiple components by maximum entropy principle
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Causal relation extraction using cue phrase and lexical pair probabilities
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Learning to filter junk e-mail from positive and unlabeled examples
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
A comparative study on the use of labeled and unlabeled data for large margin classifiers
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Parallel text categorization for multi-dimensional data
PDCAT'04 Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies
A multi-layer Naïve bayes model for approximate identity matching
ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
A cross-corpus study of unsupervised subjectivity identification based on calibrated EM
WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Identifying Web Spam with the Wisdom of the Crowds
ACM Transactions on the Web (TWEB)
Leave-one-out manifold regularization
Expert Systems with Applications: An International Journal
Learning to separate text content and style for classification
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Automated retraining methods for document classification and their parameter tuning
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
Classifying web data in directory structures
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Application of semi-supervised learning to evaluative expression classification
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Training conditional random fields with unlabeled data and limited number of labeled examples
ICMLC'05 Proceedings of the 4th international conference on Advances in Machine Learning and Cybernetics
Semi-supervised dynamic counter propagation network
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Sample-based software defect prediction with active and semi-supervised learning
Automated Software Engineering
Learning Instance Weighted Naive Bayes from labeled and unlabeled data
Journal of Intelligent Information Systems
Sampling of virtual examples to improve classification accuracy for nominal attribute data
RSCTC'06 Proceedings of the 5th international conference on Rough Sets and Current Trends in Computing
A new approach for semi-supervised online news classification
HSI'05 Proceedings of the 3rd international conference on Human Society@Internet: web and Communication Technologies and Internet-Related Social Issues
Learning from positive and unlabeled examples with different data distributions
ECML'05 Proceedings of the 16th European conference on Machine Learning
On discriminative joint density modeling
ECML'05 Proceedings of the 16th European conference on Machine Learning
Semi-supervised multiple classifier systems: background and research directions
MCS'05 Proceedings of the 6th international conference on Multiple Classifier Systems
Exploiting class hierarchies for knowledge transfer in hyperspectral data
MCS'05 Proceedings of the 6th international conference on Multiple Classifier Systems
Social network and spatial semantics for real-world information service
MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
A PAC-Style model for learning from labeled and unlabeled data
COLT'05 Proceedings of the 18th annual conference on Learning Theory
User-Interest-Based document filtering via semi-supervised clustering
ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
SETRED: self-training with editing
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Pulse: mining customer opinions from free text
IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis
Topics modeling based on selective Zipf distribution
Expert Systems with Applications: An International Journal
AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Text classification using web corpora and EM algorithms
AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
Cost-sensitive classification with inadequate labeled data
Information Systems
Robust Video Content Analysis via Transductive Learning
ACM Transactions on Intelligent Systems and Technology (TIST)
Class normalization in centroid-based text categorization
Information Sciences: an International Journal
Enhancing text classification by information embedded in the test set
CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
SimSpectrum: a similarity based spectral clustering approach to generate a tag cloud
ICWE'11 Proceedings of the 11th international conference on Current Trends in Web Engineering
Iterative refinement of HMM and HCRF for sequence classification
PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Finding experts in tag based knowledge sharing communities
KSEM'11 Proceedings of the 5th international conference on Knowledge Science, Engineering and Management
Towards mobile intelligence: Learning from GPS history data for collaborative recommendation
Artificial Intelligence
Pareto charting using multifield freestyle text data applied to Toyota Camry user reviews
Applied Stochastic Models in Business and Industry
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
On supervised mining of dynamic content-based networks1
Statistical Analysis and Data Mining
Sentiment detection with auxiliary data
Information Retrieval
HySAD: a semi-supervised hybrid shilling attack detector for trustworthy product recommendation
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A term association translation model for naive bayes text classification
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Towards personalized context-aware recommendation by mining context logs through topic models
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Training pool selection for semi-supervised learning
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Building high-performance classifiers using positive and unlabeled examples for text classification
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
Change Detection of Remote Sensing Images with Semi-supervised Multilayer Perceptron
Fundamenta Informaticae
Semi-supervised vehicle recognition: an approximate region constrained approach
RSKT'12 Proceedings of the 7th international conference on Rough Sets and Knowledge Technology
A new relational Tri-training system with adaptive data editing for inductive logic programming
Knowledge-Based Systems
Behavioral factors in interactive training of text classifiers
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Unified expectation maximization
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Cross-lingual mixture model for sentiment classification
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Bootstrapping via graph propagation
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Latent semantic transliteration using dirichlet mixture
NEWS '12 Proceedings of the 4th Named Entity Workshop
If you are happy and you know it... tweet
Proceedings of the 21st ACM international conference on Information and knowledge management
From sBoW to dCoT marginalized encoders for text representation
Proceedings of the 21st ACM international conference on Information and knowledge management
Sentiment classification with supervised sequence embedding
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Bidirectional semi-supervised learning with graphs
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Concept comparison engines: A new frontier of search
Decision Support Systems
Sentiment analysis by augmenting expectation maximisation with lexical knowledge
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
High performance query expansion using adaptive co-training
Information Processing and Management: an International Journal
Sampling the Web as Training Data for Text Classification
International Journal of Digital Library Systems
CSS'12 Proceedings of the 4th international conference on Cyberspace Safety and Security
Context-Aware Expert Finding in Tag Based Knowledge Sharing Communities
International Journal of Knowledge and Systems Science
Audience targeting by B-to-B advertisement classification: A neural network approach
Expert Systems with Applications: An International Journal
Clustering tagged documents with labeled and unlabeled documents
Information Processing and Management: an International Journal
A Comparative Study of Cross-Lingual Sentiment Classification
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Aggregation pheromone metaphor for semi-supervised classification
Pattern Recognition
Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans
Fuzzy Sets and Systems
Semi-supervised learning with density-ratio estimation
Machine Learning
Chinese terminology extraction using EM-Based transfer learning method
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Stock price prediction based on a complex interrelation network of economic factors
Engineering Applications of Artificial Intelligence
On collocations and topic models
ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 2
Emerging topic detection for organizations from microblogs
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Document classification by topic labeling
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Prediction of movement direction in crude oil prices based on semi-supervised learning
Decision Support Systems
Cross-lingual web spam classification
Proceedings of the 22nd international conference on World Wide Web companion
Researcher homepage classification using unlabeled data
Proceedings of the 22nd international conference on World Wide Web
A biterm topic model for short texts
Proceedings of the 22nd international conference on World Wide Web
Machine learning for interactive systems and robots: a brief introduction
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Building a second opinion: learning cross-company data
Proceedings of the 9th International Conference on Predictive Models in Software Engineering
Commonsense-based topic modeling
Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining
Combining crowd-generated media and personal data: semi-supervised learning for context recognition
Proceedings of the 1st ACM international workshop on Personal data meets distributed multimedia
A second order cone programming approach for semi-supervised learning
Pattern Recognition
Improving semi-supervised text classification by using wikipedia knowledge
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
A purity measure based transductive learning algorithm
ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
ATT: analyzing temporal dynamics of topics and authors in social media
Proceedings of the 3rd International Web Science Conference
Computer Vision and Image Understanding
What's buzzing in the blizzard of buzz? Automotive component isolation in social media postings
Decision Support Systems
A multi-manifold semi-supervised Gaussian mixture model for pattern classification
Pattern Recognition Letters
Towards scalable activity recognition: adapting zero-effort crowdsourced acoustic models
Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia
Proceedings of the Fourth Symposium on Information and Communication Technology
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
Future Generation Computer Systems
Pattern classification and clustering: A review of partially supervised learning approaches
Pattern Recognition Letters
Joint semi-supervised learning of Hidden Conditional Random Fields and Hidden Markov Models
Pattern Recognition Letters
A study of supervised term weighting scheme for sentiment analysis
Expert Systems with Applications: An International Journal
Identity matching and information acquisition: Estimation of optimal threshold parameters
Decision Support Systems
CALA: An unsupervised URL-based web page classification system
Knowledge-Based Systems
Classifying evolving data streams with partially labeled data
Intelligent Data Analysis
A new fuzzy rule-based classification system for word sense disambiguation
Intelligent Data Analysis
Improving multi-view semi-supervised learning with agreement-based sampling
Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data
Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms
Intelligent Data Analysis
Enhancing K-Means using class labels
Intelligent Data Analysis
Hi-index | 0.02 |
This paper shows that the accuracy of learned textclassifiers can be improved by augmenting a small number of labeledtraining documents with a large pool of unlabeled documents. This isimportant because in many text classification problems obtainingtraining labels is expensive, while large quantities of unlabeleddocuments are readily available.We introduce an algorithm for learning from labeled and unlabeleddocuments based on the combination of Expectation-Maximization (EM)and a naive Bayes classifier. The algorithm first trains a classifierusing the available labeled documents, and probabilistically labelsthe unlabeled documents. It then trains a new classifier using thelabels for all the documents, and iterates to convergence. This basicEM procedure works well when the data conform to the generativeassumptions of the model. However these assumptions are oftenviolated in practice, and poor performance can result. We present twoextensions to the algorithm that improve classification accuracy underthese conditions: (1) a weighting factor to modulate the contributionof the unlabeled data, and (2) the use of multiple mixture componentsper class. Experimental results, obtained using text from threedifferent real-world tasks, show that the use of unlabeled datareduces classification error by up to 30%.