Machine Learning
Learning to extract symbolic knowledge from the World Wide Web
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Hierarchical classification of Web content
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A classifier for semi-structured documents
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval
DEADLINER: building a new niche search engine
Proceedings of the ninth international conference on Information and knowledge management
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Improving Category Specific Web Search by Learning Query Modifications
SAINT '01 Proceedings of the 2001 Symposium on Applications and the Internet (SAINT 2001)
Reducing multiclass to binary: a unifying approach for margin classifiers
The Journal of Machine Learning Research
One-class svms for document classification
The Journal of Machine Learning Research
Uniform object generation for optimizing one-class classifiers
The Journal of Machine Learning Research
Database research at the University of Illinois at Urbana-Champaign
ACM SIGMOD Record
Data Mining for Web Intelligence
Computer
General MC: Estimating Boundary of Positive Class from Small Positive Data
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Building Text Classifiers Using Positive and Unlabeled Examples
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
PEBL: Web Page Classification without Negative Examples
IEEE Transactions on Knowledge and Data Engineering
Cross-training: learning probabilistic mappings between topics
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Classifying large data sets using SVMs with hierarchical clusters
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Text classification from positive and unlabeled documents
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Negative pseudo-relevance feedback in content-based video retrieval
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Dealing with different distributions in learning from
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Text Classification without Labeled Negative Documents
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Making SVMs Scalable to Large Data Sets using Hierarchical Cluster Indexing
Data Mining and Knowledge Discovery
Text Classification without Negative Examples Revisit
IEEE Transactions on Knowledge and Data Engineering
Single-Class Classification with Mapping Convergence
Machine Learning
Information Processing and Management: an International Journal
Authors vs. readers: a comparative study of document metadata and content in the www
Proceedings of the 2007 ACM symposium on Document engineering
A two-step classification approach to unsupervised record linkage
AusDM '07 Proceedings of the sixth Australasian conference on Data mining and analytics - Volume 70
Efficient algorithms for incremental Web log mining with dynamic thresholds
The VLDB Journal — The International Journal on Very Large Data Bases
SVM based adaptive learning method for text classification from positive and unlabeled documents
Knowledge and Information Systems
PE-PUC: A Graph Based PU-Learning Approach for Text Classification
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Learning to Classify Documents with Only a Small Positive Training Set
ECML '07 Proceedings of the 18th European conference on Machine Learning
Using k-Interactive Measure in Optimization-Based Data Mining
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Classification techniques with minimal labelling effort and application to medical reports
International Journal of Data Mining and Bioinformatics
Building a Text Classifier by a Keyword and Unlabeled Documents
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Named entity mining from click-through data using weakly supervised latent dirichlet allocation
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Support vector machines for query-focused summarization trained and evaluated on pyramid data
ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Text classification by labeling words
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Partially supervised sense disambiguation by learning sense number from tagged and untagged corpora
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Building a Text Classifier by a Keyword and Wikipedia Knowledge
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Learning to identify unexpected instances in the test set
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
SVMC: single-class classification with support vector machines
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning to classify texts using positive and unlabeled data
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Automated ontology instantiation from tabular web sources-The AllRight system
Web Semantics: Science, Services and Agents on the World Wide Web
Content based image retrieval using unclean positive examples
IEEE Transactions on Image Processing
Sentiment analysis of Chinese documents: From sentence to document level
Journal of the American Society for Information Science and Technology
Researcher affiliation extraction from homepages
NLPIR4DL '09 Proceedings of the 2009 Workshop on Text and Citation Analysis for Scholarly Digital Libraries
MCS'03 Proceedings of the 4th international conference on Multiple classifier systems
Mining rough association from text documents for web information gathering
Transactions on rough sets VII
A framework for modeling positive class expansion with single snapshot
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Automatic training example selection for scalable unsupervised record linkage
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Distributional similarity vs. PU learning for entity set expansion
ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Negative training data can be harmful to text classification
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Semi-supervised learning from only positive and unlabeled data using entropy
WAIM'10 Proceedings of the 11th international conference on Web-age information management
A new smooth support vector machine
AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
A survey of recent trends in one class classification
AICS'09 Proceedings of the 20th Irish conference on Artificial intelligence and cognitive science
Journal of Intelligent Information Systems
Editorial: Classifying text streams by keywords using classifier ensemble
Data & Knowledge Engineering
Bayesian classifiers for positive unlabeled learning
WAIM'11 Proceedings of the 12th international conference on Web-age information management
On positive and unlabeled learning for text classification
TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Automatic Moderation of Online Discussion Sites
International Journal of Electronic Commerce
A biological text retrieval system based on background knowledge and user feedback
VDMB'06 Proceedings of the First international conference on Data Mining and Bioinformatics
Extracting initial and reliable negative documents to enhance classification performance
KDLL'06 Proceedings of the 2006 international conference on Knowledge Discovery in Life Science Literature
Spying out accurate user preferences for search engine adaptation
WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
Learning to filter junk e-mail from positive and unlabeled examples
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
A new approach for semi-supervised online news classification
HSI'05 Proceedings of the 3rd international conference on Human Society@Internet: web and Communication Technologies and Internet-Related Social Issues
Learning from positive and unlabeled examples with different data distributions
ECML'05 Proceedings of the 16th European conference on Machine Learning
Partially supervised classification – based on weighted unlabeled samples support vector machine
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
A new PU learning algorithm for text classification
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Protein-Protein interactions classification from text via local learning with class priors
NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
Accurate measurements of pointing performance from in situ observations
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Estimate unlabeled-data-distribution for semi-supervised PU learning
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Building high-performance classifiers using positive and unlabeled examples for text classification
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
Text classification with relatively small positive documents and unlabeled data
Proceedings of the 21st ACM international conference on Information and knowledge management
Privacy-Preserving speaker authentication
ISC'12 Proceedings of the 15th international conference on Information Security
Learning from positive and unlabelled examples using maximum margin clustering
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Heat pump detection from coarse grained smart meter data with positive and unlabeled learning
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning from data streams with only positive and unlabeled data
Journal of Intelligent Information Systems
What's the deal?: identifying online bargains
AWC '13 Proceedings of the First Australasian Web Conference - Volume 144
Hi-index | 0.00 |
Web page classification is one of the essential techniques for Web mining. Specifically, classifying Web pages of a user-interesting class is the first step of mining interesting information from the Web. However, constructing a classifier for an interesting class requires laborious pre-processing such as collecting positive and negative training examples. For instance, in order to construct a "homepage" classifier, one needs to collect a sample of homepages (positive examples) and a sample of non-homepages (negative examples). In particular, collecting negative training examples requires arduous work and special caution to avoid biasing them. We introduce in this paper the Positive Example Based Learning (PEBL) framework for Web page classification which eliminates the need for manually collecting negative training examples in pre-processing. We present an algorithm called Mapping-Convergence (M-C) that achieves classification accuracy (with positive and unlabeled data) as high as that of traditional SVM (with positive and negative data). Our experiments show that when the M-C algorithm uses the same amount of positive examples as that of traditional SVM, the M-C algorithm performs as well as traditional SVM.