This paper addresses Named Entity Mining (NEM), the task of mining knowledge about named entities such as movies, games, and books from very large amounts of data. NEM is potentially useful in many applications, including web search, online advertising, and recommender systems. The task poses three challenges: finding a suitable data source, coping with the ambiguity of named entity classes, and incorporating the necessary human supervision into the mining process. This paper proposes conducting NEM on click-through data collected at a web search engine, employing a topic model that generates the click-through data, and learning the topic model with weak supervision from humans. Specifically, the approach characterizes each named entity by its associated queries and URLs in the click-through data. It uses the topic model to resolve the ambiguity of named entity classes by representing the classes as topics. It employs a method, referred to as Weakly Supervised Latent Dirichlet Allocation (WS-LDA), to accurately learn the topic model from partially labeled named entities. Experiments on large-scale click-through data containing over 1.5 billion query-URL pairs show that the proposed approach can conduct very accurate NEM and significantly outperforms the baseline.
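The idea of characterizing each named entity by its associated queries and URLs can be illustrated with a small sketch. It is not the paper's implementation: the function name `build_entity_profiles`, the `#` placeholder that marks where the entity occurred in a query, the whole-token matching rule, and the example log entries are all assumptions made for illustration. The resulting per-entity counts are the kind of input a topic model such as WS-LDA could then be trained on; the model itself is not shown.

```python
from collections import Counter


def build_entity_profiles(click_log, entities):
    """Collect, for each entity, counts of query contexts and clicked URLs.

    click_log: iterable of (query, url) pairs.
    entities:  iterable of entity names (hypothetical seed list).
    A "context" is the query with the entity replaced by "#"
    (an illustrative convention, assumed here).
    """
    profiles = {e: {"contexts": Counter(), "urls": Counter()} for e in entities}
    for query, url in click_log:
        tokens = query.lower().split()
        for entity in entities:
            ent_tokens = entity.lower().split()
            n = len(ent_tokens)
            # Look for the entity as a contiguous run of whole tokens.
            for i in range(len(tokens) - n + 1):
                if tokens[i:i + n] == ent_tokens:
                    context = " ".join(tokens[:i] + ["#"] + tokens[i + n:])
                    profiles[entity]["contexts"][context] += 1
                    profiles[entity]["urls"][url] += 1
                    break  # count each (query, url) pair once per entity
    return profiles


if __name__ == "__main__":
    # Made-up log entries, for illustration only.
    log = [
        ("harry potter trailer", "movies.example.com"),
        ("harry potter book", "books.example.com"),
        ("halo cheats", "games.example.com"),
    ]
    for name, profile in build_entity_profiles(log, ["harry potter", "halo"]).items():
        print(name, dict(profile["contexts"]), dict(profile["urls"]))
```

An entity such as "harry potter" ends up with contexts and URLs pointing to more than one class (movie and book), which is the kind of class ambiguity the abstract says the topic model is meant to resolve.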