Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Table extraction using conditional random fields
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Unsupervised word sense disambiguation rivaling supervised methods
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Understanding the Yarowsky Algorithm
Computational Linguistics
Unsupervised learning of field segmentation models for information extraction
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Semi-supervised conditional random fields for improved sequence segmentation and labeling
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Webpage understanding: an integrated approach
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Hidden Conditional Random Fields
IEEE Transactions on Pattern Analysis and Machine Intelligence
An unsupervised framework for extracting and normalizing product attributes from multiple web sites
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Learning query intent from regularized click graphs
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Learning from labeled features using generalized expectation criteria
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
The linguistic structure of English web-search queries
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Semi-supervised learning of semantic classes for query understanding: from the web and for the web
Proceedings of the 18th ACM conference on Information and knowledge management
Retrieval experiments using pseudo-desktop collections
Proceedings of the 18th ACM conference on Information and knowledge management
On the use of virtual evidence in conditional random fields
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Unsupervised query segmentation using click data: preliminary results
Proceedings of the 19th international conference on World wide web
Structured annotations of web queries
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Clicked phrase document expansion for sponsored search ad retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Minimally-supervised extraction of entities from text advertisements
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Understanding the semantic structure of noun phrase queries
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Probabilistic first pass retrieval for search advertising: from theory to practice
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Improving verbose queries using subset distribution
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Alignment of short length parallel corpora with an application to web search
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Result enrichment in commerce search using browse trails
Proceedings of the fourth ACM international conference on Web search and data mining
Improving recommendation for long-tail queries via templates
Proceedings of the 20th international conference on World wide web
Facet discovery for structured web search: a query-log mining approach
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
The sum of its parts: reducing sparsity in click estimation with query segments
Information Retrieval
Mining query structure from click data: a case study of product queries
Proceedings of the 20th ACM international conference on Information and knowledge management
Sequence clustering and labeling for unsupervised query intent discovery
Proceedings of the fifth ACM international conference on Web search and data mining
A field relevance model for structured document retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Confidence-aware graph regularization with heterogeneous pairwise features
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
An analysis of free-text queries for a multi-field web form
Proceedings of the 4th Information Interaction in Context Symposium
Labeling queries for a people search engine
NLDB'12 Proceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems
Mining search query logs for spoken language understanding
SDCTD '12 NAACL-HLT Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data
Using search-logs to improve query tagging
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Measuring website similarity using an entity-aware click graph
Proceedings of the 21st ACM international conference on Information and knowledge management
Structured query reformulations in commerce search
Proceedings of the 21st ACM international conference on Information and knowledge management
A probabilistic mixture model for mining and analyzing product search log
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Unsupervised identification of synonymous query intent templates for attribute intents
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Crowdsourcing-assisted query structure interpretation
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Supporting keyword search in product database: a probabilistic approach
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
When search is against structured documents, it is beneficial to extract information from user queries in a format that is consistent with the backend data structure. As one step toward this goal, we study the problem of query tagging which is to assign each query term to a pre-defined category. Our problem could be approached by learning a conditional random field (CRF) model (or other statistical models) in a supervised fashion, but this would require substantial human-annotation effort. In this work, we focus on a semi-supervised learning method for CRFs that utilizes two data sources: (1) a small amount of manually-labeled queries, and (2) a large amount of queries in which some word tokens have derived labels, i.e., label information automatically obtained from additional resources. We present two principled ways of encoding derived label information in a CRF model. Such information is viewed as hard evidence in one setting and as soft evidence in the other. In addition to the general methodology of how to use derived labels in semi-supervised CRFs, we also present a practical method on how to obtain them by leveraging user click data and an in-domain database that contains structured documents. Evaluation on product search queries shows the effectiveness of our approach in improving tagging accuracies.