In this paper, we propose a novel unsupervised approach to query segmentation, an important task in Web search. We use a generative query model to recover the underlying concepts that make up a query's segmented form. The model's parameters are estimated with an expectation-maximization (EM) algorithm that optimizes a minimum description length objective on a partial corpus specific to the query. To augment this unsupervised learning, we incorporate evidence from Wikipedia. Experiments show that our approach substantially outperforms the traditional approach based on mutual information and produces results comparable to those of a supervised method. In particular, the basic generative language model yields a 7.4% improvement over the mutual-information-based method, measured by segment F1 on the Intersection test set. EM optimization improves performance by a further 14.3%, and additional knowledge from Wikipedia adds another 24.3%, for a total improvement of 46% (from 0.530 to 0.774).
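The reported gains can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in the abstract. The variable names are illustrative, and the reading that each percentage is measured relative to the mutual-information baseline (so the gains add rather than compound) is an assumption, which the code tests against the alternative.

```python
# Figures quoted in the abstract (segment F1 on the Intersection test set).
baseline_f1 = 0.530  # mutual-information baseline
final_f1 = 0.774     # full model with EM and Wikipedia evidence

# Relative gains quoted for each component, in percent.
gains_pct = {
    "generative model": 7.4,
    "EM optimization": 14.3,
    "Wikipedia evidence": 24.3,
}

# Assumed reading: every gain is relative to the baseline, so gains add.
total_pct = sum(gains_pct.values())
additive_f1 = baseline_f1 * (1 + total_pct / 100)

# Alternative reading: each gain compounds on the previous step.
compounded_f1 = baseline_f1
for gain in gains_pct.values():
    compounded_f1 *= 1 + gain / 100

print(f"total gain: {total_pct:.1f}%")
print(f"additive reading:   F1 = {additive_f1:.3f}")
print(f"compounded reading: F1 = {compounded_f1:.3f}")
print(f"reported final F1:  {final_f1:.3f}")
```

Under the additive reading the result matches the reported final score to three decimal places, while the compounded reading overshoots it. This suggests that the three percentages are all expressed relative to the baseline score of 0.530.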