Information retrieval using a singular value decomposition model of latent semantic structure
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing
Foundations of statistical natural language processing
A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Reference reconciliation in complex information spaces
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Generating query substitutions
Proceedings of the 15th international conference on World Wide Web
Dryad: distributed data-parallel programs from sequential building blocks
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Random walks on the click graph
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Scalable ad-hoc entity extraction from text collections
Proceedings of the VLDB Endowment
Query suggestion using hitting time
Proceedings of the 17th ACM conference on Information and knowledge management
Exploiting web search to generate synonyms for entities
Proceedings of the 18th international conference on World wide web
Mining document collections to facilitate accurate approximate entity matching
Proceedings of the VLDB Endowment
Web-scale distributional similarity and entity set expansion
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Near-synonym lexical choice in latent semantic space
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Unsupervised identification of synonymous query intent templates for attribute intents
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Discovering attribute and entity synonyms for knowledge integration and semantic web search
Proceedings of the 3rd International Workshop on Semantic Search Over the Web
Hi-index | 0.00 |
Entity synonyms are critical for many applications like information retrieval and named entity recognition in documents. The current trend is to automatically discover entity synonyms using statistical techniques on web data. Prior techniques suffer from several limitations like click log sparsity and inability to distinguish between entities of different concept classes. In this paper, we propose a general framework for robustly discovering entity synonym with two novel similarity functions that overcome the limitations of prior techniques. We develop efficient and scalable techniques leveraging the MapReduce framework to discover synonyms at large scale. To handle long entity names with extraneous tokens, we propose techniques to effectively map long entity names to short queries in query log. Our experiments on real data from different entity domains demonstrate the superior quality of our synonyms as well as the efficiency of our algorithms. The entity synonyms produced by our system is in production in Bing Shopping and Video search, with experiments showing the significance it brings in improving search experience.