Exploring distributional similarity based models for query spelling correction

  • Authors:
  • Mu Li;Yang Zhang;Muhua Zhu;Ming Zhou

  • Affiliations:
  • Microsoft Research Asia, Haidian District, Beijing, China;Tianjin University, Tianjin, China;Northeastern University, Shenyang, Liaoning, China;Microsoft Research Asia, Haidian District, Beijing, China

  • Venue:
  • ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

A query speller is crucial to search engine in improving web search relevance. This paper describes novel methods for use of distributional similarity estimated from query logs in learning improved query spelling correction models. The key to our methods is the property of distributional similarity between two terms: it is high between a frequently occurring misspelling and its correction, and low between two irrelevant terms only with similar spellings. We present two models that are able to take advantage of this property. Experimental results demonstrate that the distributional similarity based models can significantly outperform their baseline systems in the web query spelling correction task.