Automatic Expansion of Chinese Abbreviations by Web Mining

  • Authors:
  • Hui Liu;Yuquan Chen;Lei Liu

  • Affiliations:
  • Shanghai Institute of Foreign Trade, China;Shanghai Jiao Tong University, China;Shanghai Jiao Tong University, China

  • Venue:
  • AICI '09 Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abbreviations are common in everyday Chinese. For applications like information retrieval, we want not only to recognize the abbreviations, but also to know what they stand for. To tackle the emergence of all kinds of new abbreviations, this paper proposes a novel method that expands an abbreviation to its full name employing the Web as the main information source. Snippets containing full names of an abbreviation are obtained through a search engine by learned "help words". Then the snippets are examined using linguistic heuristics to generate a list of candidates. We select the optimal candidate according to a kNN-based ranking mechanism. Experiment shows that this method achieves satisfactory results.