ICE-TEA: in-context expansion and translation of English abbreviations

  • Authors:
  • Waleed Ammar;Kareem Darwish;Ali El Kahki;Khaled Hafez

  • Affiliations:
  • Cairo Microsoft Innovation Center, Microsoft, Maadi, Cairo, Egypt;Cairo Microsoft Innovation Center, Microsoft, Maadi, Cairo, Egypt;Cairo Microsoft Innovation Center, Microsoft, Maadi, Cairo, Egypt;IBM Technology Development Center in Cairo and Cairo Microsoft Innovation Center, Microsoft, Maadi, Cairo, Egypt

  • Venue:
  • CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The wide use of abbreviations in modern texts poses interesting challenges and opportunities in the field of NLP. In addition to their dynamic nature, abbreviations are highly polysemous with respect to regular words. Technologies that exhibit some level of language understanding may be adversely impacted by the presence of abbreviations. This paper addresses two related problems: (1) expansion of abbreviations given a context, and (2) translation of sentences with abbreviations. First, an efficient retrieval-based method for English abbreviation expansion is presented. Then, a hybrid system is used to pick among simple abbreviation-translation methods. The hybrid system achieves an improvement of 1.48 BLEU points over the baseline MT system, using sentences that contain abbreviations as a test set.