Phonetic models for generating spelling variants

  • Authors:
  • Rahul Bhagat;Eduard Hovy

  • Affiliations:
  • Information Sciences Institute, University of Southern California, Marina Del Rey, CA;Information Sciences Institute, University of Southern California, Marina Del Rey, CA

  • Venue:
  • IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Proper names, whether English or non-English, have several different spellings when transliterated from a non-English source language into English. Knowing the different variations can significantly improve the results of name-searches on various source texts, especially when recall is important. In this paper we propose two novel phonetic models to generate numerous candidate variant spellings of a name. Our methods show threefold improvement over the baseline and generate four times as many good name variants compared to a human while maintaining a respectable precision of 0.68.