An empirical study on language model adaptation

  • Authors:
  • Jianfeng Gao;Hisami Suzuki;Wei Yuan

  • Affiliations:
  • Suzuki Microsoft Research, Redmond, WA;Suzuki Microsoft Research, Redmond, WA;Shanghai Jiao Tong University, Shanghai, China

  • Venue:
  • ACM Transactions on Asian Language Information Processing (TALIP)
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article presents an empirical study of four techniques for adapting language models, including a maximum a posteriori (MAP) method and three discriminative training models, in the application of Japanese Kana-Kanji conversion. We compare the performance of these methods from various angles by adapting the baseline model to four adaptation domains. In particular, we attempt to interpret the results in terms of the character error rate (CER) by correlating them with the characteristics of the adaptation domain, measured by using the information-theoretic notion of cross entropy. We show that such a metric correlates well with the CER performance of the adaptation methods, and also show that the discriminative methods are not only superior to a MAP-based method in achieving larger CER reduction, but also in having fewer side effects and being more robust against the similarity between background and adaptation domains.