Extracting loanwords from Mongolian corpora and producing a Japanese-Mongolian bilingual dictionary

  • Authors: Badam-Osor Khaltar; Atsushi Fujii; Tetsuya Ishikawa

  • Affiliations: University of Tsukuba, Kasuga Tsukuba, Japan; University of Tsukuba, Kasuga Tsukuba, Japan; The University of Tokyo, Bunkyo-ku, Tokyo, Japan

  • Venue: ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics
  • Year: 2006

Abstract

This paper proposes methods for extracting loanwords from Cyrillic Mongolian corpora and for producing a Japanese-Mongolian bilingual dictionary. We extract loanwords from Mongolian corpora using handcrafted rules. To complement this rule-based extraction, we also extract, as loanwords, words in the Mongolian corpora that are phonetically similar to Japanese Katakana words. In addition, we match the extracted loanwords to their corresponding Japanese words to produce a bilingual dictionary. To extract loanwords correctly, we also propose a stemming method for Mongolian. We verify the effectiveness of our methods experimentally.
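
The abstract does not say how phonetic similarity between a Mongolian word and a Katakana word is measured. As a rough illustration only, and not the authors' method, the sketch below assumes both words are romanized with small lookup tables and then compared by normalized edit distance. The tables `CYRILLIC_TO_LATIN` and `KATAKANA_TO_LATIN`, the function names, and the example words are all hypothetical and cover only a handful of characters.

```python
# Illustrative sketch: score how phonetically close a Cyrillic Mongolian word
# is to a Japanese Katakana word. The romanization tables are tiny, hypothetical
# samples (assumptions for this example), not a complete transliteration scheme.

CYRILLIC_TO_LATIN = {
    "а": "a", "д": "d", "е": "e", "и": "i", "к": "k",
    "м": "m", "о": "o", "р": "r", "т": "t", "у": "u",
}

KATAKANA_TO_LATIN = {
    "ラ": "ra", "ジ": "ji", "オ": "o", "カ": "ka", "メ": "me",
    "ー": "",  # long-vowel mark is dropped in this simplified scheme
}


def romanize(word: str, table: dict) -> str:
    """Map each character through the table; unknown characters are skipped."""
    return "".join(table.get(ch, "") for ch in word.lower())


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phonetic_similarity(mongolian: str, katakana: str) -> float:
    """Return a score in [0, 1]; 1.0 means identical romanized forms."""
    a = romanize(mongolian, CYRILLIC_TO_LATIN)
    b = romanize(katakana, KATAKANA_TO_LATIN)
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return 1.0 - edit_distance(a, b) / longest


if __name__ == "__main__":
    for mn, ja in [("радио", "ラジオ"), ("камер", "カメラ"), ("радио", "カメラ")]:
        print(mn, ja, round(phonetic_similarity(mn, ja), 3))
```

Under these assumptions, a candidate pair would be kept as a loanword match only when its score exceeds some threshold. Both the threshold and the romanization scheme would need tuning on real data.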