Translating collocations for bilingual lexicons: a statistical approach
Computational Linguistics
Learning phonetic similarity for matching named entity translations and mining new translations
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A Method for Recognizing Noisy Romanized Japanese Words in Learner English
IEICE - Transactions on Information and Systems
A lemmatization method for Mongolian and its application to indexing for information retrieval
Information Processing and Management: an International Journal
Recognizing noisy romanized Japanese words in learner English
EANL '08 Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications
Hi-index | 0.00 |
This paper proposes methods for extracting loanwords from Cyrillic Mongolian corpora and producing a Japanese-Mongolian bilingual dictionary. We extract loanwords from Mongolian corpora using our own handcrafted rules. To complement the rule-based extraction, we also extract words in Mongolian corpora that are phonetically similar to Japanese Katakana words as loanwords. In addition, we correspond the extracted loanwords to Japanese words and produce a bilingual dictionary. We propose a stemming method for Mongolian to extract loanwords correctly. We verify the effectiveness of our methods experimentally.