Practical translation pattern acquisition from combined language resources

  • Authors:
  • Mihoko Kitamura;Yuji Matsumoto

  • Affiliations:
  • Graduate School of Information Science, Nara Institute of Science and Technology, Nara, Japan;Corporate Research & Development Center, Oki Electric Industry Co., Ltd, Osaka, Japan

  • Venue:
  • IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic extraction of translation patterns from parallel corpora is an efficient way to automatically develop translation dictionaries, and therefore various approaches have been proposed. This paper presents a practical translation pattern extraction method that greedily extracts translation patterns based on co-occurrence of English and Japanese word sequences, which can also be effectively combined with manual confirmation and linguistic resources, such as chunking information and translation dictionaries. Use of these extra linguistic resources enables it to acquire results of higher precision and broader coverage regardless of the amount of documents.