A statistical approach to machine translation
Computational Linguistics
Word sense disambiguation using a second language monolingual corpus
Computational Linguistics
A cooccurrence-based thesaurus and two applications to information retrieval
Information Processing and Management: an International Journal
SVDPACKC (Version 1.0) User''s Guide
SVDPACKC (Version 1.0) User''s Guide
Word-sense disambiguation using statistical methods
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Extraction of lexical translations from non-aligned corpora
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Unsupervised word sense disambiguation using bilingual comparable corpora
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Iterative translation disambiguation for cross-language information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
A term-list is a list of content words that characterize a consistent text or a concept. This paper presents a new method for translating a term-list by using a corpus in the target language. The method first retrieves alternative translations for each input word from a bilingual dictionary. It then determines the most 'coherent' combination of alternative translations, where the coherence of a set of words is defined as the proximity among multi-dimensional vectors produced from the words on the basis of co-occurrence statistics. The method was applied to term-lists extracted from newspaper articles and achieved 81% translation accuracy for ambiguous words (i.e., words with multiple translations).