IEEE Transactions on Pattern Analysis and Machine Intelligence
Statistical methods for speech recognition
Statistical methods for speech recognition
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
Computational Linguistics
Machine translation of very close languages
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
String transformation learning
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Proper name translation in cross-language information retrieval
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Alignment of multiple languages for historical comparison
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Aligning sentences in bilingual corpora using lexical information
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
An improved error model for noisy channel spelling correction
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Determining recurrent sound correspondences by inducing translation models
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Cognates can improve statistical translation models
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Learning a translation lexicon from monolingual corpora
ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
Alignment and extraction of bilingual legal terminology from context profiles
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Inducing translation lexicons via diverse similarity measures and bridge languages
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Statistical machine translation by parsing
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Inducing a multilingual dictionary from a parallel multitext in related languages
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Methods for extracting and classifying pairs of cognates and false friends
Machine Translation
English-Arabic proper-noun transliteration-pairs creation
Journal of the American Society for Information Science and Technology
Approximate String Matching Techniques for Effective CLIR Among Indian Languages
WILF '07 Proceedings of the 7th international workshop on Fuzzy Logic and Applications: Applications of Fuzzy Sets Theory
Automatic prediction of cognate orthography using support vector machines
ACL '07 Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop
Adaptive string distance measures for bilingual dialect lexicon induction
ACL '07 Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Tagging Portuguese with a Spanish tagger using cognates
CrossLangInduction '06 Proceedings of the International Workshop on Cross-Language Knowledge Induction
Cross-lingual bootstrapping of semantic lexicons: the case of FrameNet
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Evaluation of several phonetic similarity algorithms on the task of cognate identification
LD '06 Proceedings of the Workshop on Linguistic Distances
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Computing word similarity and identifying cognates with pair hidden Markov models
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Panlingual lexical translation via probabilistic inference
Artificial Intelligence
Identifying complex sound correspondences in bilingual wordlists
CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Finding cognate groups using phylogenies
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Word-based dialect identification with georeferenced rules
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Unsupervised multilingual learning
Unsupervised multilingual learning
DIALECTS '11 Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Journal of Artificial Intelligence Research
Recovering dialect geography from an unaligned comparable corpus
EACL 2012 Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
Regularized interlingual projections: evaluation on multilingual transliteration
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Hi-index | 0.00 |
This paper presents a method for inducing translation lexicons based on transduction models of cognate pairs via bridge languages. Bilingual lexicons within languages families are induced using probabilistic string edit distance models. Translation lexicons for arbitrary distant language pairs are then generated by a combination of these intra-family translation models and one or more cross-family on-line dictionaries. Up to 95% exact match accuracy is achieved on the target vocabulary (30-68% of inter-family test pairs). Thus substantial portions of translation lexicons can be generated accurately for languages where no bilingual dictionary or parallel corpora may exist.