It is challenging to translate names and technical terms across languages with different alphabets and sound inventories. These items are commonly transliterated, i.e., replaced with approximate phonetic equivalents. For example, "computer" in English comes out as コンピューター (konpyuutaa) in Japanese. Translating such items from Japanese back to English is even more challenging, and of practical interest, because transliterated items make up the bulk of text phrases not found in bilingual dictionaries. We describe and evaluate a method for performing backwards transliteration by machine. The method uses a generative model that incorporates several distinct stages of the transliteration process.
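The idea of recovering an English source word through a generative model can be illustrated with a minimal sketch. It is not the method described above: the abstract does not name the individual stages, so the sketch collapses them into a single hypothetical table `CHANNEL` giving P(romanized Japanese | English word), and all probabilities, the word list `WORD_PRIOR`, and the function name `back_transliterate` are invented for illustration.

```python
import math

# Hypothetical prior over English words, P(word). Values are invented.
WORD_PRIOR = {"computer": 0.6, "commuter": 0.4}

# Hypothetical channel model, P(observed romanization | word).
# A real system would factor this into several stages; here it is one table.
CHANNEL = {
    ("konpyuutaa", "computer"): 0.5,
    ("konpyuutaa", "commuter"): 0.01,
    ("komyuutaa", "commuter"): 0.5,
    ("komyuutaa", "computer"): 0.01,
}


def back_transliterate(observed):
    """Rank English candidates by log P(word) + log P(observed | word)."""
    scored = []
    for word, prior in WORD_PRIOR.items():
        likelihood = CHANNEL.get((observed, word), 0.0)
        if likelihood > 0.0:
            score = math.log(prior) + math.log(likelihood)
            scored.append((score, word))
    # Highest-scoring candidate first.
    return [word for _, word in sorted(scored, reverse=True)]


if __name__ == "__main__":
    print(back_transliterate("konpyuutaa"))
```

Working in log space keeps the product of stage probabilities numerically stable once more stages are chained, which is why the sketch sums logarithms instead of multiplying raw values.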