Most foreign names are transliterated into Chinese, Japanese, or Korean using approximate phonetic equivalents, and the transliteration is usually achieved through an intermediate phonemic mapping. This paper presents a new framework that allows direct orthographical mapping (DOM) between two languages through a joint source-channel model, also called an n-gram transliteration model (TM). With the n-gram TM, we automate the orthographic alignment process to derive aligned transliteration units from a bilingual dictionary. The n-gram TM under the DOM framework greatly reduces system development effort and substantially improves transliteration accuracy over other state-of-the-art machine learning algorithms. The modeling framework is validated through several experiments on the English-Chinese language pair.