IEEE Transactions on Pattern Analysis and Machine Intelligence
Optimizing search engines using clickthrough data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
A new algorithm for the alignment of phonetic sequences
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Joint-sequence models for grapheme-to-phoneme conversion
Speech Communication
LIBLINEAR: A Library for Large Linear Classification
The Journal of Machine Learning Research
Report of NEWS 2009 machine transliteration shared task
NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
Whitepaper of NEWS 2009 machine transliteration shared task
NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
DirecTL: a language-independent approach to transliteration
NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
Integrating joint n-gram features into a discriminative training framework
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Letter-phoneme alignment: an exploration
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Report of NEWS 2010 transliteration generation shared task
NEWS '10 Proceedings of the 2010 Named Entities Workshop
NEWS '10 Proceedings of the 2010 Named Entities Workshop
Machine transliteration: leveraging on third languages
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Leveraging supplemental representations for sequential transduction
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Hi-index | 0.00 |
Grapheme-to-phoneme conversion (G2P) of names is an important and challenging problem. The correct pronunciation of a name is often reflected in its transliterations, which are expressed within a different phonological inventory. We investigate the problem of using transliterations to correct errors produced by state-of-the-art G2P systems. We present a novel re-ranking approach that incorporates a variety of score and n-gram features, in order to leverage transliterations from multiple languages. Our experiments demonstrate significant accuracy improvements when re-ranking is applied to n-best lists generated by three different G2P programs.