Pronunciation information is available in large quantities on the Web, in the form of IPA and ad-hoc transcriptions. We describe techniques for extracting candidate pronunciations from Web pages and associating them with orthographic words, filtering out poorly extracted pronunciations, normalizing IPA pronunciations so that they conform better to a common transcription standard, and generating phonemic transcriptions from ad-hoc ones. We show improvements on a letter-to-phoneme task when using Web-derived pronunciations instead of Pronlex pronunciations.
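
The extract, filter, and normalize stages named above can be illustrated with a toy sketch. Nothing here is taken from the system described: the slash-delimited pattern, the plausibility heuristic, the one-entry normalization table, and the function names are all assumptions chosen only to make the stages concrete.

```python
import re

# Assumed pattern: an orthographic word followed by a slash-delimited
# transcription, e.g. "gnome /noʊm/". Real pages use many other layouts.
CANDIDATE = re.compile(r"([A-Za-z]+)\s*/([^/\s]+)/")

# Assumed, illustrative normalization table: map ASCII "g" to the IPA
# script g (U+0261). A real table would cover many more symbols.
NORMALIZE = {"g": "\u0261"}


def extract_candidates(text):
    """Return (word, transcription) pairs found by the assumed pattern."""
    return [(m.group(1).lower(), m.group(2)) for m in CANDIDATE.finditer(text)]


def looks_plausible(word, pron):
    """Hypothetical filter: reject digits and grossly over-long transcriptions."""
    if any(ch.isdigit() for ch in pron):
        return False
    return len(pron) <= 3 * len(word)


def normalize(pron):
    """Rewrite each symbol through the normalization table."""
    return "".join(NORMALIZE.get(ch, ch) for ch in pron)


def harvest(text):
    """Extract, filter, then normalize candidate pronunciations."""
    return [
        (word, normalize(pron))
        for word, pron in extract_candidates(text)
        if looks_plausible(word, pron)
    ]


if __name__ == "__main__":
    sample = "gnome /noʊm/ and tag /tæg/ but item /a1/"
    for word, pron in harvest(sample):
        print(word, pron)
```

A pattern this simple also matches non-transcriptions such as "and/or/both", which is one reason a filtering stage is needed after extraction; the heuristic shown is only a stand-in for whatever filtering is actually used.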