Reversible sound-to-letter/letter-to-sound modeling based on syllable structure

Authors:
Stephanie Seneff
Affiliations:
MIT Computer Science and Artificial Intelligence Laboratory, The Stata Center, Cambridge, MA
Venue:
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Year:
2007

Citing 4
Cited 0

Reversible letter-to-sound/sound-to-letter generation based on parsing word morphology

Speech Communication
Efficient multilingual phoneme-to-grapheme conversion based on HMM

Computational Linguistics
A multistrategy approach to improving pronunciation by analogy

Computational Linguistics
Automatic acquisition of names using speak and spell mode in spoken dialogue systems

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a new grapheme-to-phoneme framework, based on a combination of formal linguistic and statistical methods. A context-free grammar is used to parse words into their underlying syllable structure, and a set of sub-word "spellneme" units encoding both phonemic and graphemic information can be automatically derived from the parsed words. A statistical n-gram model can then be trained on a large lexicon of words represented in terms of these linguistically motivated subword units. The framework has potential applications in modeling unknown words and in linking spoken spellings with spoken pronunciations for fully automatic new-word acquisition via dialogue interaction. Results are reported on sound-to-letter experiments for the nouns in the Phonebook corpus.