Communications of the ACM - Special issue on parallelism
Pronouncing names by a combination of rule-based and case-based reasoning
Pronouncing names by a combination of rule-based and case-based reasoning
Novel-word pronunciation: a cross-language study
Speech Communication - Speech science and technology: a selection from the papers presented at the Fourth International Conference in Speech Science and Technology (SST-92)
Analogical natural language processing
Analogical natural language processing
IGTree: Using Trees for Compression and Classification in Lazy LearningAlgorithms
Artificial Intelligence Review - Special issue on lazy learning
An introduction to text-to-speech synthesis
An introduction to text-to-speech synthesis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Mathematical Techniques in Multisensor Data Fusion
Mathematical Techniques in Multisensor Data Fusion
Data-oriented methods for grapheme-to-phoneme conversion
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Paradigmatic cascades: a linguistically sound model of pronunciation by analogy
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Classifier combination for improved lexical disambiguation
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Improving data driven wordclass tagging by system combination
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Using an on-line dictionary to find rhyming words and pronunciations for unknown words
ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Automatic acquisition of names using speak and spell mode in spoken dialogue systems
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Pronunciation by analogy in normal and impaired readers
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Can syllabification improve pronunciation by analogy of English?
Natural Language Engineering
Joint-sequence models for grapheme-to-phoneme conversion
Speech Communication
Pronunciation prediction with Default&Refine
Computer Speech and Language
Multilingual pronunciation by analogy
Natural Language Engineering
Computer Speech and Language
Web derived pronunciations for spoken term detection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Reversible sound-to-letter/letter-to-sound modeling based on syllable structure
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
A Comparison of Data-Driven Automatic Syllabification Methods
SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
SRWS '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
A framework for mixed-language text-to-speech synthesis
CIMMACS'09 Proceedings of the 8th WSEAS International Conference on Computational intelligence, man-machine systems and cybernetics
The effect of lexicon composition in pronunciation by analogy
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Toward language-independent text-to-speech synthesis
WSEAS Transactions on Information Science and Applications
Letter-phoneme alignment: an exploration
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
An overview of text-to-speech synthesis techniques
CIT'10 Proceedings of the 4th international conference on Communications and information technology
Automatic conversion between pronunciations of different English accents
Speech Communication
Automatic rule extraction for modeling pronunciation variation
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Estimating dyslexia in the web
Proceedings of the International Cross-Disciplinary Conference on Web Accessibility
Diphone-based concatenative speech synthesis systems for arabic language
CSECS'11/MECHANICS'11 Proceedings of the 10th WSEAS international conference on Circuits, Systems, Electronics, Control & Signal Processing, and Proceedings of the 7th WSEAS international conference on Applied and Theoretical Mechanics
Leveraging supplemental representations for sequential transduction
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
A probabilistic approach to pronunciation by analogy
Computer Speech and Language
Hi-index | 0.00 |
Pronunciation by analogy (PbA) is a data-driven method for relating letters to sound, with potential application to next-generation text-to-speech systems. This paper extends previous work on PbA in several directions. First, we have included "full" pattern matching between input letter string and dictionary entries, as well as including lexical stress in letter-to-phoneme conversion. Second, we have extended the method to phoneme-to-letter conversion. Third, and most important, we have experimented with multiple, different strategies for scoring the candidate pronunciations. Individual scores for each strategy are obtained on the basis of rank and either multiplied or summed to produce a final, overall score. Five strategies have been studied and results obtained from all 31 possible combinations. The two combination methods perform comparably, with the product rule only very marginally superior to the sum rule. Nonparametric statistical analysis reveals that performance improves as more strategies are included in the combination: this trend is very highly significant (p « 0.0005). Accordingly for letter-to-phoneme conversion, best results are obtained when all five strategies are combined: word accuracy is raised to 65.5% relative to 61.7% for our best previous result and 63.0% for the best-performing single strategy. These improvements are very highly significant (p ~ 0 and p = 0.00011 respectively). Similar results were found for phoneme-to-letter and letter-to-stress conversion, although the former was an easier problem for PbA than letter-to-phoneme conversion and the latter was harder. The main sources of error for the multistrategy approach are very similar to those for the best single strategy, and mostly involve vowel letters and phonemes.