Multigranular scale speech recognizers: technological and cognitive view

Authors:
Francesco Cutugno;Gianpaolo Coro;Massimo Petrillo
Affiliations:
Department of Physics, University Federico II, Naples, Italy;Department of Physics, University Federico II, Naples, Italy;Department of Physics, University Federico II, Naples, Italy
Venue:
AI*IA'05 Proceedings of the 9th conference on Advances in Artificial Intelligence
Year:
2005

Citing 5
Cited 0

The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty

ACM Computing Surveys (CSUR)
A comparison of Statecharts step semantics

Theoretical Computer Science
The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time'

Speech Communication - Special issue on the nature of speech perception (the psychophysics of speech perception III)
Incorporating information from syllable-length time scales into automatic speech recognition

Incorporating information from syllable-length time scales into automatic speech recognition
A syllable, articulatory-feature, and stress-accent model of speech recognition

A syllable, articulatory-feature, and stress-accent model of speech recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a Multigranular Automatic Speech Recognizer. The hypothesis is that speech signal contains information distributed on more different time scales. Many works from various scientific fields ranging from neurobiology to speech technologies, seem to concord on this assumption. In a broad sense, it seems that speech recognition in human is optimal because of a partial parallelization process according to which the left-to-right stream of speech is captured in a multilevel grid in which several linguistic analyses take place contemporarily. Our investigation aims, in this view, to apply these new ideas to the project of more robust and efficient recognizers.