Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
Fundamentals of speech recognition
Fundamentals of speech recognition
Concurrent Self-Organizing Maps for Pattern Classification
ICCI '02 Proceedings of the 1st IEEE International Conference on Cognitive Informatics
Spoken-Digit Recognition Using Self-Organizing Maps with Perceptual Pre-processing
IWANN '97 Proceedings of the International Work-Conference on Artificial and Natural Neural Networks: Biological and Artificial Computation: From Neuroscience to Technology
New Developments and Applications of Self-Organizing Maps
NICROSP '96 Proceedings of the 1996 International Workshop on Neural Networks for Identification, Control, Robotics, and Signal/Image Processing (NICROSP '96)
Digital Signal Processing Using MATLAB (Bookware Companion)
Digital Signal Processing Using MATLAB (Bookware Companion)
On the use of residual cepstrum in speech recognition
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Telephone speech recognition using neural networks and hidden Markov models
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Product of Gaussians for speech recognition
Computer Speech and Language
Study on pharyngeal and uvular consonants in foreign accented Arabic for ASR
Computer Speech and Language
Hi-index | 0.00 |
Traditional speech recognition systems have relied on power spectral densities, Mel-frequency cepstral, linear prediction coding and formant analysis. This paper introduces two novel input feature sets and their extraction methods for intelligent phoneme identification. These input sets are based on intrinsic phonetic characteristics of Arabic speech comprising of the dimensionally reduced Power Spectral Densities (DPSD) and Location, Trend, Gradient (LTG) values of the captured speech signal spectrum. These characteristics have been subsequently utilized as inputs to four different neural network based recognition classifiers. The classifiers have been tested for twenty-eight Arabic phonemes utterances from over one hundred nonnative speakers. The results obtained using the proposed feature sets have been compared and it has been observed that LTG based input feature set provides an average phoneme identification accuracy of 86% as compared to 70% obtained through applying DPSD based inputs for similar classifiers. It is worthwhile to note that the methods proposed in this paper are generic and are equally applicable to other regional languages such as Persian and Urdu.