Arabic phoneme identification using conventional and concurrent neural networks in non native speakers

Authors:
Mian M. Awais;Shahid Masud;Junaid Ahktar;Shafay Shamail
Affiliations:
Department of Computer Science, Lahore University of Management Sciences, Sector-U, D.H.A., Lahore, Pakistan;Department of Computer Science, Lahore University of Management Sciences, Sector-U, D.H.A., Lahore, Pakistan;Department of Computer Science, Lahore University of Management Sciences, Sector-U, D.H.A., Lahore, Pakistan;Department of Computer Science, Lahore University of Management Sciences, Sector-U, D.H.A., Lahore, Pakistan
Venue:
ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Year:
2007

Citing 10
Cited 1

Numerical recipes in C (2nd ed.): the art of scientific computing

Numerical recipes in C (2nd ed.): the art of scientific computing
Fundamentals of speech recognition

Fundamentals of speech recognition
Physiological interpretation of the Self-Organizing Map algorithm

Neural Networks
Concurrent Self-Organizing Maps for Pattern Classification

ICCI '02 Proceedings of the 1st IEEE International Conference on Cognitive Informatics
Spoken-Digit Recognition Using Self-Organizing Maps with Perceptual Pre-processing

IWANN '97 Proceedings of the International Work-Conference on Artificial and Natural Neural Networks: Biological and Artificial Computation: From Neuroscience to Technology
New Developments and Applications of Self-Organizing Maps

NICROSP '96 Proceedings of the 1996 International Workshop on Neural Networks for Identification, Control, Robotics, and Signal/Image Processing (NICROSP '96)
Digital Signal Processing Using MATLAB (Bookware Companion)

Digital Signal Processing Using MATLAB (Bookware Companion)
On the use of residual cepstrum in speech recognition

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Telephone speech recognition using neural networks and hidden Markov models

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Product of Gaussians for speech recognition

Computer Speech and Language

Study on pharyngeal and uvular consonants in foreign accented Arabic for ASR

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional speech recognition systems have relied on power spectral densities, Mel-frequency cepstral, linear prediction coding and formant analysis. This paper introduces two novel input feature sets and their extraction methods for intelligent phoneme identification. These input sets are based on intrinsic phonetic characteristics of Arabic speech comprising of the dimensionally reduced Power Spectral Densities (DPSD) and Location, Trend, Gradient (LTG) values of the captured speech signal spectrum. These characteristics have been subsequently utilized as inputs to four different neural network based recognition classifiers. The classifiers have been tested for twenty-eight Arabic phonemes utterances from over one hundred nonnative speakers. The results obtained using the proposed feature sets have been compared and it has been observed that LTG based input feature set provides an average phoneme identification accuracy of 86% as compared to 70% obtained through applying DPSD based inputs for similar classifiers. It is worthwhile to note that the methods proposed in this paper are generic and are equally applicable to other regional languages such as Persian and Urdu.