Biologically motivated computationally intensive approaches to image pattern recognition
Future Generation Computer Systems - Special double issue: high performance computing and networking (HPCN)
Automatic scoring of pronunciation quality
Speech Communication
Phone-level pronunciation scoring and assessment for interactive language learning
Speech Communication
Combination of machine scores for automatic grading of pronunciation quality
Speech Communication
Explicit, N-Best Formant Features for Vowel Classification
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Automatic Pronunciation Scoring for Language Instruction
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Comparison of texture features based on Gabor filters
IEEE Transactions on Image Processing
Hi-index | 0.00 |
This paper discusses Mandarin vowel pronunciation quality assessment. The phonetic pronunciation quality is traditionally evaluated under the speech recognition framework by the phonetic posterior probability score, which may be computed by normalizing the frame-based posterior probability or be calculated on the phone segment directly. By the first method, we can achieve a human-machine scoring correlation coefficient (CC) of 0.832 for vowel; and by the second, the CC can be up to 0.847. This paper proposes a novel kind of formant feature and applies the feature to the evaluation of vowel: we transform the formant plots on the time-frequency plane to a bitmap and extract its Gabor feature for pattern classification; when use the classification probability for pronunciation assessment, we can get a CC of 0.842. Finally we combine the three scores with various linear or nonlinear methods; the best CC of 0.913 is gotten by using neural network.