Integration of complementary honerecognizers for phonotactic language recognition

Authors:
Yan Deng;Weiqiang Zhang;Yanmin Qian;Jia Liu
Affiliations:
Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, China;Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, China;Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, China;Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, China
Venue:
ICICA'10 Proceedings of the First international conference on Information computing and applications
Year:
2010

Citing 2
Cited 0

A Vector Space Modeling Approach to Spoken Language Identification

IEEE Transactions on Audio, Speech, and Language Processing
On Acoustic Diversification Front-End for Spoken Language Identification

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper takes an investigation into building and fusing multiple phone recognizers in the phonotactic system for language recognition. The phone recognizers are built using both phonetic and acoustic diversification. The phonetic diversification is achieved by training multiple phone recognizers on speech corpus of different languages. While the acoustic diversification is implemented in several ways, including using different acoustic features, different phone modeling techniques and training paradigms. As some phone recognizers are highly correlated with each other, we propose a performance optimization (PO) criterion to select a set of complementary phone recognizers for fusion. Experimental results on the NIST 2007 Language Recognition Evaluation (LRE) 30-s test set show the effectiveness of the proposed approach.