Exploiting high-level information provided by ALISP in speaker recognition

Authors:
Asmaa El Hannani;Dijana Petrovska-Delacrétaz
Affiliations:
DIVA Group, Informatics Dept., University of Fribourg, Switzerland;DIVA Group, Informatics Dept., University of Fribourg, Switzerland
Venue:
NOLISP'05 Proceedings of the 3rd international conference on Non-Linear Analyses and Algorithms for Speech Processing
Year:
2005

Abstract

The best-performing systems in automatic speaker recognition have focused on short-term, low-level acoustic information, such as cepstral features. Recently, various works have demonstrated that high-level features convey additional speaker information and can be combined with the low-level features to increase the robustness of the system. This paper describes a text-independent speaker recognition system exploiting high-level information provided by ALISP (Automatic Language Independent Speech Processing), a data-driven segmentation method. This system, denoted here as the ALISP n-gram system, captures speaker-specific information solely by analyzing sequences of ALISP units. The ALISP n-gram system was fused with an acoustic ALISP-based Gaussian Mixture Model (GMM) system that exploits the speaker-discriminating properties of individual speech classes. The resulting fused system achieved a lower error rate than either individual system on the NIST 2004 Speaker Recognition Evaluation data.
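The abstract does not say how the n-gram scores are computed or how the two systems are fused. As an illustration only, the sketch below shows one common way such a pipeline could look: a smoothed bigram log-likelihood ratio between a speaker model and a background model over sequences of unit labels, followed by a weighted-sum fusion with a second score. The function names, the add-one smoothing, the bigram order, and the fusion weight are all assumptions made for this example and are not taken from the paper.

```python
from collections import Counter
import math


def ngram_counts(units, n=2):
    """Count the n-grams in a sequence of segment labels."""
    return Counter(tuple(units[i:i + n]) for i in range(len(units) - n + 1))


def ngram_probs(counts, vocab_size, alpha=1.0):
    """Return a function giving add-alpha smoothed n-gram probabilities.

    The smoothing scheme is an assumption of this sketch.
    """
    denom = sum(counts.values()) + alpha * vocab_size
    return lambda gram: (counts.get(gram, 0) + alpha) / denom


def llr_score(test_units, speaker_counts, background_counts, symbols, n=2):
    """Average log-likelihood ratio of the test n-grams, speaker vs. background."""
    vocab_size = len(symbols) ** n  # number of possible n-grams
    p_spk = ngram_probs(speaker_counts, vocab_size)
    p_bkg = ngram_probs(background_counts, vocab_size)
    test = ngram_counts(test_units, n)
    total = sum(test.values())
    if total == 0:
        return 0.0  # sequence too short to form any n-gram
    return sum(c * math.log(p_spk(g) / p_bkg(g)) for g, c in test.items()) / total


def fuse(score_a, score_b, weight=0.5):
    """Weighted-sum fusion of two scores (hypothetical fusion rule)."""
    return weight * score_a + (1.0 - weight) * score_b
```

In use, `speaker_counts` and `background_counts` would be built with `ngram_counts` from the unit sequences of the target speaker and of a pool of other speakers, and the output of `llr_score` would be passed to `fuse` together with a score from the acoustic system. In practice the two scores would typically need normalization to a comparable range before being combined.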