Exploiting high-level information provided by ALISP in speaker recognition

  • Authors: Asmaa El Hannani; Dijana Petrovska-Delacrétaz

  • Affiliations: DIVA Group, Informatics Dept., University of Fribourg, Switzerland (both authors)

  • Venue: NOLISP'05: Proceedings of the 3rd International Conference on Non-Linear Analyses and Algorithms for Speech Processing
  • Year: 2005

Abstract

The best-performing systems in automatic speaker recognition have focused on short-term, low-level acoustic information, such as cepstral features. Recently, various works have demonstrated that high-level features convey additional speaker information and can be combined with the low-level features to increase the robustness of the system. This paper describes a text-independent speaker recognition system that exploits high-level information provided by ALISP (Automatic Language Independent Speech Processing), a data-driven segmentation method. This system, denoted here as the ALISP n-gram system, captures speaker-specific information solely by analyzing sequences of ALISP units. The ALISP n-gram system was fused with an acoustic ALISP-based Gaussian Mixture Model (GMM) system that exploits the speaker-discriminating properties of individual speech classes. The resulting fused system reduced the error rate relative to each individual system on the NIST 2004 Speaker Recognition Evaluation data.
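
To make the idea of scoring a speaker from unit sequences alone more concrete, the following Python sketch shows one possible way such a system could work. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the scoring rule or the fusion method, so the average log-likelihood ratio against a background model, the additive smoothing, and the weighted-sum fusion are all assumptions. The function names (`train_model`, `llr_score`, `fuse`) and the single-letter unit labels are hypothetical.

```python
from collections import Counter
import math


def ngrams(units, n=2):
    """Return the n-grams (as tuples) found in a sequence of unit labels."""
    return [tuple(units[i:i + n]) for i in range(len(units) - n + 1)]


def train_model(sequences, n=2):
    """Count n-grams over a list of unit-label sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(ngrams(seq, n))
    return counts


def log_prob(gram, counts, vocab_size, alpha=1.0):
    """Additively smoothed log relative frequency of one n-gram (assumed smoothing)."""
    total = sum(counts.values())
    return math.log((counts[gram] + alpha) / (total + alpha * vocab_size))


def llr_score(test_units, speaker_counts, background_counts, n=2):
    """Average log-likelihood ratio of the speaker model vs. the background model."""
    grams = ngrams(test_units, n)
    if not grams:
        return 0.0
    # Shared n-gram vocabulary so both models are smoothed over the same event set.
    vocab_size = len(set(speaker_counts) | set(background_counts) | set(grams))
    total = sum(
        log_prob(g, speaker_counts, vocab_size)
        - log_prob(g, background_counts, vocab_size)
        for g in grams
    )
    return total / len(grams)


def fuse(ngram_score, acoustic_score, weight=0.5):
    """Weighted-sum score fusion (an assumed rule; the weight would need tuning)."""
    return weight * ngram_score + (1.0 - weight) * acoustic_score


# Toy data: hypothetical unit labels standing in for ALISP units.
speaker = train_model([["A", "B", "A", "B", "A", "B"]])
background = train_model([
    ["A", "B", "C", "D", "C", "D"],
    ["C", "D", "A", "C", "D", "B"],
])
target_score = llr_score(["A", "B", "A", "B"], speaker, background)
impostor_score = llr_score(["C", "D", "C", "D"], speaker, background)
```

In this toy setup, a test sequence that repeats the speaker's typical unit pairs scores above zero, while a sequence dominated by pairs seen mostly in the background data scores below zero. In practice the n-gram score and the acoustic score would usually be normalized to comparable ranges before fusion.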