Statistical Evaluation of Biometric Evidence in Forensic Automatic Speaker Recognition

Authors:
Andrzej Drygajlo
Affiliations:
Speech Processing and Biometrics Group, Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland CH-1015
Venue:
IWCF '09 Proceedings of the 3rd International Workshop on Computational Forensics
Year:
2009

Citing 2
Cited 0

The inference of identity in forensic speaker recognition

Speech Communication - Speaker recognition and its commercial and forensic applications
Handbook of Biometrics

Handbook of Biometrics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Forensic speaker recognition is the process of determining if a specific individual (suspected speaker) is the source of a questioned voice recording (trace). This paper aims at presenting forensic automatic speaker recognition (FASR) methods that provide a coherent way of quantifying and presenting recorded voice as biometric evidence. In such methods, the biometric evidence consists of the quantified degree of similarity between speaker-dependent features extracted from the trace and speaker-dependent features extracted from recorded speech of a suspect. The interpretation of recorded voice as evidence in the forensic context presents particular challenges, including within-speaker (within-source) variability and between-speakers (between-sources) variability. Consequently, FASR methods must provide a statistical evaluation which gives the court an indication of the strength of the evidence given the estimated within-source and between-sources variabilities. This paper reports on the first ENFSI evaluation campaign through a fake case, organized by the Netherlands Forensic Institute (NFI), as an example, where an automatic method using the Gaussian mixture models (GMMs) and the Bayesian interpretation (BI) framework were implemented for the forensic speaker recognition task.