Hierarchical organization of a set of Gaussian mixture speaker models for scaling up indexing and retrieval in audio documents

Authors:
J. E. Rougui;M. Rziza;D. Aboutajdine;M. Gelgon;J. Martinez
Affiliations:
Mohammed V University, RP Rabat- Morocco;Mohammed V University, RP Rabat- Morocco;Mohammed V University, RP Rabat- Morocco;l'université de Nantes, Nantes cedex, France;l'université de Nantes, Nantes cedex, France
Venue:
Proceedings of the 2006 ACM symposium on Applied computing
Year:
2006

Citing 1
Cited 0

Structuring and Querying Documents in an Audio Database Management System

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work considers the case of spoken radio archives, in which a continuous ow of speech is introduced into the indexing system. To retrieval segments originating from a given speaker, scanning the set of enrolled speaker is needed. We propose a technique for organizing hierarchically the set of speaker models, to reduce the cost of speaker identification. This organization can be obtained and updated at low cost, thanks to a particular algorithm that exploits the Gaussian mixture form of each speaker model. This parcimony is due to having computations carried out of model parameters rather than feature vectors. A first validation on real data illustrates the proposal.