Organizing Gaussian mixture models into a tree for scaling up speaker retrieval

  • Authors:
  • J. E. Rougui;M. Gelgon;D. Aboutajdine;N. Mouaddib;M. Rziza

  • Affiliations:
  • Ecole polytechnique de l'université de Nantes, INRIA Atlas project-team, LINA (FRE CNRS 2729), BP 50609, 44306 Nantes Cedex 03, France and Groupe Signaux Communications et Multimedia, Facult& ...;Ecole polytechnique de l'université de Nantes, INRIA Atlas project-team, LINA (FRE CNRS 2729), BP 50609, 44306 Nantes Cedex 03, France;Groupe Signaux Communications et Multimedia, Faculté des Sciences Rabat-Agdal, 4, Av Ibn Battouta, Rabat, Morocco;Ecole polytechnique de l'université de Nantes, INRIA Atlas project-team, LINA (FRE CNRS 2729), BP 50609, 44306 Nantes Cedex 03, France;Groupe Signaux Communications et Multimedia, Faculté des Sciences Rabat-Agdal, 4, Av Ibn Battouta, Rabat, Morocco

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2007

Quantified Score

Hi-index 0.10

Visualization

Abstract

Numerous pattern recognition tasks set in the probabilistic framework face the following issue: it is expensive to evaluate the likelihood function for test data, when there are given very many candidate probabilistic models for explaining this data. We consider the application of this general and important problem to speaker recognition for indexing and retrieval purposes in radio archives. More precisely, we propose to reduce complexity at query time, by prior organization of speaker models into a hierarchy. This is very classically done for multi-dimensional vectors, but we propose herein a technique for building a hierarchy of probabilistic models, in the case these models take the form of a Gaussian mixture. From a closed-form approximation of Kullback-Leibler divergence between parent and children, an optimality criterion and an optimization technique are derived, from which we propose an efficient approach for building a tree of models, using clustering techniques (dendrogram-based or k-means-like). The proposed scheme is evaluated on real data.