A simulated annealing approach to speaker segmentation in audio databases

  • Authors:
  • José M. Leiva-Murillo;Sancho Salcedo-Sanz;Ascensión Gallardo-Antolín;Antonio Artés-Rodríguez

  • Affiliations:
  • Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Spain;Department of Signal Theory and Communications, Universidad de Alcalá, 28871 Alcalá de Henares, Madrid, Spain;Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Spain;Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Spain

  • Venue:
  • Engineering Applications of Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a novel approach to the problem of speaker segmentation, which is an unavoidable previous step to audio indexing. Mutual information is used for evaluating the accuracy of the segmentation, as a function to be maximized by a simulated annealing (SA) algorithm. We introduce a novel mutation operator for the SA, the Consecutive Bits Mutation operator, which improves the performance of the SA in this problem. We also use the so-called Compaction Factor, which allows the SA to operate in a reduced search space. Our algorithm has been tested in the segmentation of real audio databases, and it has been compared to several existing algorithms for speaker segmentation, obtaining very good results in the test problems considered.