A study on the relation between the frame pruning and the robust speaker identification with multivariate t-distribution

  • Authors:
  • Younjeong Lee;Joohun Lee;Hernsoo Hahn

  • Affiliations:
  • School of Electronic Engineering, Soongsil University, Seoul, Korea;Dept. of Internet Broadcasting, Dong-Ah Broadcasting College, Anseong, Korea;School of Electronic Engineering, Soongsil University, Seoul, Korea

  • Venue:
  • PCM'05 Proceedings of the 6th Pacific-Rim conference on Advances in Multimedia Information Processing - Volume Part I
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we performed the robust speaker identification based on the frame pruning and multivariate t-distribution respectively, and then studied on a theoretical basis for the frame pruning using the other methods. Based on the results from two methods, we showed that the robust algorithms based on the weight of frames become the theoretical basis of the frame pruning method by considering the correspondence between the weight of frame pruning and the conditional expectation of t-distribution. Both methods showed good performance when coping with the outliers occurring in a given time period, while the frame pruning method removing less reliable frames is recommended as one of good methods and, also, the multivariate t-distributions are generally used instead of Gaussian mixture models (GMM) as a robust approach for the speaker identification. In experiments, we found that the robust speaker identification has higher performance than the typical GMM algorithm. Moreover, we showed that the trend of frame likelihood using the frame pruning is similar to one of robust algorithms.