Feature transformation based on discriminant analysis preserving local structure for speech recognition

  • Authors:
  • Makoto Sakai; Norihide Kitaoka; Kazuya Takeda

  • Affiliations:
  • DENSO CORPORATION, Nisshin 470-0111, Japan; Nagoya University, 464-8601, Japan; Nagoya University, 464-8601, Japan

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Abstract

To improve speech recognition performance, feature transformations based on discriminant analysis are widely used to remove redundant feature dimensions. Linear discriminant analysis (LDA) and heteroscedastic discriminant analysis (HDA) are often used for this purpose, and a generalization of LDA and HDA called Power LDA (PLDA) has been proposed. However, these methods may produce unexpected results when reducing the dimensionality of multimodal data. For such data, it is important to preserve the local structure of the data during dimensionality reduction. In this paper, we introduce two methods, locality preserving HDA and locality preserving PLDA. We also give an efficient calculation scheme to obtain an optimal projection.
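
As background for the abstract, the following is a minimal sketch of classical LDA used for dimensionality reduction. It is not the locality preserving HDA or PLDA that the paper introduces, whose details the abstract does not give. The function name `lda_projection`, the NumPy-based formulation, and the toy two-class data are all assumptions made for illustration.

```python
import numpy as np


def lda_projection(X, y, n_components):
    """Classical LDA sketch: return a (d, n_components) projection matrix.

    Illustrative only; this is textbook LDA, not the paper's method.
    """
    classes = np.unique(y)
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    S_w = np.zeros((d, d))  # within-class scatter
    S_b = np.zeros((d, d))  # between-class scatter
    for c in classes:
        Xc = X[y == c]
        mean_c = Xc.mean(axis=0)
        centered = Xc - mean_c
        S_w += centered.T @ centered
        diff = (mean_c - mean_all)[:, None]
        S_b += Xc.shape[0] * (diff @ diff.T)
    # Eigenvectors of S_w^{-1} S_b; keep those with the largest eigenvalues.
    eigvals, eigvecs = np.linalg.eig(np.linalg.solve(S_w, S_b))
    order = np.argsort(eigvals.real)[::-1]
    return eigvecs[:, order[:n_components]].real


# Hypothetical toy data: two unimodal Gaussian classes in 3 dimensions.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal([0.0, 0.0, 0.0], 1.0, size=(100, 3)),
    rng.normal([4.0, 0.0, 0.0], 1.0, size=(100, 3)),
])
y = np.repeat([0, 1], 100)

W = lda_projection(X, y, n_components=1)  # 3 -> 1 dimension
Z = X @ W                                 # projected features
```

Because this criterion summarizes each class only by its mean and scatter, it carries no information about neighborhoods inside a class, which is the kind of local structure the abstract argues should be preserved for multimodal data.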