On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification

Authors:
Md. Sahidullah;Sandipan Chakroborty;Goutam Saha
Affiliations:
Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721 302, India.;Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721 302, India.;Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721 302, India
Venue:
International Journal of Biometrics
Year:
2010

Citing 14
Cited 2

On Image Analysis by the Methods of Moments

IEEE Transactions on Pattern Analysis and Machine Intelligence
3-D Moment Forms: Their Construction and Application to Object Identification and Positioning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Fundamentals of speech recognition

Fundamentals of speech recognition
On Image Analysis by Moments

IEEE Transactions on Pattern Analysis and Machine Intelligence
On Combining Classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence
Statistical properties of line spectrum pairs

Signal Processing
Robust speaker verification with state duration modeling

Speech Communication
Comparative Study of Speaker Identification Methods: dPLRM, SVM and GMM

IEICE - Transactions on Information and Systems
Properties of line spectrum pair polynomials: a review

Signal Processing - Special section: Distributed source coding
Capturing Complementary Information via Reversed Filter Bank and Parallel Implementation with MFCC for Improved Text-Independent Speaker Identification

ICCTA '07 Proceedings of the International Conference on Computing: Theory and Applications
Robust speaker modeling using perceptually motivated feature

Pattern Recognition Letters
Review: Line spectral pairs

Signal Processing
Investigation on LP-residual representations for speaker identification

Pattern Recognition
An overview of text-independent speaker recognition: From features to supervectors

Speech Communication

Robust speaker identification in the presence of car noise

International Journal of Biometrics
Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition

Speech Communication

Quantified Score

Hi-index	0.01

Visualization

Abstract

Conventional Speaker Identification (SI) systems utilise spectral features like Mel-Frequency Cepstral Coefficients (MFCC) or Perceptual Linear Prediction (PLP) as a frontend module. Line Spectral pairs Frequencies (LSF) are popular alternative representation of Linear Prediction Coefficients (LPC). In this paper, an investigation is carried out to extract LSF from perceptually modified speech. A new feature set extracted from the residual signal is also proposed. SI system based on this residual feature containing complementary information to spectral characteristics, when fused with the conventional spectral feature based system as well as the proposed perceptually modified LSF, shows improved performance.