Spectral histogram of oriented gradients (SHOGs) for Tamil language male/female speaker classification

Authors:
A. Muthamizh Selvan;R. Rajesh
Affiliations:
Dept. of Computer Applications, School of Computer Science and Engineering, Bharathiar University, Coimbatore, India 641 046;Dept. of Computer Applications, School of Computer Science and Engineering, Bharathiar University, Coimbatore, India 641 046
Venue:
International Journal of Speech Technology
Year:
2012

Citing 19
Cited 0

Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Bilateral Filtering for Gray and Color Images

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Fast Human Detection by Boosting Histograms of Oriented Gradients

ICIG '07 Proceedings of the Fourth International Conference on Image and Graphics
Integrated phoneme subspace method for speech feature extraction

EURASIP Journal on Audio, Speech, and Music Processing
Automatic speech recognition systems for the evaluation of voice and speech disorders in head and neck cancer

EURASIP Journal on Audio, Speech, and Music Processing - Special issue on atypical speech
Combining auditory preprocessing and Bayesian estimation for robust formant tracking

IEEE Transactions on Audio, Speech, and Language Processing
Compact acoustic models for embedded speech recognition

EURASIP Journal on Audio, Speech, and Music Processing
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments

IEEE Transactions on Audio, Speech, and Language Processing
Pitch-and formant-based order adaptation of the fractional Fourier transform and its application to speech recognition

EURASIP Journal on Audio, Speech, and Music Processing
Independent component analysis and time-frequency masking for speech recognition in multitalker conditions

EURASIP Journal on Audio, Speech, and Music Processing
Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method

IEEE Transactions on Audio, Speech, and Language Processing
Speech Recognition Using Linear Dynamic Models

IEEE Transactions on Audio, Speech, and Language Processing
Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition

IEEE Transactions on Audio, Speech, and Language Processing
Feature Compensation Techniques for ASR on Band-Limited Speech

IEEE Transactions on Audio, Speech, and Language Processing
Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients

IEEE Transactions on Audio, Speech, and Language Processing
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

IEEE Transactions on Audio, Speech, and Language Processing
A speech/music discriminator based on RMS and zero-crossings

IEEE Transactions on Multimedia
A Speech/Music Discriminator of Radio Recordings Based on Dynamic Programming and Bayesian Networks

IEEE Transactions on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Gender (Male/Female) classification plays a primary vital role to develop a robust Automatic Tamil Speech Recognition (ASR) applications due to the diversity in the vocal tract of speakers. Various features including Formants (F1, F2, F3, F4), Zero Crossings, and Mel-Frequency Cepstral Coefficients (MFCCs) etc. have appeared in the literature especially for speech/signal classification/recognition. Recently Dalal et al. have proposed a feature called as Histogram of Oriented Gradients (HOG) for extracting feature from an image for efficient detection/classification of objects. We extend and apply the HOG for spectrogram of speech signal and hence called as Spectral Histogram of Oriented Gradients (SHOGs). The results of Tamil language male/female speaker classification using SHOGs features shows good improvement in the classification rate when compared to other features. The results of combination of various features with SHOGs are also promissing.