Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Bilateral Filtering for Gray and Color Images
ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Histograms of Oriented Gradients for Human Detection
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Fast Human Detection by Boosting Histograms of Oriented Gradients
ICIG '07 Proceedings of the Fourth International Conference on Image and Graphics
Integrated phoneme subspace method for speech feature extraction
EURASIP Journal on Audio, Speech, and Music Processing
EURASIP Journal on Audio, Speech, and Music Processing - Special issue on atypical speech
Combining auditory preprocessing and Bayesian estimation for robust formant tracking
IEEE Transactions on Audio, Speech, and Language Processing
Compact acoustic models for embedded speech recognition
EURASIP Journal on Audio, Speech, and Music Processing
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
IEEE Transactions on Audio, Speech, and Language Processing
EURASIP Journal on Audio, Speech, and Music Processing
EURASIP Journal on Audio, Speech, and Music Processing
IEEE Transactions on Audio, Speech, and Language Processing
Speech Recognition Using Linear Dynamic Models
IEEE Transactions on Audio, Speech, and Language Processing
Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing
Feature Compensation Techniques for ASR on Band-Limited Speech
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features
IEEE Transactions on Audio, Speech, and Language Processing
A speech/music discriminator based on RMS and zero-crossings
IEEE Transactions on Multimedia
A Speech/Music Discriminator of Radio Recordings Based on Dynamic Programming and Bayesian Networks
IEEE Transactions on Multimedia
Hi-index | 0.00 |
Gender (Male/Female) classification plays a primary vital role to develop a robust Automatic Tamil Speech Recognition (ASR) applications due to the diversity in the vocal tract of speakers. Various features including Formants (F1, F2, F3, F4), Zero Crossings, and Mel-Frequency Cepstral Coefficients (MFCCs) etc. have appeared in the literature especially for speech/signal classification/recognition. Recently Dalal et al. have proposed a feature called as Histogram of Oriented Gradients (HOG) for extracting feature from an image for efficient detection/classification of objects. We extend and apply the HOG for spectrogram of speech signal and hence called as Spectral Histogram of Oriented Gradients (SHOGs). The results of Tamil language male/female speaker classification using SHOGs features shows good improvement in the classification rate when compared to other features. The results of combination of various features with SHOGs are also promissing.