Gender identification using a general audio classifier

Authors:
H. Harb;Liming Chen
Affiliations:
Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France;Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France
Venue:
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Year:
2003

Citing 0
Cited 9

Voice-based gender identification in multimedia applications

Journal of Intelligent Information Systems - Special issue: Intelligent multimedia applications
Gender Recognition Based on Fusion of Face and Multi-view Gait

ICB '09 Proceedings of the Third International Conference on Advances in Biometrics
A study on gait-based gender classification

IEEE Transactions on Image Processing
Voice-based gender identification via multiresolution frame classification of spectro-temporal maps

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Language independent voice-based gender identification system

Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India
Neuro-fuzzy-based biometric system using speech features

International Journal of Biometrics
Practical considerations for real-time implementation of speech-based gender detection

CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
Acoustic classification and segmentation using modified spectral roll-off and variance-based features

Digital Signal Processing
Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments

International Journal of Speech Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum's statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.