Optimal multi-step k-nearest neighbor search
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Fast training of support vector machines using sequential minimal optimization
Advances in kernel methods
Statistical Pattern Recognition: A Review
IEEE Transactions on Pattern Analysis and Machine Intelligence
The development of the HTK Broadcast News transcription system: an overview
Speech Communication - Special issue on automatic transcription of broadcast news data
The LIMSI Broadcast News transcription system
Speech Communication - Special issue on automatic transcription of broadcast news data
Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach
Speech Communication - Special issue on automatic transcription of broadcast news data
Improved modeling and efficiency for automatic transcription of Broadcast News
Speech Communication - Special issue on automatic transcription of broadcast news data
Machine Learning
Expert Systems with Applications: An International Journal
Automatic speech recognition and speech variability: A review
Speech Communication
ZemPod: A semantic web approach to podcasting
Web Semantics: Science, Services and Agents on the World Wide Web
Detection of speech and music based on spectral tracking
Speech Communication
Classification of audio signals using SVM and RBFNN
Expert Systems with Applications: An International Journal
Speaker identification based on the frame linear predictive coding spectrum technique
Expert Systems with Applications: An International Journal
Unsupervised speaker segmentation with residual phase and MFCC features
Expert Systems with Applications: An International Journal
An overview of text-independent speaker recognition: From features to supervectors
Speech Communication
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
IEEE Transactions on Audio, Speech, and Language Processing
Adaptive phoneme alignment based on rough set theory
RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Robust speech/non-speech classification in heterogeneous multimedia content
Speech Communication
Pattern classification models for classifying and indexing audio signals
Engineering Applications of Artificial Intelligence
Expert Systems with Applications: An International Journal
Emotion recognition using a hierarchical binary decision tree approach
Speech Communication
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
The present paper focuses on the investigation of various audio pattern classifiers in broadcast-audio semantic analysis, using radio-programme-adaptive classification strategies with supervised training. Multiple neural network topologies and training configurations are evaluated and compared in combination with feature-extraction, ranking and feature-selection procedures. Different pattern classification taxonomies are implemented, using programme-adapted multi-class definitions and hierarchical schemes. Hierarchical and hybrid classification taxonomies are deployed in speech analysis tasks, facilitating efficient speaker recognition/identification, speech/music discrimination, and generally speech/non-speech detection-segmentation. Exhaustive qualitative and quantitative evaluation is conducted, including indicative comparison with non-neural approaches. Hierarchical approaches offer classification-similarities for easy adaptation to generic radio-broadcast semantic analysis tasks. The proposed strategy exhibits increased efficiency in radio-programme content segmentation and classification, which is one of the most demanding audio semantics tasks. This strategy can be easily adapted in broader audio detection and classification problems, including additional real-world speech-communication demanding scenarios.