Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification

Authors:
R. Kotsakis;G. Kalliris;C. Dimoulas
Affiliations:
Laboratory of Electronic Media, Dept. of Journalism and Mass Communication, Aristotle University of Thessaloniki, Greece;Laboratory of Electronic Media, Dept. of Journalism and Mass Communication, Aristotle University of Thessaloniki, Greece;Laboratory of Electronic Media, Dept. of Journalism and Mass Communication, Aristotle University of Thessaloniki, Greece
Venue:
Speech Communication
Year:
2012

Citing 28
Cited 0

Optimal multi-step k-nearest neighbor search

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Fast training of support vector machines using sequential minimal optimization

Advances in kernel methods
Statistical Pattern Recognition: A Review

IEEE Transactions on Pattern Analysis and Machine Intelligence
The development of the HTK Broadcast News transcription system: an overview

Speech Communication - Special issue on automatic transcription of broadcast news data
The LIMSI Broadcast News transcription system

Speech Communication - Special issue on automatic transcription of broadcast news data
Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach

Speech Communication - Special issue on automatic transcription of broadcast news data
Improved modeling and efficiency for automatic transcription of Broadcast News

Speech Communication - Special issue on automatic transcription of broadcast news data
Induction of Decision Trees

Machine Learning
Speech/music segmentation using entropy and dynamism features in a HMM classification framework

Speech Communication
Long-term signal detection, segmentation and summarization using wavelets and fractal dimension: A bioacoustics application in gastrointestinal-motility monitoring

Computers in Biology and Medicine
Bowel-sound pattern analysis using wavelets and neural networks with application to long-term, unsupervised, gastrointestinal motility monitoring

Expert Systems with Applications: An International Journal
Automatic speech recognition and speech variability: A review

Speech Communication
ZemPod: A semantic web approach to podcasting

Web Semantics: Science, Services and Agents on the World Wide Web
Detection of speech and music based on spectral tracking

Speech Communication
Classification of audio signals using SVM and RBFNN

Expert Systems with Applications: An International Journal
Speaker identification based on the frame linear predictive coding spectrum technique

Expert Systems with Applications: An International Journal
Unsupervised speaker segmentation with residual phase and MFCC features

Expert Systems with Applications: An International Journal
An overview of text-independent speaker recognition: From features to supervectors

Speech Communication
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Story segmentation and topic classification of broadcast news via a topic-based segmental model and a genetic algorithm

IEEE Transactions on Audio, Speech, and Language Processing
Adaptive phoneme alignment based on rough set theory

RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Robust speech/non-speech classification in heterogeneous multimedia content

Speech Communication
Pattern classification models for classifying and indexing audio signals

Engineering Applications of Artificial Intelligence
Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features

Speech Communication
Robust speech detection in real acoustic backgrounds with perceptually motivated features

Speech Communication
Pattern classification and audiovisual content management techniques using hybrid expert systems: A video-assisted bioacoustics application in Abdominal Sounds pattern analysis

Expert Systems with Applications: An International Journal
Emotion recognition using a hierarchical binary decision tree approach

Speech Communication
Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The present paper focuses on the investigation of various audio pattern classifiers in broadcast-audio semantic analysis, using radio-programme-adaptive classification strategies with supervised training. Multiple neural network topologies and training configurations are evaluated and compared in combination with feature-extraction, ranking and feature-selection procedures. Different pattern classification taxonomies are implemented, using programme-adapted multi-class definitions and hierarchical schemes. Hierarchical and hybrid classification taxonomies are deployed in speech analysis tasks, facilitating efficient speaker recognition/identification, speech/music discrimination, and generally speech/non-speech detection-segmentation. Exhaustive qualitative and quantitative evaluation is conducted, including indicative comparison with non-neural approaches. Hierarchical approaches offer classification-similarities for easy adaptation to generic radio-broadcast semantic analysis tasks. The proposed strategy exhibits increased efficiency in radio-programme content segmentation and classification, which is one of the most demanding audio semantics tasks. This strategy can be easily adapted in broader audio detection and classification problems, including additional real-world speech-communication demanding scenarios.