Audio-based context recognition

  • Authors:
  • A. J. Eronen, V. T. Peltonen, J. T. Tuomi, A. P. Klapuri, S. Fagerlund, T. Sorsa, G. Lorho, J. Huopaniemi

  • Affiliations:
  • Nokia Research Center, Tampere, Finland

  • Venue:
  • IEEE Transactions on Audio, Speech, and Language Processing

  • Year:
  • 2006

Abstract

The aim of this paper is to investigate the feasibility of an audio-based context recognition system. Here, context recognition refers to the automatic classification of the context, or environment, around a device. A system is developed, and its accuracy is compared to that of human listeners performing the same task. Particular emphasis is placed on the computational complexity of the methods, since the application is especially relevant to resource-constrained portable devices. Simple low-dimensional feature vectors are evaluated against more standard spectral features. Using discriminative training, competitive recognition accuracies are achieved with very low-order hidden Markov models (1-3 Gaussian components). A slight improvement in recognition accuracy is observed when linear data-driven feature transformations are applied to mel-cepstral features. The recognition rate of the system as a function of the test sequence length appears to converge only after about 30 to 60 s, although some degree of accuracy is achieved even with test sequences shorter than 1 s. The average reaction time of the human listeners was 14 s, i.e., somewhat shorter than, but of the same order as, the time required by the system. In distinguishing between 24 everyday contexts, the average recognition accuracy of the system was 58%, against 69% obtained in the listening tests. The accuracies in recognizing six high-level classes were 82% for the system and 88% for the human subjects.
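
To make the described pipeline concrete, below is a minimal illustrative sketch, not the authors' implementation: it pairs mel-cepstral (MFCC) features with one low-order Gaussian mixture model per context class, approximating the very low-order HMMs (1-3 Gaussian components) mentioned in the abstract. The librosa and scikit-learn libraries, the training-file layout, and the context names are assumptions made for illustration; the paper's discriminative training and data-driven feature transforms are not reproduced here.

```python
# Hypothetical sketch of audio-based context recognition:
# per-class low-order GMMs over MFCC features (maximum-likelihood
# training, a simplification of the paper's discriminatively
# trained low-order HMMs).
import numpy as np
import librosa
from sklearn.mixture import GaussianMixture

def mfcc_features(path, sr=16000, n_mfcc=13):
    """Load an audio file and return a (frames x n_mfcc) MFCC matrix."""
    y, sr = librosa.load(path, sr=sr, mono=True)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T

def train_models(train_files, n_components=3):
    """Fit one diagonal-covariance GMM (1-3 components) per context.

    train_files: dict mapping a context name (e.g., "street", "car",
    "office"; names are hypothetical) to a list of audio file paths.
    """
    models = {}
    for context, paths in train_files.items():
        feats = np.vstack([mfcc_features(p) for p in paths])
        gmm = GaussianMixture(n_components=n_components,
                              covariance_type="diag")
        models[context] = gmm.fit(feats)
    return models

def classify(models, path):
    """Pick the context whose model gives the highest average
    per-frame log-likelihood. Longer test sequences accumulate more
    evidence, consistent with accuracy converging over 30-60 s."""
    feats = mfcc_features(path)
    return max(models, key=lambda c: models[c].score(feats))
```

A per-class GMM is equivalent to a single-state HMM, which is why very low model orders can remain competitive here: environmental contexts are largely characterized by their long-term spectral statistics rather than by fine temporal structure.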