Robust distributed speech recognition in noise and packet loss conditions

Authors:
Ronan Flynn;Edward Jones
Affiliations:
Department of Electronic Engineering, Athlone Institute of Technology, Ireland;Department of Electronic Engineering, National University of Ireland, Galway, Ireland
Venue:
Digital Signal Processing
Year:
2010

Citing 5
Cited 2

A comparison of front-ends for bitstream-based ASR over IP

Signal Processing
Combined speech enhancement and auditory modelling for robust distributed speech recognition

Speech Communication
Robust speech recognition over mobile and IP networks in burst-like packet loss

IEEE Transactions on Audio, Speech, and Language Processing
Recognizing voice over IP: a robust front-end for speechrecognition on the world wide web

IEEE Transactions on Multimedia
Combining Media-Specific FEC and Error Concealment for Robust Distributed Speech Recognition Over Loss-Prone Packet Channels

IEEE Transactions on Multimedia

Web-based remote voice control of robotized cells

Robotics and Computer-Integrated Manufacturing
Low bit rate compression methods of feature vectors for distributed speech recognition

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper examines the performance of a Distributed Speech Recognition (DSR) system in the presence of both background noise and packet loss. Recognition performance is examined for feature vectors extracted from speech using a physiologically-based auditory model, as an alternative to the more commonly-used Mel Frequency Cepstral Coefficient (MFCC) front-end. The feature vectors produced by the auditory model are vector quantised and combined in pairs for transmission over a statistically modelled channel that is subject to packet burst loss. In order to improve recognition performance in the presence of noise, the speech is enhanced prior to feature extraction using Wiener filtering. Packet loss mitigation to compensate for missing features is also used to further improve performance. Speech recognition results show the benefit of combining speech enhancement and packet loss mitigation to compensate for channel and environmental degradations.