A robust scheme for distributed speech recognition over loss-prone packet channels

  • Authors:
  • Angel M. Gómez;Antonio M. Peinado;Victoria Sánchez;Jose L. Carmona

  • Affiliations:
  • Dept. Teoría de la Señal, Telemática y Comunicaciones, University of Granada, Facultad de Ciencias, Campus de Fuentenueva S/N, Granada 18071, Spain;Dept. Teoría de la Señal, Telemática y Comunicaciones, University of Granada, Facultad de Ciencias, Campus de Fuentenueva S/N, Granada 18071, Spain;Dept. Teoría de la Señal, Telemática y Comunicaciones, University of Granada, Facultad de Ciencias, Campus de Fuentenueva S/N, Granada 18071, Spain;Dept. Teoría de la Señal, Telemática y Comunicaciones, University of Granada, Facultad de Ciencias, Campus de Fuentenueva S/N, Granada 18071, Spain

  • Venue:
  • Speech Communication
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a whole recovery scheme designed to improve robustness against packet losses in distributed speech recognition systems. This scheme integrates two sender-driven techniques, namely, media-specific forward error correction (FEC) and frame interleaving, along with a receiver-based error concealment (EC) technique, the weighted Viterbi algorithm (WVA). Although these techniques have been already tested separately, providing a significant increase of performance in clean acoustic environments, in this paper they are jointly applied and their performance in adverse acoustic conditions is evaluated. In particular, a noisy speech database and the ETSI Advanced Front-end are used, while the dynamic features, which play an important role in adverse acoustic environments, and their confidences for the WVA algorithm are examined. In order to solve the issue of mixing two sender-driven techniques (both causing a delay) whose direct composition causes an increase of the global latency, we propose a double stream scheme which limits the latency to the maximum delay of both techniques. As a result, with very few overhead bits and a very limited delay, the integrated scheme achieves a significant improvement in the performance of a DSR system over a degraded transmission channel, both in clean and noisy acoustic conditions.