Progress in Broadcast News transcription at Dragon Systems

Authors:
S. Wegmann;Puming Zhan;L. Gillick
Affiliations:
Dragon Syst. Inc., Newton, MA, USA;-;-
Venue:
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Year:
1999

Citing 0
Cited 6

A Fast Anchor Person Searching Scheme in News Sequences

AVBPA '01 Proceedings of the Third International Conference on Audio- and Video-Based Biometric Person Authentication
A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Structuring broadcast audio for information access

EURASIP Journal on Applied Signal Processing
Time-frequency correlation-based missing-feature reconstruction for robust speech recognition in band-restricted conditions

IEEE Transactions on Audio, Speech, and Language Processing
Access to content

Proceedings of the 2005 joint Chinese-German conference on Cognitive systems
Toward a sound analysis system for telemedicine

FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

We report on progress in acoustic modelling and preprocessing in our Broadcast News transcription system. We have gone back to basics in acoustic modelling, and re-examined some of our standard practices, in particular the use of IMELDA and frequency warping, in the context of the Broadcast News corpus. We also report on some preliminary experiments with a generalization of IMELDA, "semi-tied covariances". In combination, these improvements lead to a 3.5% absolute improvement over our eval97 models. We also describe our attempts to fix our rather primitive, silence-based preprocessing system, including initial results using a new speaker-change detection algorithm based on Hotelling's T/sup 2/-test.