A Fast Anchor Person Searching Scheme in News Sequences
AVBPA '01 Proceedings of the Third International Conference on Audio- and Video-Based Biometric Person Authentication
A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording
PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Structuring broadcast audio for information access
EURASIP Journal on Applied Signal Processing
IEEE Transactions on Audio, Speech, and Language Processing
Proceedings of the 2005 joint Chinese-German conference on Cognitive systems
Toward a sound analysis system for telemedicine
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Hi-index | 0.00 |
We report on progress in acoustic modelling and preprocessing in our Broadcast News transcription system. We have gone back to basics in acoustic modelling, and re-examined some of our standard practices, in particular the use of IMELDA and frequency warping, in the context of the Broadcast News corpus. We also report on some preliminary experiments with a generalization of IMELDA, "semi-tied covariances". In combination, these improvements lead to a 3.5% absolute improvement over our eval97 models. We also describe our attempts to fix our rather primitive, silence-based preprocessing system, including initial results using a new speaker-change detection algorithm based on Hotelling's T/sup 2/-test.