Recent Progress in Corpus-Based Spontaneous Speech Recognition

Authors:
Sadaoki Furui
Affiliations:
The author is with Tokyo Institute of Technology, Tokyo, 152-8552 Japan. E-mail: furui@cs.titech.ac.jp
Venue:
IEICE - Transactions on Information and Systems
Year:
2005

Citing 0
Cited 4

Towards a taxonomy of error-handling strategies in recognition-based multi-modal human-computer interfaces

Signal Processing - Special section: Multimodal human-computer interfaces
Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Simultaneous translation of lectures and speeches

Machine Translation
Speech recognition approach: desktop items activation with comparative analysis

ACS'10 Proceedings of the 10th WSEAS international conference on Applied computer science

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance for spontaneous speech. For this purpose, it is necessary to build large spontaneous speech corpora for constructing acoustic and language models. This paper focuses on various achievements of a Japanese 5-year national project "Spontaneous Speech: Corpus and Processing Technology" that has recently been completed. Because of various spontaneous-speech specific phenomena, such as filled pauses, repairs, hesitations, repetitions and disfluencies, recognition of spontaneous speech requires various new techniques. These new techniques include flexible acoustic modeling, sentence boundary detection, pronunciation modeling, acoustic as well as language model adaptation, and automatic summarization. Particularly automatic summarization including indexing, a process which extracts important and reliable parts of the automatic transcription, is expected to play an important role in building various speech archives, speech-based information retrieval systems, and human-computer dialogue systems.