A new approach to automatic speech summarization

Authors:
C. Hori;S. Furui
Affiliations:
Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan;-
Venue:
IEEE Transactions on Multimedia
Year:
2003

Citing 0
Cited 17

Time is of the essence: an evaluation of temporal compression algorithms

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Accessing speech data using strategic fixation

Computer Speech and Language
A statistical approach to automatic speech summarization

EURASIP Journal on Applied Signal Processing
Access to recorded interviews: A research agenda

Journal on Computing and Cultural Heritage (JOCCH)
Scalable summaries of spoken conversations

Proceedings of the 13th international conference on Intelligent user interfaces
Comparing the roles of textual, acoustic and spoken-language features on spontaneous-conversation summarization

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Speech summarization without lexical features for Mandarin broadcast news

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Automatic generation of information-seeking questions using concept clusters

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
A syntax-free approach to Japanese sentence compression

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
A video digest and delivery system: "ChocoParaTV"

Proceedings of the 2007 conference on Human interface: Part I
A two-stage speech activity detection system considering fractal aspects of prosody

Pattern Recognition Letters
Relevant document retrieval using a spoken document

ISCIT'09 Proceedings of the 9th international conference on Communications and information technologies
Using confusion networks for speech summarization

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Learning to model domain-specific utterance sequences for extractive summarization of contact center dialogues

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Imposing hierarchical browsing structures onto spoken documents

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A normalized-cut alignment model for mapping hierarchical semantic structures onto spoken documents

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Active learning with semi-automatic annotation for extractive speech summarization

ACM Transactions on Speech and Language Processing (TSLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a new automatic speech summarization method. In this method, a set of words maximizing a summarization score is extracted from automatically transcribed speech. This extraction is performed according to a target compression ratio using a dynamic programming (DP) technique. The extracted set of words is then connected to build a summarization sentence. The summarization score consists of a word significance measure, a confidence measure, linguistic likelihood, and a word concatenation probability. The word concatenation score is determined by a dependency structure in the original speech given by stochastic dependency context free grammar (SDCFG). Japanese broadcast news speech transcribed using a large-vocabulary continuous-speech recognition (LVCSR) system is summarized using our proposed method and compared with manual summarization by human subjects. The manual summarization results are combined to build a word network. This word network is used to calculate the word accuracy of each automatic summarization result using the most similar word string in the network. Experimental results show that the proposed method effectively extracts relatively important information by removing redundant and irrelevant information.