Summarizing speech without text using hidden Markov models

Authors:
Sameer Maskey; Julia Hirschberg
Affiliations:
Columbia University, New York, NY; Columbia University, New York, NY
Venue:
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Year:
2006

Abstract

We present a Hidden Markov Model framework for summarizing speech documents without using any transcript or text. The hidden states in the model represent whether or not a sentence is to be included in the summary, and the acoustic/prosodic features are the observation vectors. The model predicts the optimal sequence of segments that best summarizes the document. We evaluate our method by comparing the predicted summary with one generated by a human summarizer. Our results indicate that we can generate 'good' summaries even when using only acoustic/prosodic information, which points toward the possibility of text-independent summarization for spoken documents.
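
The decoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a two-state HMM (0 = exclude, 1 = include), diagonal-covariance Gaussian emissions, and standard Viterbi decoding, none of which the abstract specifies. The feature vectors, the parameter values, and the names `log_gaussian` and `viterbi` are hypothetical and chosen only for illustration.

```python
import math


def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def viterbi(observations, log_start, log_trans, means, variances):
    """Return the most likely state sequence (0 = exclude, 1 = include)."""
    n_states = len(log_start)
    # scores[s]: best log-probability of any path ending in state s so far.
    scores = [
        log_start[s] + log_gaussian(observations[0], means[s], variances[s])
        for s in range(n_states)
    ]
    backpointers = []
    for obs in observations[1:]:
        new_scores, pointers = [], []
        for s in range(n_states):
            # Best predecessor state for a path that ends in state s.
            best_prev = max(
                range(n_states), key=lambda p: scores[p] + log_trans[p][s]
            )
            pointers.append(best_prev)
            new_scores.append(
                scores[best_prev]
                + log_trans[best_prev][s]
                + log_gaussian(obs, means[s], variances[s])
            )
        scores = new_scores
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: scores[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical toy parameters; a real system would estimate them from
# sentences labeled as in or out of a human-written summary.
log_start = [math.log(0.5), math.log(0.5)]
log_trans = [
    [math.log(0.8), math.log(0.2)],  # transitions out of "exclude"
    [math.log(0.4), math.log(0.6)],  # transitions out of "include"
]
means = [[0.0, 0.0], [2.0, 2.0]]      # per-state feature means
variances = [[1.0, 1.0], [1.0, 1.0]]  # per-state feature variances

# One made-up two-dimensional acoustic/prosodic vector per sentence.
observations = [[0.1, -0.2], [2.1, 1.9], [1.8, 2.2], [-0.1, 0.3]]

path = viterbi(observations, log_start, log_trans, means, variances)
print(path)  # [0, 1, 1, 0]
```

With these toy values the second and third sentences are selected for the summary. Working in log space avoids numerical underflow on long documents; the emission model is the part most likely to differ in a real system.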