Automatic generation of concise summaries of spoken dialogues in unrestricted domains

Authors:
Klaus Zechner
Affiliations:
Carnegie Mellon Univ., Pittsburgh, PA
Venue:
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2001

Citing 14
Cited 14

C4.5: programs for machine learning

C4.5: programs for machine learning
Some advances in transformation-based part of speech tagging

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
A trainable document summarizer

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
The use of MMR, diversity-based reranking for reordering documents and producing summaries

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
SCAN: designing and evaluating user interfaces to support retrieval from speech archives

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Advances in Automatic Text Summarization

Advances in Automatic Text Summarization
Verbmobil - Translation of Face-To-Face Dialogs

Grundlagen und Anwendungen der Künstlichen Intelligenz, 17. Fachtagung für Künstliche Intelligenz, Humboldt-Universität zu
Dialogue act modeling for automatic tagging and recognition of conversational speech

Computational Linguistics
TextTiling: segmenting text into multi-paragraph subtopic passages

Computational Linguistics
Minimizing word error rate in textual summaries of spoken language

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
High performance segmentation of spontaneous speech using part of speech and trigger word information

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Intonational boundaries, speech repairs and discourse markers: modeling spoken dialog

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
DiaSumm: flexible summarization of spontaneous dialogues in unrestricted domains

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Summarizing multilingual spoken negotiation dialogues

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics

Automatic summarization of voicemail messages using lexical and prosodic features

ACM Transactions on Speech and Language Processing (TSLP)
Sharing problems and solutions for machine translation of spoken and written interaction

S2S '02 Proceedings of the ACL-02 workshop on Speech-to-speech translation: algorithms and systems - Volume 7
An interactive speech interface for summarizing agile project planning meetings

CHI '06 Extended Abstracts on Human Factors in Computing Systems
Speech segmentation without speech recognition

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Digesting virtual "geek" culture: the summarization of technical internet relay chats

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A statistical approach to automatic speech summarization

EURASIP Journal on Applied Signal Processing
Automatic summarising: The state of the art

Information Processing and Management: an International Journal
Automatic summarization of English broadcast news speech

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Finding question-answer pairs from online forums

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A feature based approach to leveraging context for classifying newsgroup style discussion segments

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Museli: a multi-source evidence integration approach to topic segmentation of spontaneous dialogue

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Summarizing speech without text using hidden Markov models

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Learning from the report-writing behavior of individuals

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
IMASS: an intelligent microblog analysis and summarization system

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic summarization of open domain spoken dialogues is a new research area. This paper introduces the task, the challenges involved, and presents an approach to obtain automatic extract summaries for multi-party dialogues of four different genres, without any restriction on domain. We address the following issues which are intrinsic to spoken dialogue summarization and typically can be ignored when summarizing written text such as newswire data: (i) detection and removal of speech disfluencies; (ii) detection and insertion of sentence boundaries; (iii) detection and linking of cross-speaker information units (question-answer pairs). A global system evaluation using a corpus of 23 relevance annotated dialogues containing 80 topical segments shows that for the two more informal genres, our summarization system using dialogue specific components significantly outperforms a baseline using TFIDF term weighting with maximum marginal relevance ranking (MMR).