Automatic summarization of open-domain multiparty dialogues in diverse genres

Authors:
Klaus Zechner
Affiliations:
Educational Testing Service, Rosedale Road MS 11-R, Princeton, NJ
Venue:
Computational Linguistics - Summarization
Year:
2002

Abstract

Automatic summarization of open-domain spoken dialogues is a relatively new research area. This article introduces the task and the challenges involved and motivates and presents an approach for obtaining automatic-extract summaries for human transcripts of multiparty dialogues of four different genres, without any restriction on domain.

We address the following issues, which are intrinsic to spoken-dialogue summarization and typically can be ignored when summarizing written text such as news wire data: (1) detection and removal of speech disfluencies; (2) detection and insertion of sentence boundaries; and (3) detection and linking of cross-speaker information units (question-answer pairs).

A system evaluation is performed using a corpus of 23 dialogue excerpts with an average duration of about 10 minutes, comprising 80 topical segments and about 47,000 words total. The corpus was manually annotated for relevant text spans by six human annotators. The global evaluation shows that for the two more informal genres, our summarization system using dialogue-specific components significantly outperforms two baselines: (1) a maximum-marginal-relevance ranking algorithm using TF*IDF term weighting, and (2) a LEAD baseline that extracts the first n words from a text.
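
The two baselines named in the abstract can be illustrated with a minimal Python sketch. This is not the article's implementation: the function names, the smoothed IDF formula, the use of the summed segment vectors as the relevance query, and the default trade-off weight `lam` are all assumptions made for illustration only.

```python
import math
from collections import Counter


def lead_baseline(words, n):
    """LEAD baseline: keep the first n words of the text."""
    return words[:n]


def tfidf_vectors(segments):
    """Sparse TF*IDF vectors (dicts) for a list of tokenized segments.

    Assumption: IDF is smoothed as log(1 + N / df) so it stays positive.
    """
    n_segments = len(segments)
    df = Counter(term for seg in segments for term in set(seg))
    vectors = []
    for seg in segments:
        tf = Counter(seg)
        vectors.append(
            {t: c * math.log(1 + n_segments / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_rank(segments, k, lam=0.7):
    """Return indices of up to k segments chosen by maximum marginal relevance.

    Assumption: relevance is measured against the sum of all segment
    vectors, since no external query is available in this sketch.
    """
    vectors = tfidf_vectors(segments)
    query = Counter()
    for vec in vectors:
        for term, weight in vec.items():
            query[term] += weight

    selected = []
    candidates = list(range(len(segments)))
    while candidates and len(selected) < k:

        def score(i):
            relevance = cosine(vectors[i], query)
            # Redundancy: highest similarity to anything already selected.
            redundancy = max(
                (cosine(vectors[i], vectors[j]) for j in selected), default=0.0
            )
            return lam * relevance - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

A call such as `mmr_rank(segments, k=2, lam=0.5)` returns segment indices in selection order. Lowering `lam` puts more weight on avoiding redundancy with segments already selected, while `lam=1.0` reduces the ranking to plain relevance ordering.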