Scalable summaries of spoken conversations

Authors:
Sumit Basu;Surabhi Gupta;Milind Mahajan;Patrick Nguyen;John C. Platt
Affiliations:
Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA and Stanford University, Stanford, CA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA
Venue:
Proceedings of the 13th international conference on Intelligent user interfaces
Year:
2008

Citing 11
Cited 3

Statistical Models for Text Segmentation

Machine Learning - Special issue on natural language learning
Auto-summarization of audio-video presentations

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Navigation patterns and usability of zoomable user interfaces with and without an overview

ACM Transactions on Computer-Human Interaction (TOCHI)
Segmenting Conversations by Topic, Initiative, and Style

Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
TextTiling: segmenting text into multi-paragraph subtopic passages

Computational Linguistics
Discourse cues for broadcast news segmentation

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
DiaSumm: flexible summarization of spontaneous dialogues in unrestricted domains

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Automatic summarization of voicemail messages using lexical and prosodic features

ACM Transactions on Speech and Language Processing (TSLP)
Summarizing speech without text using hidden Markov models

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
A new approach to automatic speech summarization

IEEE Transactions on Multimedia

Have a say over what you see: evaluating interactive compression techniques

Proceedings of the 14th international conference on Intelligent user interfaces
Conversation clusters: grouping conversation topics through human-computer dialog

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Social mirrors as social signals: transforming audio into gaphics

IEEE Computer Graphics and Applications - Special issue on creating musical-fountain shows virtual reality for the Digital Olympic Museum

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this work, we present a novel means of browsing recorded audio conversations. The method we develop produces scalable summaries of the recognized speech, in which we can increase the amount of text continuously with the desired level of detail to best fill the available space. We present an interface in which a user can view an entire conversation in one screen, but can also quickly zoom in to see the full transcript; the corresponding audio can be easily played as well. The scaling is achieved via a combination of topic segmentation and informative phrase selection, where the threshold for informativeness decreases with increasing level of detail. Finally, we evaluate our method and interface against a baseline interface with a user study.