Statistical Models for Text Segmentation
Machine Learning - Special issue on natural language learning
Auto-summarization of audio-video presentations
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Navigation patterns and usability of zoomable user interfaces with and without an overview
ACM Transactions on Computer-Human Interaction (TOCHI)
Segmenting Conversations by Topic, Initiative, and Style
Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
TextTiling: segmenting text into multi-paragraph subtopic passages
Computational Linguistics
Discourse cues for broadcast news segmentation
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
DiaSumm: flexible summarization of spontaneous dialogues in unrestricted domains
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Automatic summarization of voicemail messages using lexical and prosodic features
ACM Transactions on Speech and Language Processing (TSLP)
Summarizing speech without text using hidden Markov models
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
A new approach to automatic speech summarization
IEEE Transactions on Multimedia
Have a say over what you see: evaluating interactive compression techniques
Proceedings of the 14th international conference on Intelligent user interfaces
Conversation clusters: grouping conversation topics through human-computer dialog
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Social mirrors as social signals: transforming audio into gaphics
IEEE Computer Graphics and Applications - Special issue on creating musical-fountain shows virtual reality for the Digital Olympic Museum
Hi-index | 0.01 |
In this work, we present a novel means of browsing recorded audio conversations. The method we develop produces scalable summaries of the recognized speech, in which we can increase the amount of text continuously with the desired level of detail to best fill the available space. We present an interface in which a user can view an entire conversation in one screen, but can also quickly zoom in to see the full transcript; the corresponding audio can be easily played as well. The scaling is achieved via a combination of topic segmentation and informative phrase selection, where the threshold for informativeness decreases with increasing level of detail. Finally, we evaluate our method and interface against a baseline interface with a user study.