Building searchable collections of enterprise speech data

Authors:
James W. Cooper;Mahesh Viswanathan;Donna Byron;Margaret Chan
Affiliations:
-;-;-;-
Venue:
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Year:
2001

Citing 9
Cited 1

Automatic condensation of electronic publications by sentence selection

Information Processing and Management: an International Journal - Special issue: summarizing text
Interactive term suggestion for users of digital libraries: using subject thesauri and co-occurrence lists for information retrieval

Proceedings of the first ACM international conference on Digital libraries
Query expansion using local and global document analysis

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Lexical navigation: visually prompted query expansion and refinement

DL '97 Proceedings of the second ACM international conference on Digital libraries
Effects of out of vocabulary words in spoken document retrieval (poster session)

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
OBIWAN - A Visual Interface for Prompted Query Refinement

HICSS '98 Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences - Volume 2
Anti-Serendipity: Finding Useless Documents and Similar Documents

HICSS '00 Proceedings of the 33rd Hawaii International Conference on System Sciences-Volume 3 - Volume 3
Retrieval from Spoken Documents Using Content and Speaker Information

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
JavaServer Pages

JavaServer Pages

Extracting Keyphrases from Spoken Audio Documents

Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].

Quantified Score

Hi-index	0.00

Visualization

Abstract

We have applied speech recognition and text-mining technologies to a set of recorded outbound marketing calls and analyzed the results. Since speaker-independent speech recognition technology results in a significantly lower recognition rate than that found when the recognizer is trained for a particular speaker, we applied a number of post-processing algorithms to the output of the recognizer to render it suitable for the Textract text mining system. We indexed the call transcripts using a search engine and used Textract and associated Java technologies to place the relevant terms for each document in a relational database. Following a search query, we generated a thumbnail display of the results of each call with the salient terms highlighted. We illustrate these results and discuss their utility. We took the results of these experiments and continued this analysis on a set of talks and presentations.We describe a distinct document genre based on the note-taking concept of document content, and propose a significant new method for measuring speech recognition accuracy. This procedure is generally relevant to the problem of capturing meetings and talks and providing a searchable index of these presentations on the web.