Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR, encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition, and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight into how these fields are integrated to support research and development, thus addressing the core challenges of SCR.