Contextual correlates of synonymy
Communications of the ACM
An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Extracting Keyphrases from Spoken Audio Documents
Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
A Probabilistic Approach to Confidence Estimation and Evaluation
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Confidence Measures for Spontaneous Speech Recognition
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Passage retrieval vs. document retrieval for factoid question answering
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Applying the Naïve Bayes Classifier to Assist Users in Detecting Speech Recognition Errors
HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences - Volume 07
Confidence measures for the SWITCHBOARD database
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Extended gloss overlaps as a measure of semantic relatedness
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Applications of corpus-based semantic similarity and word segmentation to database schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
A Hybrid Approach to Improving Automatic Speech Recognition Via NLP
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Correction of medical handwriting OCR based on semantic similarity
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Combination of error detection techniques in automatic speech transcription
AIS'11 Proceedings of the Second international conference on Autonomous and intelligent systems
Using various indexing schemes and multiple translations in the CL-SR task at CLEF 2005
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
A novel voting scheme for ROVER using automatic error detection
AIS'12 Proceedings of the Third international conference on Autonomous and Intelligent Systems
Measuring contextual fitness using error contexts extracted from the Wikipedia revision history
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Detection of semantic errors in Arabic texts
Artificial Intelligence
Hi-index | 0.00 |
Browsing through large volumes of spoken audio is known to be a challenging task for end users. One way to alleviate this problem is to allow users to gist a spoken audio document by glancing over a transcript generated through Automatic Speech Recognition. Unfortunately, such transcripts typically contain many recognition errors which are highly distracting and make gisting more difficult. In this paper we present an approach that detects recognition errors by identifying words which are semantic outliers with respect to other words in the transcript. We describe several variants of this approach. We investigate a wide range of evaluation measures and we show that we can significantly reduce the number of errors in content words, with the trade-off of losing some good content words.