Third-party error detection support mechanisms for dictation speech recognition

Authors:
Lina Zhou;Yongmei Shi;Andrew Sears
Affiliations:
Department of Information Systems, UMBC, Baltimore, MD 21250, USA;Tetherless World Constellation, Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY 12180, USA;Department of Information Systems, UMBC, Baltimore, MD 21250, USA
Venue:
Interacting with Computers
Year:
2010

Abstract

Although speech recognition has improved significantly in recent years, its adoption continues to be limited, in part, by the effort and frustration associated with correcting speech recognition errors. Error detection is particularly challenging in third-party error correction, where different individuals are responsible for the original dictation and for correcting the resulting text. This research aims to address the difficulty of third-party error detection by developing and evaluating a variety of support mechanisms. Drawing on a growing body of literature on human-computer interaction and speech recognition, four support mechanisms were designed and evaluated: indexed audio, speech summarization, error prediction, and the presentation of alternative hypotheses. A user study assessed the impact of these support mechanisms on both performance and perceptions during error detection tasks. Performance measures included effectiveness and efficiency, and perception measures included confidence, perceived usefulness, and cognitive workload. The results provide strong support for the use of indexed audio in the context of third-party error detection. The results also confirm that the consecutive error rate, defined as the percentage of recognition errors immediately adjacent to another error, has a negative impact on the effectiveness of third-party error detection. The other support mechanisms failed to improve either effectiveness or perceptions, but they did counteract the negative impact of an increasing consecutive error rate. These findings have significant implications for speech recognition error detection research and the design of error detection support solutions.
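
The consecutive error rate mentioned in the abstract can be illustrated with a minimal sketch. The abstract gives only the verbal definition (the percentage of recognition errors immediately adjacent to another error), so the function name `consecutive_error_rate`, the per-word boolean input, and the choice to return 0.0 when there are no errors are all assumptions made for illustration, not the authors' implementation.

```python
from typing import Sequence


def consecutive_error_rate(is_error: Sequence[bool]) -> float:
    """Percentage of errors that sit next to at least one other error.

    `is_error[i]` is True when the i-th recognized word is an error.
    This input representation is a hypothetical choice for the sketch.
    """
    n = len(is_error)
    error_positions = [i for i, flag in enumerate(is_error) if flag]
    if not error_positions:
        # Assumption: with no errors at all, report a rate of 0.0.
        return 0.0

    # Count errors whose left or right neighbor is also an error.
    adjacent = sum(
        1
        for i in error_positions
        if (i > 0 and is_error[i - 1]) or (i + 1 < n and is_error[i + 1])
    )
    return 100.0 * adjacent / len(error_positions)


# Errors at positions 1, 2, and 4; only positions 1 and 2 are adjacent.
flags = [False, True, True, False, True, False]
rate = consecutive_error_rate(flags)
print(f"{rate:.1f}%")
```

In this example two of the three errors touch another error, so the rate is two thirds of 100. Word-level alignment between the transcript and a reference text, which would be needed to produce the error flags in practice, is outside the scope of the sketch.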