Putting people first: specifying proper names in speech interfaces

  • Authors:
  • Matt Marx;Chris Schmandt

  • Affiliations:
  • Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA;Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA

  • Venue:
  • UIST '94 Proceedings of the 7th annual ACM symposium on User interface software and technology
  • Year:
  • 1994

Quantified Score

Hi-index 0.00

Visualization

Abstract

Communication is about people, not machines. But as firms and families alike spread out geographically, we rely increasingly on telecommunications tools to keep us “connected”. The challenge of such systems is to enable conversation between individuals without computational infrastructure getting in the way. This paper compares two speech-based communication systems, Phoneshell and Chatter, in how they deal with the keys to communication: proper names. Chatter, a conversational system using speech-recognition, improves upon the hierarchical nature of the touch-tone based Phoneshell by maintaining context and enabling use of anaphora. Proper names can present particular problems for speech recognizers, so an interface algorithm for reliable name specification by spelling is offered. Since individual letter recognition is non-robust, Chatter implicitly disambiguates strings of letters based on context. We hypothesize that the right interface can make faulty speech recognition as usable as TouchTones—even more so.