Interaction techniques using prosodic features of speech and audio localization

  • Authors: Alex Olwal; Steven Feiner
  • Affiliations: Columbia University, New York, NY and Royal Institute of Technology, Stockholm, Sweden; Columbia University, New York, NY
  • Venue: Proceedings of the 10th International Conference on Intelligent User Interfaces
  • Year: 2005

Abstract

We describe several approaches for using prosodic features of speech and audio localization to control interactive applications. This information can be applied to parameter control, as well as to speech disambiguation. We discuss how characteristics of spoken sentences can be exploited in the user interface; for example, by considering the speed with which a sentence is spoken and the presence of extraneous utterances. We also show how coarse audio localization can be used for low-fidelity gesture tracking, by inferring the speaker's head position.
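As an illustration of the parameter-control idea, the following minimal sketch maps the rate at which a sentence is spoken onto a normalized control value. It is not the authors' implementation: the function names, the words-per-second measure, the `slow` and `fast` thresholds, and the linear mapping are all assumptions chosen for the example.

```python
def speaking_rate(word_count: int, duration_s: float) -> float:
    """Return words per second for one recognized sentence.

    Assumes a recognizer has already supplied the word count and the
    duration of the utterance (both hypothetical inputs here).
    """
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return word_count / duration_s


def rate_to_parameter(rate: float, slow: float = 1.0, fast: float = 4.0) -> float:
    """Linearly map a speaking rate onto [0, 1].

    Rates at or below `slow` give 0.0 and rates at or above `fast` give 1.0.
    The thresholds are illustrative assumptions, not measured values.
    """
    if fast <= slow:
        raise ValueError("fast must be greater than slow")
    t = (rate - slow) / (fast - slow)
    return max(0.0, min(1.0, t))


if __name__ == "__main__":
    # A five-word command spoken in two seconds.
    rate = speaking_rate(5, 2.0)
    print(rate, rate_to_parameter(rate))
```

An application could use the resulting value to scale a command's effect, for instance how far an object moves; in practice the thresholds would likely need per-user calibration.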