The HCRC Map Task corpus: natural dialogue for speech recognition

  • Authors:
  • Henry S. Thompson;Anne Anderson;Ellen Gurman Bard;Gwyneth Doherty-Sneddon;Alison Newlands;Cathy Sotillo

  • Affiliations:
  • University of Edinburgh, Edinburgh, Scotland;University of Edinburgh, Edinburgh, Scotland;University of Edinburgh, Edinburgh, Scotland;University of Edinburgh, Edinburgh, Scotland;University of Edinburgh, Edinburgh, Scotland;University of Edinburgh, Edinburgh, Scotland

  • Venue:
  • HLT '93 Proceedings of the workshop on Human Language Technology
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

The HCRC Map Task corpus has been collected and transcribed in Glasgow and Edinburgh, and recently published on CD-ROM. This effort was made possible by funding from the British Economic and Social Research Council.The corpus is composed of 128 two-person conversations in both high-quality digital audio and orthographic transcriptions, amounting to 18 hours and 150,000 words respectively.The experimental design is quite detailed and complex, allowing a number of different phonemic, syntactico-semantic and pragmatic contrasts to be explored in a controlled way.The corpus is a uniquely valuable resource for speech recognition research in particular, as we move from developing systems intended for controlled use by familiar users to systems intended for less constrained circumstances and naive or occasional users. Examples supporting this claim are given, including preliminary evidence of the phonetic consequences of second mention and the impact of different styles of referent negotiation on communicative efficacy.