Expanding the scope of the ATIS task: the ATIS-3 corpus

Authors:
Deborah A. Dahl;Madeleine Bates;Michael Brown;William Fisher;Kate Hunicke-Smith;David Pallett;Christine Pao;Alexander Rudnicky;Elizabeth Shriberg
Affiliations:
Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA;Unisys Corporation, Paoli PA
Venue:
HLT '94 Proceedings of the workshop on Human Language Technology
Year:
1994

Citing 6
Cited 16

Evaluation of spoken language systems: the ATIS domain

HLT '90 Proceedings of the workshop on Speech and Natural Language
The ATIS spoken language systems pilot corpus

HLT '90 Proceedings of the workshop on Speech and Natural Language
Multi-site data collection for a spoken language corpus

HLT '91 Proceedings of the workshop on Speech and Natural Language
Benchmark tests for the DARPA Spoken Language Program

HLT '93 Proceedings of the workshop on Human Language Technology
Multi-site data collection and evaluation in spoken language understanding

HLT '93 Proceedings of the workshop on Human Language Technology
Semantic evaluation for spoken-language systems

HLT '94 Proceedings of the workshop on Human Language Technology

Semiautomatic Acquisition of Semantic Structures for Understanding Domain-Specific Natural Language Queries

IEEE Transactions on Knowledge and Data Engineering
Transparent combination of rule-based and data-driven approaches in a speech understanding architecture

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Portability issues for speech recognition technologies

HLT '01 Proceedings of the first international conference on Human language technology research
Practical issues in compiling typed unification grammars for speech recognition

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
1993 benchmark tests for the ARPA spoken language program

HLT '94 Proceedings of the workshop on Human Language Technology
Recent improvements in the CMU spoken language understanding system

HLT '94 Proceedings of the workshop on Human Language Technology
Speech Processing at BBN

IEEE Annals of the History of Computing
Learning context-dependent mappings from sentences to logical form

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Finding common ground: towards a surface realisation shared task

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Comparing local and sequential models for statistical incremental natural language understanding

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Spoken dialogue system based on information extraction using similarity of predicate argument structures

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Computing logical form on regulatory texts

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Unsupervised concept-to-text generation with hypergraphs

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Towards a computational history of the ACL: 1980-2008

ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
Concept-to-text generation via discriminative reranking

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A global model for concept-to-text generation

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Air Travel Information System (ATIS) domain serves as the common evaluation task for ARPA spoken language system developers. To support this task, the Multi-Site ATIS Data COllection Working group (MADCOW) coordinates data collection activities. This paper describes recent MADCOW activities. In particular, this paper describes the migration of the ATIS task to a richer relational database and development corpus (ATIS-3) and describes the ATIS-3 corpus. The expanded database, which includes information on 46 US and Canadian cities and 23,457 flights, was released in the fall of 1992, and data collection for the ATIS-3 corpus began shortly thereafter. The ATIS-3 corpus now consists of a total of 8297 released training utterances and 3211 utterances reserved for testing, collected at BBN, CMU, MIT, NIST and SRI. 2906 of the training utterances have been annotated with the correct information from the database. This paper describes the ATIS-3 corpus in detail, including breakdowns of data by type (e.g. context-independent, context-dependent, and unevaluable)and variations in the data collected at different sites. This paper also includes a description of the ATIS-3 database. Finally, we discuss future data collection and evaluation plans.