The CLEAR 2006 evaluation

  • Authors:
  • Rainer Stiefelhagen; Keni Bernardin; Rachel Bowers; John Garofolo; Djamel Mostefa; Padmanabhan Soundararajan

  • Affiliations:
  • Interactive Systems Lab, Universität Karlsruhe, Karlsruhe, Germany;Interactive Systems Lab, Universität Karlsruhe, Karlsruhe, Germany;National Institute of Standards and Technology, Information Technology Lab, Information Access Division, Speech Group;National Institute of Standards and Technology, Information Technology Lab, Information Access Division, Speech Group;Evaluations and Language Resources Distribution Agency, Paris, France;Computer Science and Engineering, University of South Florida, Tampa, FL

  • Venue:
  • CLEAR'06 Proceedings of the 1st international evaluation conference on Classification of events, activities and relationships
  • Year:
  • 2006

Abstract

This paper summarizes the first CLEAR evaluation (CLassification of Events, Activities and Relationships), which took place in early 2006 and concluded with a two-day evaluation workshop in April 2006. CLEAR is an international effort to evaluate systems for the multimodal perception of people, their activities and interactions. It provides a new international evaluation framework for such technologies. It aims to support the definition of common evaluation tasks and metrics, to coordinate and leverage the production of the necessary multimodal corpora, and to enable the comparison of different algorithms and approaches on common benchmarks, which will result in faster progress in the research community. This paper describes the evaluation tasks conducted in CLEAR 2006, including the metrics and databases used, and provides an overview of the results. The evaluation tasks included person tracking, face detection and tracking, person identification, head pose estimation, vehicle tracking, and acoustic scene analysis. Overall, more than 20 subtasks were conducted, covering acoustic, visual and audio-visual analysis for many of the main tasks, as well as different data domains and evaluation conditions.