Statistical significance of MUC-6 results

  • Authors:
  • Nancy Chinchor

  • Affiliations:
  • Science Applications International Corporation, San Diego, CA

  • Venue:
  • MUC6 '95 Proceedings of the 6th conference on Message understanding
  • Year:
  • 1995

Quantified Score

Hi-index 0.00

Visualization

Abstract

The results of the MUC-6 evaluation must be analyzed to determine whether close scores significantly distinguish systems or whether the differences in those scores are a matter of chance. In order to do such an analysis, a method of computer intensive hypothesis testing was developed by SAIC for the MUC-3 results and has been used for distinguishing MUC scores since that time. The implementation of this method for the MUC evaluations was first described in [1] and later the concepts behind the statistical model were explained in a more understandable manner in [2]. This paper gives the results of the statistical testing for the three MUC-6 tasks where a single metric could be associated with a system's performance.