The statistical significance of the MUC-5 results

Authors:
Nancy Chinchor
Affiliations:
Science Applications International Corporation, San Diego, CA
Venue:
MUC5 '93 Proceedings of the 5th conference on Message understanding
Year:
1993

Citing 2
Cited 4

Evaluating message understanding systems: an analysis of the third message understanding conference (MUC-3)

Computational Linguistics
The statistical significance of the MUC-4 results

MUC4 '92 Proceedings of the 4th conference on Message understanding

Can We Make Information Extraction More Adaptive?

Information Extraction: Towards Scalable, Adaptable Systems
Focusing on scenario recognition in information extraction

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Tipster/MUC-5 information extraction system evaluation

TIPSTER '93 Proceedings of a workshop on held at Fredericksburg, Virginia: September 19-23, 1993
Better hypothesis testing for statistical machine translation: controlling for optimizer instability

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

The statistical significance of the results of the MUC-5 evaluation is determined using a computer-intensive method of hypothesis testing known as approximate randomization. The exact method is described in detail in [1] and [2] and has been used as the accepted statistical test for the MUC results since MUC-3. The purpose of the statistical testing is to determine whether the scores of the systems are different by chance or due to a significant difference in the character of the systems.