A systematic comparison of feature-rich probabilistic classifiers for NER tasks

  • Authors:
  • Benjamin Rosenfeld;Moshe Fresko;Ronen Feldman

  • Affiliations:
  • Computer Science Department, Data-Mining Lab., Bar-Ilan University, Ramat-Gan, Israel;Computer Science Department, Data-Mining Lab., Bar-Ilan University, Ramat-Gan, Israel;Computer Science Department, Data-Mining Lab., Bar-Ilan University, Ramat-Gan, Israel

  • Venue:
  • PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the CoNLL 2003 NER shared task, more than two thirds of the submitted systems used the feature-rich representation of the task. Most of them used maximum entropy to combine the features together. Others used linear classifiers, such as SVM and RRM. Among all systems presented there, one of the MEMM-based classifiers took the second place, losing only to a committee of four different classifiers, one of which was ME-based and another RRM-based. The lone RRM was fourth, and CRF came in the middle of the pack. In this paper we shall demonstrate, by running the three algorithms upon the same tasks under exactly the same conditions that this ranking is due to feature selection and other causes and not due to the inherent qualities of the algorithms, which should be ranked otherwise.