Application of association rules mining to Named Entity Recognition and co-reference resolution for the Indonesian language

  • Authors:
  • Indra Budi;Stephane Bressan

  • Affiliations:
  • Faculty of Computer Science, University of Indonesia, Indonesia.;School of Computing, National University of Singapore, Singapore

  • Venue:
  • International Journal of Business Intelligence and Data Mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a new method, association rules mining for Named Entity Recognition (NER) and co-reference resolution. The method uses several morphological and lexical features such as Pronoun Class (PC) and Name Class (NC), String Similarity (SP) and Position (P) in the text, into a vector of attributes. Applied to a corpus of newspaper in the Indonesian language, the method outperforms state-of-the-art maximum entropy method in name entity recognition and is comparable with state-of-the-art machine learning methods, decision tree, for co-reference resolution.