ETL ensembles for chunking, NER and SRL

  • Authors:
  • Cícero N. dos Santos;Ruy L. Milidiú;Carlos E. M. Crestana;Eraldo R. Fernandes

  • Affiliations:
  • Mestrado em Informática Aplicada – MIA, Universidade de Fortaleza – UNIFOR, Fortaleza, Brazil;Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro – PUC-Rio, Rio de Janeiro, Brazil;Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro – PUC-Rio, Rio de Janeiro, Brazil;Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro – PUC-Rio, Rio de Janeiro, Brazil

  • Venue:
  • CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a new ensemble method that uses Entropy Guided Transformation Learning (ETL) as the base learner. The proposed approach, ETL Committee, combines the main ideas of Bagging and Random Subspaces. We also propose a strategy to include redundancy in transformation-based models. To evaluate the effectiveness of the ensemble method, we apply it to three Natural Language Processing tasks: Text Chunking, Named Entity Recognition and Semantic Role Labeling. Our experimental findings indicate that ETL Committee significantly outperforms single ETL models, achieving state-of-the-art competitive results. Some positive characteristics of the proposed ensemble strategy are worth to mention. First, it improves the ETL effectiveness without any additional human effort. Second, it is particularly useful when dealing with very complex tasks that use large feature sets. And finally, the resulting training and classification processes are very easy to parallelize.