Selecting a feature set to summarize texts in brazilian portuguese

  • Authors:
  • Daniel Saraiva Leite;Lucia Helena Machado Rino

  • Affiliations:
  • Departamento de Computação, UFSCar Núcleo Interinstitucional de Lingüística Computacional, São Carlos, SP, Brazil;Departamento de Computação, UFSCar Núcleo Interinstitucional de Lingüística Computacional, São Carlos, SP, Brazil

  • Venue:
  • IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a novel approach to combining features for training an automatic extractive summarizer of texts written in Brazilian Portuguese. The approach aims at both diminishing the effort of classifying features that are representative for Automatic Summarization and providing more informativeness for the summarizer to decide which text spans to include in an extract. Finding a balanced set of features is explored through WEKA. We discuss several ways of modifying the feature set and show how automatic feature selection may be useful for customizing the summarizer.