Using Stemming Algorithms on a Grid Environment

  • Authors:
  • Valeriana G. Roncero;Myrian C. Costa;Nelson F. Ebecken

  • Affiliations:
  • COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, Brazil 21945-970;COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, Brazil 21945-970;COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, Brazil 21945-970

  • Venue:
  • High Performance Computing for Computational Science - VECPAR 2008
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Stemming algorithms are commonly used in Information Retrieval with the goal of reducing the number of the words which are in the same morpho-logical variant in a common representation. Stemming analysis is one of the tasks of the pre-processing phase on text mining that consumes a lot of time. This study proposes a model of distributed stemming analysis on a grid environment to reduce the stemming processing time; this speeds up the text preparation. This model can be integrated into grid-based text mining tool, helping to improve the overall performance of the text mining process.