Efficient stemmer generation

  • Authors: Brian Fox; Christopher J. Fox

  • Affiliations: Department of Computer Science, James Madison University, MSC 4103, Harrisonburg, VA (both authors)

  • Venue: Information Processing and Management: an International Journal
  • Year: 2002

Abstract

This paper presents an algorithm for generating stemmers from textual stemmer specification files. A small study shows that the generated stemmers are computationally efficient, often running faster than stemmers custom-written to implement particular stemming algorithms. The specification files are easily written and modified by non-programmers, so creating a stemmer or tuning its performance is much easier than it would be with a custom stemmer program. Stemmer generation is thus also human-resource efficient.
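
To make the idea of generating a stemmer from a text specification concrete, here is a minimal sketch in Python. It is not the paper's algorithm or its file format, neither of which is given in the abstract. The rule syntax (`suffix -> replacement`), the names `SPEC`, `parse_spec`, and `generate_stemmer`, and the `min_stem` parameter are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

# Hypothetical specification text (NOT the paper's format):
# one rule per line, "suffix -> replacement"; "#" starts a comment;
# an empty replacement simply strips the suffix.
SPEC = """
# illustrative English endings only
sses -> ss
ies  -> y
ing  ->
ed   ->
s    ->
"""


def parse_spec(text: str) -> List[Tuple[str, str]]:
    """Turn specification text into a list of (suffix, replacement) rules."""
    rules = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        suffix, _, replacement = line.partition("->")
        rules.append((suffix.strip(), replacement.strip()))
    return rules


def generate_stemmer(text: str, min_stem: int = 3) -> Callable[[str], str]:
    """Build a stemming function from specification text.

    Assumption: the longest matching suffix wins, and a rule applies only
    if at least `min_stem` characters remain before the suffix.
    """
    rules = sorted(parse_spec(text), key=lambda r: len(r[0]), reverse=True)

    def stem(word: str) -> str:
        for suffix, replacement in rules:
            if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
                return word[: len(word) - len(suffix)] + replacement
        return word  # no rule applied

    return stem


stem = generate_stemmer(SPEC)
print([stem(w) for w in ["classes", "ponies", "walking", "jumped", "cats", "sing"]])
```

In this sketch, changing the stemmer's behavior means editing the lines of `SPEC` rather than the program, which is the kind of tuning by non-programmers that the abstract describes. A linear scan over the rules is used here only for brevity; the abstract does not say how the generated stemmers achieve their speed.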