Evolving Lucene search queries for text classification

  • Authors:
  • Laurence Hirsch;Robin Hirsch;Masoud Saeedi

  • Affiliations:
  • Sheffield Hallam University;University College London;Royal Holloway University of London

  • Venue:
  • Proceedings of the 9th annual conference on Genetic and evolutionary computation
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks.