Use of a genetic algorithm in brill's transformation-based part-of-speech tagger

Authors:
Garnett Wilson;Malcolm Heywood
Affiliations:
Dalhousie University, Halifax, NS, Canada;Dalhousie University, Halifax, NS, Canada
Venue:
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Year:
2005

Citing 6
Cited 2

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Computational Linguistics
Learning syntactic rules and tags with genetic algorithms for information retrieval and filtering: an empirical basis for grammatical rules

Information Processing and Management: an International Journal
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Dynamic Training Subset Selection for Supervised Learning in Genetic Programming

PPSN III Proceedings of the International Conference on Evolutionary Computation. The Third Conference on Parallel Problem Solving from Nature: Parallel Problem Solving from Nature
Part-of-Speech Tagging with Evolutionary Algorithms

CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
Coping with ambiguity and unknown words through probabilistic models

Computational Linguistics - Special issue on using large corpora: II

TBL Template Selection: An Evolutionary Approach

Current Topics in Artificial Intelligence
How evolutionary algorithms are applied to statistical natural language processing

Artificial Intelligence Review

Quantified Score

Hi-index	0.00

Visualization

Abstract

The tagging problem in natural language processing is to find a way to label every word in a text as a particular part of speech, e.g., proper noun. An effective way of solving this problem with high accuracy is the transformation-based or "Brill" tagger. In Brill's system, a number of transformation templates are specified a priori that are instantiated and ranked during a greedy search-based algorithm. This paper describes a variant of Brill's implementation that instead uses a genetic algorithm to generate the instantiated rules and provide an adaptive ranking. Based on tagging accuracy, the new system provides a better hybrid evolutionary computation solution to the part-of-speech (POS) problem than the previous attempt. Although not able to make up for the use of a priori knowledge utilized by Brill, the method appears to point the way for an improved solution to the tagging problem.