Regular expressions for language engineering

  • Authors:
  • L. Karttunen;J.-P. Chanod;G. Grefenstette;A. Schille

  • Affiliations:
  • Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Lauri.Karttunen@grenoble.rxrc.xerox.com,;Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Jean-Pierre.Chanod@grenoble.rxrc.xerox.com;Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Gregory.Grefenstette@grenoble.rxrc.xerox.com,;Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Anne.Schiller@grenoble.rxrc.xerox.com

  • Venue:
  • Natural Language Engineering
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many of the processing steps in natural language engineering can be performed using finite state transducers. An optimal way to create such transducers is to compile them from regular expressions. This paper is an introduction to the regular expression calculus, extended with certain operators that have proved very useful in natural language applications ranging from tokenization to light parsing. The examples in the paper illustrate in concrete detail some of these applications.