Measure for measure: towards increased component comparability and exchange

  • Authors:
  • Stephan Oepen;Ulrich Callmeier

  • Affiliations:
  • Center for the Study of Language and Information Stanford University, Stanford, CA;Department of Computational Linguistics, Saarland University, Saarbrücken, Germany

  • Venue:
  • New developments in parsing technology
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Over the past few years, significant progress has been made in efficient processing with wide-coverage HPSG grammars. HPSG-based parsing systems are now available that can process medium-complexity sentences (of ten to twenty words, say) in average parse times equivalent to real (i.e. human reading) time. A large number of engineering improvements in current HPSG systems have been achieved through collaboration of multiple research centers and mutual exchange of experience, encoding techniques, algorithms, and even pieces of software. This article presents an approach to grammar and system engineering, termed competence & performance profiling, that makes systematic experimentation and the precise empirical study of system properties a focal point in development. Adapting the profiling metaphor familiar from software engineering to constraint-based grammars and parsers enables developers to maintain an accurate record of system evolution, identify grammar and system deficiencies quickly, and compare to earlier versions or between different systems. We discuss a number of example problems that motivate the experimental approach, and apply the empirical methodology in a fairly detailed discussion of progress made during a development period of three years.