A brief introduction to the GeM annotation schema for complex document layout

  • Authors:
  • John Bateman; Renate Henschel; Judy Delin

  • Affiliations:
  • University of Bremen, Bremen, Germany; University of Stirling, Stirling, Scotland; University of Stirling, Newport Pagnell, England

  • Venue:
  • NLPXML '02 Proceedings of the 2nd workshop on NLP and XML - Volume 17
  • Year:
  • 2002

Abstract

In this paper we sketch the design, motivation and use of the GeM annotation scheme: an XML-based annotation framework for preparing corpora involving documents with complex layout of text, graphics, diagrams, layout and other navigational elements. We set out the basic organizational layers, contrast the technical approach with some other schemes for complex markup in the XML tradition, and indicate some of the applications we are pursuing.