Creating structured PDF files using XML templates

  • Authors:
  • Matthew R. B. Hardy;David F. Brailsford;Peter L. Thomas

  • Affiliations:
  • University of Nottingham, Nottingham, UK;University of Nottingham, Nottingham, UK;University of Nottingham, Nottingham, UK

  • Venue:
  • Proceedings of the 2004 ACM symposium on Document engineering
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML representation as a template for the insertion of the logical structure into the existing PDF document thereby creating a Structured/Tagged PDF. The addition of logical structure adds value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually impaired users perform better) media options are enhanced (the ability to reflow PDF documents using structure as a guide makes PDF viable for use on hand-held devices) and the re-usability of the PDF documents benefits greatly from the presence of an XML-like structure tree to guide the process of text retrieval in reading order (e.g. when interfacing to XML applications and databases).