Table Form Document Synthesis by Grammar-Based Structure Analysis

  • Authors:
  • Affiliations:
  • Venue:
  • ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract: Document structure is an important issue not only for document analysis but for document synthesis. This pa-per presents a computer assisted document synthesis system based on the grammar-based structure analysis. The system is designed to accomplish the analysis and synthesis of table form documents cooperatively by user and computer; namely, the user interprets the document meaning and gives the entry data to be filled in, while the computer detects the boxes formed by horizontal and vertical rules and determine the logical relations of adjacent boxes. First, the document is decomposed into a set of boxes and they are classified semi-automatically into four types, blank, insertion, indication, and explanation. Then the box relations between indication box and its associated entry one are analyzed based on the semantic and geometric knowledge defined in the document structure grammar. Finally, the system generates L A T E X codes of the synthesized documents whose blank and insertion boxes are filled with the text and image data given by user. Experimental results have shown that the system analyzed successfully several kinds of table forms and yielded synthesized documents as expected.