DMOS: A Generic Document Recognition Method, Application to an Automatic Generator of Musical Scores, Mathematical Formulae and Table Structures Recognition Systems

  • Authors:
  • Bertrand Coüasnon


  • Venue:
  • ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
  • Year:
  • 2001

Abstract

Genericity in structured document recognition is a difficult challenge. In this paper we propose a new generic document recognition method (DMOS), made up of a new grammatical formalism (EPF) and an associated parser that is able to introduce context into segmentation. We implement this method to obtain a generator of document recognition systems. This generator can automatically produce new recognition systems: it is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way we have developed several recognition systems: one for musical scores, one for mathematical formulae and one for recursive table structures. We have also defined a specific application for badly damaged military forms from the 19th century, and we have been able to test the generated system on 5,000 of these forms. This has allowed us to validate the DMOS method on a real-world application.