A graph based approach for heterogeneous document segmentation

  • Authors:
  • Fattah Zirari; Driss Mammass; Abdellatif Ennaji; Stephane Nicolas

  • Affiliations:
  • Laboratory IRF-SIC, Ibn Zohr University, Agadir, Morocco and Laboratory LITIS, University of Rouen, Rouen, France; Laboratory IRF-SIC, Ibn Zohr University, Agadir, Morocco; Laboratory LITIS, University of Rouen, Rouen, France; Laboratory LITIS, University of Rouen, Rouen, France

  • Venue:
  • ICISP'12 Proceedings of the 5th international conference on Image and Signal Processing
  • Year:
  • 2012

Abstract

In document image processing, text/graphics separation is a major step that conditions the performance of recognition and indexing systems. It involves identifying and separating the graphical and textual components of a document image, so approaches that address this problem effectively are needed. This paper presents a method for separating textual and non-textual components in document images using graph-based modeling and structural analysis. The method is presented as a fast and efficient way to separate the graphical and textual areas of a document. The approach is validated on examples of technical documents and magazines drawn from databases accepted by the community.