Robust image based document comparison using attributed relational graphs

  • Authors:
  • Katja Worm;Beate Meffert

  • Affiliations:
  • Siemens ElectroCom Postautomation GmbH, Berlin, Germany;Humboldt University of Berlin, Berlin, Germany

  • Venue:
  • SPPRA '08 Proceedings of the Fifth IASTED International Conference on Signal Processing, Pattern Recognition and Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a new approach which allows similarity measurement between documents using a compact image based feature representation. Various applications, in particular document management systems, require the comparison of scanned documents for their classification. The proposed method focuses on mail piece identification within the postal sorting process. Generally, mail pieces resemble in their structure and differ in text regions. Concentration on structural text region features and text line profiles exploits these differences. An attributed relational graph representation is used to combine detailed local information with rough layout information of a document. This method is designed to comply with the strong requirements for postal sorting machines. In particular this approach is invariant towards document rotation, translation and towards document surface modifications caused by mail piece handling and transportation. Efficient algorithms allow its usage in a real time environment. The quality and applicability for mail piece identification has been proven in various tests.