Normalized text font resemblance method aimed at document image page clustering

  • Authors:
  • Costin-Anton Boiangiu;Andrei-Cristian Spataru;Andrei-Iulian Dvornic;Dan-Cristian Cananau

  • Affiliations:
  • Computer Science Department, "Politehnica" University of Bucharest, Bucharest, Romania;Computer Science Department, "Politehnica" University of Bucharest, Bucharest, Romania;Computer Science Department, "Politehnica" University of Bucharest, Bucharest, Romania;Computer Science Department, "Politehnica" University of Bucharest, Bucharest, Romania

  • Venue:
  • WSEAS Transactions on Computers
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes an approach towards obtaining the normalized measure of text resemblance in scanned images. The technique, aimed at automatic content conversion, is relying on the detection of standard character features and uses a sequence of procedures and algorithms applied sequentially on the input document. The approach makes use solely of the geometrical characteristics of characters, ignoring information regarding context or the character-recognition.