Touching text character localization in graphical documents using SIFT

  • Authors:
  • Partha Pratim Roy;Umapada Pal;Josep Lladós

  • Affiliations:
  • Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain;Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India;Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain

  • Venue:
  • GREC'09 Proceedings of the 8th international conference on Graphics recognition: achievements, challenges, and evolution
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Interpretation of graphical document images is a challenging task as it requires proper understanding of text/graphics symbols present in such documents. Difficulties arise in graphical document recognition when text and symbol overlapped/touched. Intersection of text and symbols with graphical lines and curves occur frequently in graphical documents and hence separation of such symbols is very difficult. Several pattern recognition and classification techniques exist to recognize isolated text/symbol. But, the touching/overlapping text and symbol recognition has not yet been dealt successfully. An interesting technique, Scale Invariant Feature Transform (SIFT), originally devised for object recognition can take care of overlapping problems. Even if SIFT features have emerged as a very powerful object descriptors, their employment in graphical documents context has not been investigated much. In this paper we present the adaptation of the SIFT approach in the context of text character localization (spotting) in graphical documents. We evaluate the applicability of this technique in such documents and discuss the scope of improvement by combining some state-of-the-art approaches.