Headline based text extraction from outdoor images

  • Authors:
  • Ranjit Ghoshal;Anandarup Roy;Tapan Kumar Bhowmik;Swapan K. Parui

  • Affiliations:
  • St. Thomas' Collage of Engineering and Technology, Kolkata, India;CVPR Unit, Indian Statistical Institute, Kolkata, India;Faculty of Mathematics and Natural Sciences, University of Groningen, Netherlands;CVPR Unit, Indian Statistical Institute, Kolkata, India

  • Venue:
  • PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The goal of this article is to design an effective scheme for extraction of Bangla/Devnagari text from outdoor images. We first segment a color image using fuzzy c-means algorithm. In Bangla/Devnagari script, text may be attached/unattached to the headlines. Hence, after segmentation, headlines are detected from each connected components using morphology. Now, the components attached or close to the detected headlines are separated. Further by applying certain shape and position based purification we could distinguish text and non text. Our experiments on a dataset of 100 outdoor images containing Bangla and/or Devnagari text reveals satisfactory performance.