Text detection of two major indian scripts in natural scene images

  • Authors:
  • Aruni Roy Chowdhury;Ujjwal Bhattacharya;Swapan K. Parui

  • Affiliations:
  • Department of Information Technology, Heritage Institute of Technology, Kolkata, India;Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India;Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India

  • Venue:
  • CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article, we present a robust scheme for detection of Devanagari or Bangla texts in scene images. These are the two most popular scripts in India. The proposed scheme is primarily based on two major characteristics of such texts - (i) variations in stroke thickness for text components of a script are low compared to their non-text counterparts and (ii) presence of a headline along with a few vertical downward strokes originating from this headline. We use the Euclidean distance transform to verify the general characteristics of texts in (i). Also, we apply the probabilistic Hough line transform to detect the characteristic headline of Devanagari and Bangla texts. Further, similarity and adjacency measures are applied to identify text regions, which do not satisfy the verification in (ii). The proposed approach has been simulated on a repository of 120 images taken from Indian roads and the results are encouraging. Also, we have discussed the applicability of the proposed scheme for detection of English texts. Towards this end, we have considered the training and test samples from the image database of ICDAR 2003 Robust Reading Competition.