Multi-oriented english text line identification

  • Authors:
  • U. Pal;S. Sinha;B. B. Chaudhuri

  • Affiliations:
  • Indian Statistical Institute, Kolkata, India;Indian Statistical Institute, Kolkata, India;Indian Statistical Institute, Kolkata, India

  • Venue:
  • SCIA'03 Proceedings of the 13th Scandinavian conference on Image analysis
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

There are many artistic documents where text lines of a single page may have different inclinations (orientations). To enhance the ability of document analysis system, we have to extract text line in multiple orientations. In this paper, we propose a robust technique to detect English text lines of arbitrary orientation in a single document page. We propose here a bottom-up approach where the connected components are at first labelled. They are then clustered into word groups. Text lines of arbitrary orientation are identified from the estimation of these word groups. From an experiment of 3700 text lines, we obtained an accuracy of 98.3% by the proposed method.