The fast scheme for document page segmentation in OCR using window and optimum image

  • Authors:
  • Wichian Premchaiswadi; Phaisarn Sutheebanjard; Nuchree Premchaiswadi

  • Affiliations:
  • Graduate School of Information Technology, Siam University, Phasi-charoen, Bangkok, Thailand;Graduate School of Information Technology, Siam University, Phasi-charoen, Bangkok, Thailand;Faculty of Information Technology, Dhurakij Pundit University, Bangkok, Thailand

  • Venue:
  • CIMMACS'06 Proceedings of the 5th WSEAS International Conference on Computational Intelligence, Man-Machine Systems and Cybernetics
  • Year:
  • 2006

Abstract

This paper presents a speed-up method for document page segmentation, one of the most important processes in an Optical Character Recognition (OCR) system. In the proposed scheme, a window of 12×12 pixels is used to find a black pixel and its contour border. An optimum image is then created from these character borders, in which each 12×12-pixel block of the original image is represented by one pixel. The number of pixels is thus reduced to 1/144 of the original, while the structure of the original image is preserved. Finally, the optimum image is used for the block extraction process, which therefore runs faster. The experimental results show that the proposed scheme significantly reduces the processing time of document page segmentation.
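
The reduction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say exactly how the contour border decides the value of a reduced pixel, so the code below assumes the simplest rule: a 12×12 window maps to a black pixel if it contains at least one black pixel. The function name `reduce_image` and the list-of-rows image representation are hypothetical and are not taken from the paper.

```python
WINDOW = 12  # window size stated in the abstract


def reduce_image(binary, window=WINDOW):
    """Map each window x window block of a binary image to one pixel.

    binary: list of rows, where 1 is a black pixel and 0 is white.
    Assumption (not from the paper): a block becomes black (1) if it
    contains at least one black pixel, otherwise it stays white (0).
    """
    height = len(binary)
    width = len(binary[0]) if height else 0
    # Round up so partial blocks at the right and bottom edges are kept.
    out_h = (height + window - 1) // window
    out_w = (width + window - 1) // window
    reduced = [[0] * out_w for _ in range(out_h)]
    for y, row in enumerate(binary):
        for x, pixel in enumerate(row):
            if pixel:
                reduced[y // window][x // window] = 1
    return reduced


# A 24x36 white page with two black pixels in different windows.
page = [[0] * 36 for _ in range(24)]
page[5][7] = 1    # falls in window row 0, column 0
page[13][30] = 1  # falls in window row 1, column 2
small = reduce_image(page)
```

Here the 24×36 page (864 pixels) is reduced to a 2×3 image (6 pixels), which is the 1/144 ratio mentioned in the abstract. A later block extraction pass would then operate on `small` rather than on `page`.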