Gabor Filter Based Text Extraction from Digital Document Images

Authors:
Yu-Long Qiao;Meng Li;Zhe-Ming Lu;Sheng-He Sun
Affiliations:
Harbin Institute of Technology, China/ Harbin Engineering University, China;Harbin Institute of Technology, China;Harbin Institute of Technology Shenzhen Graduate School, China;Harbin Institute of Technology, China
Venue:
IIH-MSP '06 Proceedings of the 2006 International Conference on Intelligent Information Hiding and Multimedia
Year:
2006

Citing 0
Cited 2

An efficient method for text detection in video based on stroke width similarity

ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part I
Understanding Digital Documents Using Gestalt Properties of Isothetic Components

International Journal of Digital Library Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The automatic text detection in document images is useful for many applications. This paper presents an algorithm that can automatically detect and extract text in digital document images. Firstly, we process and fuse Gabor filtered images at different orientations and scales and obtain an image that reflects the layout of the document image. Then, potential text regions are directly extracted from the resulting image. Finally, two criteria based on the geometrical property and high frequency content are adopted to kick-out those non-text regions. The experiments are performed on some representative images with different styles and with texts in different languages and fonts. Experimental results show that the algorithm works well on document images from a wide variety of source.