A Contour-Based Robust Algorithm for Text Detection in Color Images

Authors:
Yangxing Liu;Satoshi Goto;Takeshi Ikenaga
Affiliations:
The authors are with the Graduate School of Information, Production and Systems, Waseda University, Kita kyushu-shi, 808--0135 Japan. E-mail: lyx@ruri.waseda.jp, E-mail: goto@waseda.jp, E-mail: ik ...;The authors are with the Graduate School of Information, Production and Systems, Waseda University, Kita kyushu-shi, 808--0135 Japan. E-mail: lyx@ruri.waseda.jp, E-mail: goto@waseda.jp, E-mail: ik ...;The authors are with the Graduate School of Information, Production and Systems, Waseda University, Kita kyushu-shi, 808--0135 Japan. E-mail: lyx@ruri.waseda.jp, E-mail: goto@waseda.jp, E-mail: ik ...
Venue:
IEICE - Transactions on Information and Systems
Year:
2006

Citing 0
Cited 6

Text region extraction algorithm on steel making process

ROCOM'08 Proceedings of the 8th WSEAS International Conference on Robotics, Control and Manufacturing Technology
Foreground Text Extraction in Color Document Images for Enhanced Readability

PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
New approach based on texture and geometric features for text detection

ICISP'10 Proceedings of the 4th international conference on Image and signal processing
Localizing slab identification numbers in factory scene images

Expert Systems with Applications: An International Journal
Text extraction from videos using a hybrid approach

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Integrating multiple character proposals for robust scene text extraction

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Text detection in color images has become an active research area in the past few decades. In this paper, we present a novel approach to accurately detect text in color images possibly with a complex background. The proposed algorithm is based on the combination of connected component and texture feature analysis of unknown text region contours. First, we utilize an elaborate color image edge detection algorithm to extract all possible text edge pixels. Connected component analysis is performed on these edge pixels to detect the external contour and possible internal contours of potential text regions. The gradient and geometrical characteristics of each region contour are carefully examined to construct candidate text regions and classify part non-text regions. Then each candidate text region is verified with texture features derived from wavelet domain. Finally, the Expectation maximization algorithm is introduced to binarize each text region to prepare data for recognition. In contrast to previous approach, our algorithm combines both the efficiency of connected component based method and robustness of texture based analysis. Experimental results show that our proposed algorithm is robust in text detection with respect to different character size, orientation, color and language and can provide reliable text binarization result.