Accurate and robust text detection: a step-in for text retrieval in natural scene images

Authors:
Xu-Cheng Yin;Xuwang Yin;Kaizhu Huang;Hong-Wei Hao
Affiliations:
University of Science and Technology Beijing, Beijing, China;University of Science and Technology Beijing, Beijing, China;Xi'an Jiaotong-Liverpool University, Suzhou, China;Institute of Automation, Chinese Academy of Sciences, Beijing, China
Venue:
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Year:
2013

Citing 5
Cited 0

A method for text localization and recognition in real-world images

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
A Hybrid Approach to Detect and Localize Texts in Natural Scene Images

IEEE Transactions on Image Processing
Real-time scene text localization and recognition

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Scene text detection using graph model built upon maximally stable extremal regions

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose and implement a robust text detection system, which is a prominent step-in for text retrieval in natural scene images or videos. Our system includes several key components: (1) A fast and effective pruning algorithm is designed to extract Maximally Stable Extremal Regions as character candidates using the strategy of minimizing regularized variations. (2) Character candidates are grouped into text candidates by the single-link clustering algorithm, where distance weights and threshold of clustering are learned automatically by a novel self-training distance metric learning algorithm. (3) The posterior probabilities of text candidates corresponding to non-text are estimated with an character classifier; text candidates with high probabilities are then eliminated and finally texts are identified with a text classifier. The proposed system is evaluated on the ICDAR 2011 Robust Reading Competition dataset and a publicly available multilingual dataset; the f measures are over 76% and 74% which are significantly better than the state-of-the-art performances of 71% and 65%, respectively.