T-HOG: An effective gradient-based descriptor for single line text regions

Authors:
Rodrigo Minetto; Nicolas Thome; Matthieu Cord; Neucimar J. Leite; Jorge Stolfi
Affiliations:
DAINF, Federal University of Technology of Paraná, Curitiba, Brazil; Laboratoire d'Informatique Paris 6 (LIP6), Université Pierre et Marie Curie, Paris, France; Laboratoire d'Informatique Paris 6 (LIP6), Université Pierre et Marie Curie, Paris, France; Institute of Computing, University of Campinas, Campinas, Brazil; Institute of Computing, University of Campinas, Campinas, Brazil
Venue:
Pattern Recognition
Year:
2013

Citing 20
Cited 0

Filters for common resampling tasks. Graphics Gems.
Support-Vector Networks. Machine Learning.
Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Scene Text Extraction in Natural Scene Images using Hierarchical Feature Combining and Verification. ICPR '04: Proceedings of the 17th International Conference on Pattern Recognition, Volume 2.
Histograms of Oriented Gradients for Human Detection. CVPR '05: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1.
Text Locating Competition Results. ICDAR '05: Proceedings of the Eighth International Conference on Document Analysis and Recognition.
Text search for medieval manuscript images. Pattern Recognition.
Text line detection in handwritten documents. Pattern Recognition.
A Robust System to Detect and Localize Texts in Natural Scene Images. DAS '08: Proceedings of the Eighth IAPR International Workshop on Document Analysis Systems.
Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm. ICDAR '09: Proceedings of the 10th International Conference on Document Analysis and Recognition.
A New Block Partitioned Text Feature for Text Verification. ICDAR '09: Proceedings of the 10th International Conference on Document Analysis and Recognition.
Fast and robust text detection in images and video frames. Image and Vision Computing.
Accurate video text detection through classification of low and high contrast images. Pattern Recognition.
A two-stage scheme for text detection in video images. Image and Vision Computing.
Operator context scanning to support high segmentation rates for real time license plate recognition. Pattern Recognition.
Text detection in images using sparse representation with discriminative dictionaries. Image and Vision Computing.
Detecting and reading text in natural scenes. CVPR '04: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
A cognitive and video-based approach for multinational License Plate Recognition. Machine Vision and Applications.
Recent Advances in Video Based Document Processing: A Review. DAS '12: Proceedings of the 10th IAPR International Workshop on Document Analysis Systems.
Text String Detection From Natural Scenes by Structure-Based Partition and Grouping. IEEE Transactions on Image Processing.

Abstract

We discuss the use of histogram of oriented gradients (HOG) descriptors as an effective tool for text description and recognition. Specifically, we propose a HOG-based texture descriptor (T-HOG) that characterizes single-line texts in outdoor scenes by partitioning the image into overlapping horizontal cells with gradual boundaries. The input to our algorithm is a rectangular image presumed to contain a single line of text in Roman-like characters. The output is a relatively short descriptor that provides an effective input to an SVM classifier. Extensive experiments show that T-HOG is more accurate than Dalal and Triggs's original HOG-based classifier for any descriptor size. In addition, we show that T-HOG is an effective tool for text/non-text discrimination and can be used in various text detection applications. In particular, combining T-HOG with a permissive bottom-up text detector is shown to outperform state-of-the-art text detection systems on two major publicly available databases.
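
The idea of overlapping horizontal cells with gradual boundaries can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `thog_like_descriptor`, the Gaussian row weights, the signed 8-bin orientation quantization, the default of 4 cells, and the L2 normalization are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np


def thog_like_descriptor(image, n_cells=4, n_bins=8, overlap=1.0):
    """Illustrative HOG-style descriptor over soft horizontal stripes.

    All parameter names and defaults are hypothetical; the abstract only
    states that cells are horizontal, overlapping, and gradually bounded.
    """
    img = np.asarray(image, dtype=float)
    if img.ndim != 2:
        raise ValueError("expected a 2-D grayscale image")

    # np.gradient returns the derivative along rows (y) first, then columns (x).
    gy, gx = np.gradient(img)
    magnitude = np.hypot(gx, gy)

    # Signed orientation in [0, 2*pi), quantized into n_bins (an assumption).
    angle = np.mod(np.arctan2(gy, gx), 2.0 * np.pi)
    bins = np.minimum((angle / (2.0 * np.pi) * n_bins).astype(int), n_bins - 1)

    # Normalized vertical position of each pixel row, and of each cell center.
    height = img.shape[0]
    rows = (np.arange(height) + 0.5) / height
    centers = (np.arange(n_cells) + 0.5) / n_cells

    # Gaussian row weights stand in for the "gradual boundaries": each row
    # contributes to several neighbouring cells instead of exactly one.
    sigma = overlap / (2.0 * n_cells)

    histograms = np.zeros((n_cells, n_bins))
    for c, center in enumerate(centers):
        row_weight = np.exp(-0.5 * ((rows - center) / sigma) ** 2)
        weighted = magnitude * row_weight[:, None]
        histograms[c] = np.bincount(
            bins.ravel(), weights=weighted.ravel(), minlength=n_bins
        )

    # Concatenate the per-cell histograms and L2-normalize the result.
    vector = histograms.ravel()
    norm = np.linalg.norm(vector)
    return vector / norm if norm > 0 else vector
```

With these defaults the descriptor has `n_cells * n_bins = 32` entries regardless of the image size, which matches the abstract's emphasis on a relatively short descriptor. Such a vector could then be passed to any SVM implementation for text/non-text classification.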