Fast perspective recovery of text in natural scenes

Authors:
Carlos Merino-Gracia;Majid Mirmehdi;José Sigut;José L. González-Mora
Affiliations:
Neurochemistry and Neuroimaging Laboratory, University of La Laguna, Spain and Visual Information Laboratory, University of Bristol, United Kingdom;Visual Information Laboratory, University of Bristol, United Kingdom;Department of Systems Engineering and Control and Computer Architecture, University of La Laguna, Spain;Neurochemistry and Neuroimaging Laboratory, University of La Laguna, Spain
Venue:
Image and Vision Computing
Year:
2013

Citing 15
Cited 0

Twenty Years of Document Image Analysis in PAMI

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple View Geometry in Computer Vision

Multiple View Geometry in Computer Vision
ICDAR 2003 Robust Reading Competitions

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
A multifunctional reading assistant for the visually impaired

Journal on Image and Video Processing
Geometric Rectification of Camera-Captured Document Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Text Localization in Natural Scene Images Based on Conditional Random Field

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Recognizing Planar Symbols with Severe Perspective Deformation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Document image analysis: issues, comparison of methods and remaining problems

Artificial Intelligence Review
ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email)

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
The eigen-transform and applications

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
A head-mounted device for recognizing text in natural scenes

CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Automatic detection and recognition of signs from natural scenes

IEEE Transactions on Image Processing
Goal-Oriented Rectification of Camera-Based Document Images

IEEE Transactions on Image Processing
End-to-end scene text recognition

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cheap, ubiquitous, high-resolution digital cameras have led to opportunities that demand camera-based text understanding, such as wearable computing or assistive technology. Perspective distortion is one of the main challenges for text recognition in camera captured images since the camera may often not have a fronto-parallel view of the text. We present a method for perspective recovery of text in natural scenes, where text can appear as isolated words, short sentences or small paragraphs (as found on posters, billboards, shop and street signs etc.). It relies on the geometry of the characters themselves to estimate a rectifying homography for every line of text, irrespective of the view of the text over a large range of orientations. The horizontal perspective foreshortening is corrected by fitting two lines to the top and bottom of the text, while the vertical perspective foreshortening and shearing are estimated by performing a linear regression on the shear variation of the individual characters within the text line. The proposed method is efficient and fast. We present comparative results with improved recognition accuracy against the current state-of-the-art.