Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple View Geometry in Computer Vision
Multiple View Geometry in Computer Vision
ICDAR 2003 Robust Reading Competitions
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
A multifunctional reading assistant for the visually impaired
Journal on Image and Video Processing
Geometric Rectification of Camera-Captured Document Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Text Localization in Natural Scene Images Based on Conditional Random Field
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Recognizing Planar Symbols with Severe Perspective Deformation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document image analysis: issues, comparison of methods and remaining problems
Artificial Intelligence Review
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
The eigen-transform and applications
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
A head-mounted device for recognizing text in natural scenes
CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Automatic detection and recognition of signs from natural scenes
IEEE Transactions on Image Processing
Goal-Oriented Rectification of Camera-Based Document Images
IEEE Transactions on Image Processing
End-to-end scene text recognition
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Hi-index | 0.00 |
Cheap, ubiquitous, high-resolution digital cameras have led to opportunities that demand camera-based text understanding, such as wearable computing or assistive technology. Perspective distortion is one of the main challenges for text recognition in camera captured images since the camera may often not have a fronto-parallel view of the text. We present a method for perspective recovery of text in natural scenes, where text can appear as isolated words, short sentences or small paragraphs (as found on posters, billboards, shop and street signs etc.). It relies on the geometry of the characters themselves to estimate a rectifying homography for every line of text, irrespective of the view of the text over a large range of orientations. The horizontal perspective foreshortening is corrected by fitting two lines to the top and bottom of the text, while the vertical perspective foreshortening and shearing are estimated by performing a linear regression on the shear variation of the individual characters within the text line. The proposed method is efficient and fast. We present comparative results with improved recognition accuracy against the current state-of-the-art.