A multiresolution spline with application to image mosaics
ACM Transactions on Graphics (TOG)
Document Representation and Its Application to Page Decomposition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Geometric Information Criterion for Model Selection
International Journal of Computer Vision
MLESAC: a new robust estimator with application to estimating image geometry
Computer Vision and Image Understanding - Special issue on robusst statistical techniques in image understanding
Multiple view geometry in computer visiond
Multiple view geometry in computer visiond
Rectifying the Bound Document Image Captured by the Camera: A Model Based Approach
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
A Cylindrical Surface Model to Rectify the Bound Document Image
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Interactive digital photomontage
ACM SIGGRAPH 2004 Papers
An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Document capture using stereo vision
Proceedings of the 2004 ACM symposium on Document engineering
Conformal Deskewing of Non-Planar Documents
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Flattening Curved Documents in Images
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Warped Image restoration with Applications to Digital Libraries
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
3D Structure Recovery and Unwarping of Surfaces Applicable to Planes
International Journal of Computer Vision
Camera-Based Document Image Mosaicing
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Restoring 2D Content from Distorted Documents
IEEE Transactions on Pattern Analysis and Machine Intelligence
Geometric Rectification of Camera-Captured Document Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Video mosaicing based on structure from motion for distortion-free document digitization
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
A comparative study of energy minimization methods for markov random fields
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Geometric and shading correction for images of printed materials using boundary
IEEE Transactions on Image Processing
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
3D reconstruction for damaged documents: imaging of the great parchment book
Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing
Hi-index | 0.01 |
In this paper, we propose an algorithm to compose a geometrically dewarped and visually enhanced image from two document images taken by a digital camera at different angles. Unlike the conventional works that require special equipments or assumptions on the contents of books or complicated image acquisition steps, we estimate the unfolded book or document surface from the corresponding points between two images. For this purpose, the surface and camera matrices are estimated using structure reconstruction, 3-D projection analysis, and random sample consensus-based curve fitting with the cylindrical surface model. Because we do not need any assumption on the contents of books, the proposed method can be applied not only to optical character recognition (OCR), but also to the high-quality digitization of pictures in documents. In addition to the dewarping for a structurally better image, image mosaic is also performed for further improving the visual quality. By finding better parts of images (with less out of focus blur and/or without specular reflections) from either of views, we compose a better image by stitching and blending them. These processes are formulated as energy minimization problems that can be solved using a graph cut method. Experiments on many kinds of book or document images show that the proposed algorithm robustly works and yields visually pleasing results. Also, the OCR rate of the resulting image is comparable to that of document images from a flatbed scanner.