Composition of a dewarped and enhanced document image from two view images

Authors:
Hyung Il Koo;Jinho Kim;Nam Ik Cho
Affiliations:
Department of Electrical Engineering and Computer Science and INMC, Seoul National University, Seoul, Korea;Multimedia Laboratory, Telecommunication R&D Center, Samsung Electronics Company, Ltd., Suwon, Gyeonggi-do, Korea;Department of Electrical Engineering and Computer Science and INMC, Seoul National University, Seoul, Korea
Venue:
IEEE Transactions on Image Processing
Year:
2009

Abstract

In this paper, we propose an algorithm to compose a geometrically dewarped and visually enhanced image from two document images taken by a digital camera at different angles. Unlike conventional methods that require special equipment, assumptions about the contents of books, or complicated image acquisition steps, we estimate the unfolded book or document surface from corresponding points between the two images. For this purpose, the surface and camera matrices are estimated using structure reconstruction, 3-D projection analysis, and random sample consensus (RANSAC)-based curve fitting with a cylindrical surface model. Because we make no assumptions about the contents of books, the proposed method can be applied not only to optical character recognition (OCR) but also to the high-quality digitization of pictures in documents. In addition to dewarping for a structurally better image, image mosaicing is performed to further improve visual quality. By selecting the better parts of the images (those with less out-of-focus blur and/or without specular reflections) from either view, we compose a better image by stitching and blending them. These processes are formulated as energy minimization problems that can be solved using a graph cut method. Experiments on many kinds of book and document images show that the proposed algorithm works robustly and yields visually pleasing results. Also, the OCR rate of the resulting image is comparable to that of document images from a flatbed scanner.
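
To make the random sample consensus (RANSAC) curve-fitting step more concrete, here is a minimal sketch in Python. It is not the authors' implementation. It assumes, purely for illustration, that the cross-section of the cylindrical surface is available as noisy 2-D points (x, z) and can be modeled by a quadratic z = a*x^2 + b*x + c. The function names `fit_parabola` and `ransac_parabola`, the tolerance, and the iteration count are all hypothetical choices; the abstract does not specify the curve family or any parameters.

```python
import random


def fit_parabola(p0, p1, p2):
    """Exact quadratic z = a*x^2 + b*x + c through three (x, z) points.

    Returns None when two of the points share an x value (degenerate sample).
    """
    (x0, z0), (x1, z1), (x2, z2) = p0, p1, p2
    d = (x0 - x1) * (x0 - x2) * (x1 - x2)
    if abs(d) < 1e-12:
        return None
    a = (x2 * (z1 - z0) + x1 * (z0 - z2) + x0 * (z2 - z1)) / d
    b = (x2 * x2 * (z0 - z1) + x1 * x1 * (z2 - z0) + x0 * x0 * (z1 - z2)) / d
    c = (x1 * x2 * (x1 - x2) * z0
         + x2 * x0 * (x2 - x0) * z1
         + x0 * x1 * (x0 - x1) * z2) / d
    return a, b, c


def ransac_parabola(points, n_iters=200, tol=0.05, seed=0):
    """Illustrative RANSAC loop: keep the minimal-sample model with most inliers.

    points  : list of (x, z) tuples, possibly containing outliers
    n_iters : number of random minimal samples to try (assumed value)
    tol     : maximum |z_model - z| for a point to count as an inlier (assumed)
    """
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    for _ in range(n_iters):
        # A quadratic has three unknowns, so the minimal sample is three points.
        model = fit_parabola(*rng.sample(points, 3))
        if model is None:
            continue
        a, b, c = model
        inliers = [(x, z) for x, z in points
                   if abs(a * x * x + b * x + c - z) <= tol]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = model, inliers
    return best_model, best_inliers
```

In practice the winning model would usually be refit to all of its inliers by least squares. That step is omitted here to keep the sketch short. The point of the consensus loop is that mismatched correspondences between the two views, which behave as outliers, do not pull the estimated surface profile away from the true page shape.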