Efficient search in document image collections

Authors:
Anand Kumar;C. V. Jawahar;R. Manmatha
Affiliations:
Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India;Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India;Department of Computer Science, University of Massachusetts Amherst, MA
Venue:
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part I
Year:
2007

Citing 12
Cited 8

Approximate nearest neighbors: towards removing the curse of dimensionality

STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Similarity Search in High Dimensions via Hashing

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Advances in the BBN BYBLOS OCR System

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Fast Pose Estimation with Parameter-Sensitive Hashing

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
A search engine for historical manuscript images

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Rapid Object Indexing Using Locality Sensitive Hashing and Joint 3D-Signature Space Estimation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Searching Off-line Arabic Documents

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Retrieval of Ottoman documents

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Word spotting for historical documents

International Journal on Document Analysis and Recognition
Keyword-guided word spotting in historical printed documents using synthetic data and user feedback

International Journal on Document Analysis and Recognition
Retrieval from document image collections

DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval

DAS'06 Proceedings of the 7th international conference on Document Analysis Systems

Efficient Language-Independent Retrieval of Printed Documents without OCR

SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Nearest neighbor based collection OCR

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Efficient logo retrieval through hashing shape context descriptors

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Towards more effective distance functions for word image matching

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
A line-based representation for matching words in historical manuscripts

Pattern Recognition Letters
Content level access to digital library of India pages

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Experimental comparison of representation methods and distance measures for time series data

Data Mining and Knowledge Discovery
Recognition of Bangla compound characters using structural decomposition

Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an efficient indexing and retrieval scheme for searching in document image databases. In many non-European languages, optical character recognizers are not very accurate. Word spotting - word image matching - may instead be used to retrieve word images in response to a word image query. The approaches used for word spotting so far, dynamic time warping and/or nearest neighbor search, tend to be slow. Here indexing is done using locality sensitive hashing (LSH) - a technique which computes multiple hashes - using word image features computed at word level. Efficiency and scalability is achieved by content-sensitive hashing implemented through approximate nearest neighbor computation. We demonstrate that the technique achieves high precision and recall (in the 90% range), using a large image corpus consisting of seven Kalidasa's (a well known Indian poet of antiquity) books in the Telugu language. The accuracy is comparable to using dynamic time warping and nearest neighbor search while the speed is orders of magnitude better - 20000 word images can be searched in milliseconds.