Approximate nearest neighbors: towards removing the curse of dimensionality
STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Document image database retrieval and browsing using texture analysis
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Classification Method Study for Automatic Form Class Identification
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 1 - Volume 1
Multiscale document description using rectangular granulometries
International Journal on Document Analysis and Recognition
Layout based document image retrieval by means of XY tree reduction
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Distance Measures for Layout-Based Document Image Retrieval
DIAL '06 Proceedings of the Second International Conference on Document Image Analysis for Libraries
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Image classification: Classifying distributions of visual features
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Asymmetric distance estimation with sketches for similarity search in high-dimensional spaces
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
A Rotation Invariant Page Layout Descriptor for Document Classification and Retrieval
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Document Image Retrieval with Local Feature Sequences
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Page frame detection for double page document images
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Product Quantization for Nearest Neighbor Search
IEEE Transactions on Pattern Analysis and Machine Intelligence
Asymmetric distances for binary embeddings
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Texture information in run-length matrices
IEEE Transactions on Image Processing
Hi-index | 0.01 |
We present a new document image descriptor based on multi-scale runlength histograms. This descriptor does not rely on layout analysis and can be computed efficiently. We show how this descriptor can achieve state-of-the-art results on two very different public datasets in classification and retrieval tasks. Moreover, we show how we can compress and binarize these descriptors to make them suitable for large-scale applications. We can achieve state-of-the-art results in classification using binary descriptors of as few as 16-64 bits.