Approximate nearest neighbor queries in fixed dimensions
SODA '93 Proceedings of the fourth annual ACM-SIAM Symposium on Discrete algorithms
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Speeded-Up Robust Features (SURF)
Computer Vision and Image Understanding
HOTPAPER: multimedia interaction with paper using mobile phones
MM '08 Proceedings of the 16th ACM international conference on Multimedia
PCA-SIFT: a more distinctive representation for local image descriptors
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Embedded media markers: marks on paper that signify associated media
Proceedings of the 15th international conference on Intelligent user interfaces
Pacer: fine-grained interactive paper via camera-touch hybrid gestures on a cell phone
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Proceedings of the international conference on Multimedia
Embedded media marker: linking multimedia to paper
Proceedings of the international conference on Multimedia
International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Discrete point based signatures and applications to document matching
ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Hi-index | 0.00 |
This paper presents a tool and a novel Fast Invariant Transform (FIT) algorithm for language independent e-documents access. The tool enables a person to access an e-document through an informal camera capture of a document hardcopy. It can save people from remembering/exploring numerous directories and file names, or even going through many pages/paragraphs in one document. It can also facilitate people's manipulation of a document or people's interactions through documents. Additionally, the algorithm is useful for binding multimedia data to language independent paper documents. Our document recognition algorithm is inspired by the widely known SIFT descriptor [4] but can be computed much more efficiently for both descriptor construction and search. It also uses much less storage space than the SIFT approach. By testing our algorithm with randomly scaled and rotated document pages, we can achieve a 99.73% page recognition rate on the 2188-page ICME06 proceedings and 99.9% page recognition rate on a 504-page Japanese math book [2].