Document classification usually requires structural features, such as the physical layout, to obtain good accuracy rates on complex documents. This paper introduces a descriptor of the layout and a distance measure based on cyclic Dynamic Time Warping (DTW), which can be computed in $\mathcal{O}(n^2)$. The descriptor is translation-invariant and can easily be modified to be scale- and rotation-invariant. Experiments with this descriptor and its rotation-invariant modification are performed on the Girona Archives database and compared against another common layout distance, the Minimum Weight Edge Cover (MWEC). The experiments show that both variants outperform the MWEC in accuracy and in speed, particularly on rotated documents.
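To make the idea of a cyclic DTW distance concrete, the following is a minimal sketch, not the paper's algorithm. It assumes that a layout descriptor can be treated as a cyclic sequence of numbers, which the abstract does not specify, and it uses hypothetical function names (`dtw`, `cyclic_dtw`). It also takes the brute-force route of trying every cyclic shift, which costs $\mathcal{O}(n^3)$ for two sequences of length $n$, whereas the paper reports an $\mathcal{O}(n^2)$ computation whose details are not given here.

```python
from typing import Sequence


def dtw(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain DTW cost between two sequences, with |x - y| as local cost."""
    n, m = len(a), len(b)
    inf = float("inf")
    # prev[j] holds the best cost of aligning the rows seen so far with b[:j].
    prev = [inf] * (m + 1)
    prev[0] = 0.0
    for i in range(1, n + 1):
        cur = [inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells.
            cur[j] = cost + min(prev[j], cur[j - 1], prev[j - 1])
        prev = cur
    return prev[m]


def cyclic_dtw(a: Sequence[float], b: Sequence[float]) -> float:
    """Brute-force cyclic DTW: minimum DTW cost over all cyclic shifts of a.

    Assumption: this is an illustrative O(n^3) baseline, not the O(n^2)
    method described in the abstract.
    """
    a = list(a)
    if not a:
        return dtw(a, b)
    return min(dtw(a[s:] + a[:s], b) for s in range(len(a)))


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0]
    y = [3.0, 4.0, 1.0, 2.0]  # x shifted cyclically by two positions
    print(dtw(x, y), cyclic_dtw(x, y))
```

Because the minimum is taken over every starting point of the first sequence, two descriptors that differ only by a cyclic shift get distance zero. That is the property that makes a cyclic alignment attractive for rotated inputs.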