A Model of Saliency-Based Visual Attention for Rapid Scene Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Logo Spotting by a Bag-of-words Approach for Document Categorization
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Towards an automatic characterization of criteria
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Retrieval of hand-sketched envelopes in logo images
ICIAR'07 Proceedings of the 4th international conference on Image Analysis and Recognition
Hi-index | 0.00 |
The document digitization process becomes a crucial economical issue in our society. Then, it becomes necessary to be able to organize this huge amount of documents. The work proposed in this paper tends to propose a new method to automatically classify document using a saliency-based segmentation process on one hand, and a terminology extraction and annotation on the other hand. The saliency-based segmentation is used to extract salient regions and by the way logo, while the terminology approach is used to annotate them and to automatically classify the document. The approach does not require human expertise, and use Google Images as a knowledge database. The results obtained on a real database of 1766 documents show the relevance of the approach.