The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation, while complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumptions about the size, orientation or colour of the characters and that tolerates characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: the Posters from the Spanish Civil War.