An analysis of binarization ground truthing
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
A platform for storing, visualizing, and interpreting collections of noisy documents
AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
Pixel accurate document image content extraction
Proceedings of the 2011 ACM Symposium on Applied Computing
MAST: multi-script annotation toolkit for scenic text
Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
A design of a preprocessing framework for large database of historical documents
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
Proceedings of the International Working Conference on Advanced Visual Interfaces
Divide and conquer: atomizing and parallelizing a task in a mobile crowdsourcing platform
Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia
Hi-index | 0.00 |
We present a user interface design for labeling elements in document images at a pixel level. Labels are represented by overlay color, which might map to such terms as “handwriting”, “machine print”, “graphics”, etc. The primary purpose is to streamline processes for manual production of ground truth data, which is necessary for training algorithms and evaluating performance. Unlike general paint-type programs, the UI design is targeted specifically toward selection of collections of foreground pixels that are likely to be meaningful elements in a document image analysis context.Our implementation, called PixLabeler, is available for download and allows customized plug-ins for bootstrapping according to the labeling task.