This paper presents a novel web-based platform for experimental workflow development in historical document digitisation and analysis. The platform was developed as part of the IMPACT project and provides a range of tools and services for transforming physical documents into digital resources. The paper explains the main drivers behind the technical framework and its architecture, describes how and by whom the platform can be used, and presents some initial results. The central idea is an interoperable, distributed infrastructure in which tools are loosely coupled via web services. These services are wrapped in modular workflow templates that can be executed, combined and evaluated in many different ways. Because the workflows are registered through a Web 2.0 environment that is integrated with a workflow management system, users can easily discover, share, rate and tag workflows, which supports capacity building across the whole community. Where ground truth is available, the workflow templates can also be used to compare and evaluate new methods in a transparent and flexible way.
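The idea of wrapping loosely coupled tools in combinable workflow templates and scoring their output against ground truth can be illustrated with a minimal Python sketch. It is not the platform's actual implementation: the names `WorkflowTemplate`, `then`, `character_accuracy` and the two toy services are hypothetical, each "web service" is replaced by a local function from text to text, and the edit-distance metric is only one plausible choice of evaluation measure.

```python
from dataclasses import dataclass
from typing import Callable, List

# Assumption: a remote tool is modelled as a plain function from text to text.
Service = Callable[[str], str]


@dataclass
class WorkflowTemplate:
    """Hypothetical template: a named, ordered list of services."""
    name: str
    steps: List[Service]

    def run(self, document: str) -> str:
        # Pass the document through each service in order.
        result = document
        for step in self.steps:
            result = step(result)
        return result

    def then(self, other: "WorkflowTemplate") -> "WorkflowTemplate":
        # Combine two templates into a new one without modifying either.
        return WorkflowTemplate(f"{self.name}+{other.name}", self.steps + other.steps)


def edit_distance(a: str, b: str) -> int:
    # Levenshtein distance computed row by row.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = curr
    return prev[-1]


def character_accuracy(output: str, ground_truth: str) -> float:
    # Illustrative metric: 1 - edit distance / ground-truth length, floored at 0.
    if not ground_truth:
        return 1.0 if not output else 0.0
    return max(0.0, 1.0 - edit_distance(output, ground_truth) / len(ground_truth))


# Two toy services standing in for real post-processing tools.
def normalise_whitespace(text: str) -> str:
    return " ".join(text.split())


def replace_long_s(text: str) -> str:
    return text.replace("ſ", "s")


raw = "The  Congreſs   of Vienna"
ground_truth = "The Congress of Vienna"

cleanup = WorkflowTemplate("cleanup", [normalise_whitespace])
modernise = WorkflowTemplate("modernise", [replace_long_s])
combined = cleanup.then(modernise)

# Evaluate each candidate workflow against the same ground truth.
scores = {wf.name: character_accuracy(wf.run(raw), ground_truth)
          for wf in (cleanup, combined)}
```

Because every template is scored against the same ground truth with the same metric, alternative tool chains can be compared side by side, which is the kind of transparent evaluation the abstract describes.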