BigBatch is an image processing environment designed to process batches of thousands of monochromatic documents. One of its pioneering features is the flexibility to work in distributed environments such as clusters and grids. This paper presents the BigBatch tool and the results of a comparative analysis of cluster and grid configurations. The results show almost no difference in total execution time, indicating that performance is not a primary criterion for choosing between a cluster and a grid. Other, qualitative aspects may nevertheless influence this choice. This paper also considers these aspects and gives a general picture of how to use BigBatch successfully to process document images across many computers.