High Performance Cluster Computing: Architectures and Systems
High Performance Cluster Computing: Architectures and Systems
Document Image Analysis: An Executive Briefing
Document Image Analysis: An Executive Briefing
A new algorithm for removing noisy borders from monochromatic documents
Proceedings of the 2004 ACM symposium on Applied computing
A fast orientation and skew detection algorithm for monochromatic document images
Proceedings of the 2005 ACM symposium on Document engineering
A new rotation algorithm for monochromatic images
Proceedings of the 2005 ACM symposium on Document engineering
Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
A quantitative method for assessing algorithms to remove back-to-front interference in documents
Proceedings of the 2007 ACM symposium on Applied computing
A fast algorithm to binarize and filter documents with back-to-front interference
Proceedings of the 2007 ACM symposium on Applied computing
BigBatch: a document processing platform for clusters and grids
Proceedings of the 2008 ACM symposium on Applied computing
Content recognition and indexing in the LiveMemory platform
GREC'09 Proceedings of the 8th international conference on Graphics recognition: achievements, challenges, and evolution
HistDoc v. 2.0: enhancing a platform to process historical documents
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
Statistically analyzing RGB histograms to remove highlighting in aged paper monochromatic documents
GREC'11 Proceedings of the 9th international conference on Graphics Recognition: new trends and challenges
Hi-index | 0.00 |
BigBatch is a processing environment designed to automatically process batches of millions of monochromatic images of documents generated by production line scanners. It removes noisy borders, checks and corrects orientation, calculates and compensates the skew angle, crops the image standardizing document sizes, and finally compresses it according to user defined file format. BigBatch encompasses the best and recently developed algorithms for such kind of document images. BigBatch may work either in standalone or operator assisted modes. Besides that, BigBatch in standalone mode is able to process in clusters of workstations or in grids.