BigBatch – an environment for processing monochromatic documents

  • Authors:
  • Rafael Dueire Lins;Bruno Tenório Ávila;Andrei de Araújo Formiga

  • Affiliations:
  • Universidade Federal de Pernambuco, Recife, PE, Brazil;Universidade Federal de Pernambuco, Recife, PE, Brazil;Universidade Federal de Pernambuco, Recife, PE, Brazil

  • Venue:
  • ICIAR'06 Proceedings of the Third international conference on Image Analysis and Recognition - Volume Part II
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

BigBatch is a processing environment designed to automatically process batches of millions of monochromatic images of documents generated by production line scanners. It removes noisy borders, checks and corrects orientation, calculates and compensates the skew angle, crops the image standardizing document sizes, and finally compresses it according to user defined file format. BigBatch encompasses the best and recently developed algorithms for such kind of document images. BigBatch may work either in standalone or operator assisted modes. Besides that, BigBatch in standalone mode is able to process in clusters of workstations or in grids.