BigBatch: a toolbox for monochromatic documents

  • Authors:
  • Rafael Dueire Lins; Bruno Tenório Ávila

  • Affiliations:
  • Universidade Federal de Pernambuco, Brazil; Universidade Federal de Pernambuco, Brazil

  • Venue:
  • Proceedings of the 2005 ACM Symposium on Document Engineering
  • Year:
  • 2005

Abstract

BigBatch is a tool designed to automatically process thousands of monochromatic document images generated by production-line scanners. It removes noisy borders, checks and corrects orientation, calculates and compensates for the skew angle, crops the image to standardize document sizes, and finally compresses it in a user-defined file format. BigBatch incorporates the best recently developed algorithms for this kind of document image. It can run in either standalone or operator-assisted mode. In standalone mode, BigBatch can also distribute processing across clusters of workstations.
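
The staged processing described in the abstract can be illustrated with a minimal Python sketch. This is not BigBatch's code, and the abstract does not say which algorithms it uses: the sketch assumes a monochromatic image stored as a list of rows (1 = black, 0 = white), stands in a simple edge-connected flood fill for border removal, and implements only the border-removal and cropping stages. Orientation correction, skew compensation, and compression are omitted. All names (`remove_border_noise`, `crop_to_content`, `run_pipeline`, `PIPELINE`) are hypothetical.

```python
from collections import deque
from typing import Callable, List

Image = List[List[int]]  # assumed representation: 1 = black pixel, 0 = white pixel


def remove_border_noise(img: Image) -> Image:
    """Clear every black region connected to the image edge.

    A simple flood fill used here as a stand-in; it is an assumption,
    not the border-removal algorithm used by BigBatch.
    """
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    # Seed the fill with all black pixels lying on the outer frame.
    queue = deque(
        (r, c)
        for r in range(h)
        for c in range(w)
        if (r in (0, h - 1) or c in (0, w - 1)) and out[r][c] == 1
    )
    while queue:
        r, c = queue.popleft()
        if out[r][c] == 0:
            continue  # already cleared via another path
        out[r][c] = 0
        # Spread to 4-connected black neighbours.
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < h and 0 <= nc < w and out[nr][nc] == 1:
                queue.append((nr, nc))
    return out


def crop_to_content(img: Image) -> Image:
    """Crop to the bounding box of the remaining black pixels."""
    rows = [r for r, row in enumerate(img) if any(row)]
    if not rows:
        return img  # blank page: nothing to crop
    cols = [c for c in range(len(img[0])) if any(row[c] for row in img)]
    return [row[cols[0]:cols[-1] + 1] for row in img[rows[0]:rows[-1] + 1]]


def run_pipeline(img: Image, stages: List[Callable[[Image], Image]]) -> Image:
    """Apply each stage in order, feeding the output of one into the next."""
    for stage in stages:
        img = stage(img)
    return img


# Hypothetical stage list; further stages would be appended in the same way.
PIPELINE = [remove_border_noise, crop_to_content]

# A tiny "scan" with a black frame along the top and left edges.
scan = [
    [1, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
]
cleaned = run_pipeline(scan, PIPELINE)  # [[1, 1], [1, 0]]
```

Keeping each step as a function from image to image means a batch run is just `run_pipeline` applied to every file, which is also what makes the work easy to split across several machines.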