Recent progress on the OCRopus OCR system

  • Authors:
  • Thomas Breuel

  • Affiliations:
  • U. Kaiserslautern and DFKI

  • Venue:
  • Proceedings of the International Workshop on Multilingual OCR
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The OCRopus system is an open source OCR system developed for book capture and digital library applications. It is designed to be a multilingual system in which all components are easily pluggable and replaceable. In this paper, I describe recent progress, on-going work, and preliminary results in the development of the OCRopus system, including the new component model, a new line recognizer, a new set of decoders, and language modeling tools.