A multi-scale framework for adaptive binarization of degraded document images

  • Authors:
  • Reza Farrahi Moghaddam;Mohamed Cheriet

  • Affiliations:
  • Synchromedia Laboratory for Multimedia Communication in Telepresence, École de Technologie Supérieure, Montréal, QC, Canada H3C 1K3;Synchromedia Laboratory for Multimedia Communication in Telepresence, École de Technologie Supérieure, Montréal, QC, Canada H3C 1K3

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Abstract

In this work, a multi-scale binarization framework is introduced that can be used with any adaptive threshold-based binarization method. The framework improves binarization results and restores weak connections and strokes, especially in degraded historical documents. This is achieved thanks to the localized nature of the framework in the spatial domain. The framework requires several binarizations at different scales, a cost that is addressed by introducing fast grid-based models. These models make it possible to explore high scales that are usually out of reach for traditional approaches. To expand the set of adaptive methods, an adaptive modification of Otsu's method, called AdOtsu, is introduced. In addition, to restore document images suffering from bleed-through degradation, the framework is combined with recursive adaptive methods. The framework shows promising performance in subjective and objective evaluations performed on available datasets.
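
The idea of running a threshold-based method on grids of several sizes can be illustrated with a minimal Python sketch. It is not the authors' implementation and it is not AdOtsu: it applies the classic Otsu threshold independently inside each grid cell as a stand-in for an adaptive method. The function names, the `min_contrast` test for flat cells, and the vote-based combination of scales are all assumptions made for this illustration, since the abstract does not specify how results from different scales are merged.

```python
import numpy as np


def otsu_threshold(values: np.ndarray) -> int:
    """Classic Otsu threshold for 8-bit gray values; pixels <= t count as dark."""
    hist = np.bincount(values.ravel(), minlength=256).astype(np.float64)
    levels = np.arange(256)
    w0 = np.cumsum(hist)                   # pixel count at or below each level
    w1 = hist.sum() - w0                   # pixel count above each level
    s0 = np.cumsum(hist * levels)          # intensity sum at or below each level
    mu0 = s0 / np.maximum(w0, 1)           # mean of the dark class
    mu1 = (s0[-1] - s0) / np.maximum(w1, 1)  # mean of the bright class
    between = w0 * w1 * (mu0 - mu1) ** 2   # unnormalized between-class variance
    return int(np.argmax(between))


def grid_binarize(img: np.ndarray, cell: int, min_contrast: int = 30) -> np.ndarray:
    """Binarize with one Otsu threshold per grid cell (True = text pixel).

    Assumption: cells whose gray-level range is below `min_contrast` are
    treated as pure background, so Otsu does not split noise in flat areas.
    """
    out = np.zeros(img.shape, dtype=bool)
    h, w = img.shape
    for r in range(0, h, cell):
        for c in range(0, w, cell):
            patch = img[r:r + cell, c:c + cell]
            if int(patch.max()) - int(patch.min()) < min_contrast:
                continue  # nearly flat cell: leave as background
            out[r:r + cell, c:c + cell] = patch <= otsu_threshold(patch)
    return out


def multi_scale_binarize(img: np.ndarray, cells=(8, 16, 32), min_votes: int = 1) -> np.ndarray:
    """Combine binarizations from several grid sizes by voting.

    Assumption: a pixel is kept as text when at least `min_votes` scales
    mark it as text; min_votes=1 is a plain union over the scales.
    """
    votes = np.zeros(img.shape, dtype=np.int32)
    for cell in cells:
        votes += grid_binarize(img, cell)
    return votes >= min_votes
```

On a synthetic 32x32 page with gray level 200 as background, a strong stroke at level 50 and a short faint stroke at level 150, a single 32-pixel cell keeps only the strong stroke, because one threshold for the whole page falls below the faint stroke. The 8- and 16-pixel grids see the faint stroke against its local background and recover it, which is the kind of localized behavior the abstract attributes to the framework.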