A concept for the separation of foreground/ background in arabic historical manuscripts using hybrid methods

  • Authors:
  • W. Boussellaa;H. El Abed;A. Zahour

  • Affiliations:
  • Research Group on Intelligents Machines, ENIS, University of Sfax, Tunisia;Equipe Gestion Électronique de Document, University of Le Havre, France;Institut for Communications Technology, Braunschweig Technical University, Germany

  • Venue:
  • VAST'06 Proceedings of the 7th International conference on Virtual Reality, Archaeology and Intelligent Cultural Heritage
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a new color document image segmentation system suitable for historical Arabic manuscripts. Our system is composed of a hybrid method which couple together background light intensity normalization algorithm and k-means clustering with maximum likelihood (ML) estimation, for foreground/ background separation. Firstly, the background normalization algorithm performs separation between foreground and background. This foreground is used in later steps. Secondly, our algorithm proceeds on luminance and distort the contrast. These distortions are corrected with a gamma correction and contrast adjustment. Finally, the new enhanced foreground image is segmented to foreground/background on the basis of ML estimation. The initial parameters for the ML method are estimated by k-means clustering algorithm. The segmented image is used to produce a final restored document image. The techniques are tested on a set of Arabic historical manuscripts documents from the National Tunisian Library. The performance of the algorithm is demonstrated on by real color manuscripts distorted with show-through effects, uneven background color and localized spot.