Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents

  • Authors:
  • Itay Bar-Yosef;Isaac Beckman;Klara Kedem;Itshak Dinstein

  • Affiliations:
  • Ben Gurion University, Computer Science Department, 84105, Beer-Sheva, Israel;Ben Gurion University, Computer Science Department, 84105, Beer-Sheva, Israel;Ben Gurion University, Computer Science Department, 84105, Beer-Sheva, Israel;Ben Gurion University, Electrical and Computer Engineering Department, 84105, Beer-Sheva, Israel

  • Venue:
  • International Journal on Document Analysis and Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present our work on the paleographic analysis and recognition system intended for processing of historical Hebrew calligraphy documents. The main goal is to analyze documents of different writing styles in order to identify the locations, dates, and writers of test documents. Using interactive software tools, a data base of extracted characters has been established. It now contains about 20,000 characters of 34 different writers, and will be distinctly expanded in the near future. Preliminary results of automatic extraction of pre-specified letters using the erosion operator are presented. We further propose and test topological features for handwriting style classification based on a selected subset of the Hebrew alphabet. A writer identification experiment using 34 writers yielded 100% correct classification.