IBN SINA: a database for research on processing and understanding of Arabic manuscripts images

  • Authors:
  • Reza Farrahi Moghaddam;Mohamed Cheriet;Mathias M. Adankon;Kostyantyn Filonenko;Robert Wisnovsky

  • Affiliations:
  • Synchromedia Laboratory ETS, Montréal, (QC) Canada;Synchromedia Laboratory, ETS, Montréal, (QC) Canada;Synchromedia Laboratory, ETS, Montréal, (QC) Canada;McGill University;McGill University

  • Venue:
  • DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the steps that have been undertaken in order to develop the IBN SINA database, which is designed to apply learning techniques in the processing and understanding of document images. The description of the preparation process, including preprocessing, feature extraction and labeling, is provided. The database has been evaluated using classification techniques, such as the SVM classifiers. In order to make the database compatible with these classifiers, the labels of the shapes have been translated into a set of bi-class problems. Promising results with the SVM classifiers have been obtained.