Digitizing a million books: challenges for document analysis

  • Authors:
  • K. Pramod Sankar; Vamshi Ambati; Lakshmi Pratha; C. V. Jawahar

  • Affiliations:
  • Regional Mega Scanning Centre, International Institute of Information Technology, Hyderabad, India; Institute for Software Research International, Carnegie Mellon University; Regional Mega Scanning Centre, International Institute of Information Technology, Hyderabad, India; Regional Mega Scanning Centre, International Institute of Information Technology, Hyderabad, India

  • Venue:
  • DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
  • Year:
  • 2006

Abstract

This paper describes the challenges facing the document image analysis community in building large digital libraries with diverse document categories. The challenges are identified from experience with ongoing efforts to digitize and archive one million books. A smooth workflow has been established for archiving large quantities of books, with the help of efficient image processing algorithms. However, much more research is needed to address the challenges arising from the diversity of content in digital libraries.