Proper Names Extraction from Fax Images Combining Textual and Image Features

  • Authors:
  • Laurence Likforman-Sulem;Pascal Vaillant;François Yvon

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the frame of a Unified Messaging System, acrucial task of the system is to provide the user with keyinformation on every message received, like keywordsreflecting the object of the message, or the name of thesender. However, in the case of facsimiles, thisinformation is not as easy to detect as in the case of e-mails,since no standard headers are defined. The aimof the present work is to identify and extract a specificinformation (the name of the sender) from a fax coverpage. For this purpose, methods based on imagedocument analysis (OCR recognition, physical blocksselection), and text analysis methods (optimiseddictionary lookup, local grammar rules), areimplemented to work in parallel. The fusion of theirresults brings a more accurate guess than any of themethods would achieve separately.