Latent Dirichlet allocation based writer identification in offline handwriting

  • Authors:
  • Anurag Bhardwaj;Manavender Reddy;Srirangaraj Setlur;Venu Govindaraju;Sitaram Ramachandrula

  • Affiliations:
  • University at Buffalo - SUNY, Amherst, NY;University at Buffalo - SUNY, Amherst, NY;University at Buffalo - SUNY, Amherst, NY;University at Buffalo - SUNY, Amherst, NY;HP Labs, Bangalore, India

  • Venue:
  • DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe a novel approach to Writer Identification in Offline handwriting using Latent Dirichlet Allocation. State-of-the-art methods for writer identification employ the traditional feature-classification paradigm which does not provide enough information about the handwriting attributes such as writing style which are key components in any forensic analysis of handwriting. This problem is also compounded due to lack of efficient rules for defining a particular writing style that can capture writer specific characteristics over a large dataset. We propose to address this issue by using a generative model in form of Latent Dirichlet Allocation(LDA) that automatically infers writing styles from handwritten document collection without any pre-defined set of rules. This information is then used to represent each writer as a distribution over multiple writing style for classifying any unknown writer sample. We describe our approach on two different feature sets consisting of contour angle features as well as structural and concavity features. Our experimental results show comparable performance with baseline systems and also demonstrate the efficacy of LDA for learning multiple handwriting styles.