SVD and clustering for unsupervised POS tagging

  • Authors:
  • Michael Lamar;Yariv Maron;Mark Johnson;Elie Bienenstock

  • Affiliations:
  • Brown University, Providence, RI;Bar-Ilan University, Ramat-Gan, Israel;Macquarie University, Sydney, Australia;Brown University, Providence, RI

  • Venue:
  • ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We revisit the algorithm of Schütze (1995) for unsupervised part-of-speech tagging. The algorithm uses reduced-rank singular value decomposition followed by clustering to extract latent features from context distributions. As implemented here, it achieves state-of-the-art tagging accuracy at considerably less cost than more recent methods. It can also produce a range of finer-grained taggings, with potential applications to various tasks.