Mining relational structure from millions of books: position paper

  • Authors:
  • David A. Smith; R. Manmatha; James Allan

  • Affiliations:
  • University of Massachusetts, Amherst, Amherst, MA, USA; University of Massachusetts, Amherst, Amherst, MA, USA; University of Massachusetts, Amherst, Amherst, MA, USA

  • Venue:
  • Proceedings of the 4th ACM workshop on Online books, complementary social media and crowdsourcing
  • Year:
  • 2011

Abstract

Existing large-scale scanned book collections have many shortcomings for data-driven research, from OCR of variable quality to the lack of accurate descriptive and structural metadata. We argue that complementary research in inferring relational metadata is important in its own right to support use of these collections and that it can help to mitigate other problems with scanned book collections.