Using statistical topic models to organize and visualize large-scale architectural image databases

  • Authors:
  • Patrick Paczkowski;Julie Dorse;Holly Rushmeier

  • Affiliations:
  • Yale University;Yale University;Yale University

  • Venue:
  • ACM SIGGRAPH 2011 Posters
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Digital libraries have now become by far the preferred method for storing large image collections, making it increasingly important to improve issues of search and discovery within these databases. Content-based annotations of images, for classification purposes, have been a particularly significant area of research; however, even the best of such methods are prone to error -- potentially associating incorrect text to image features and overlooking significant features. In the broader field of classification and search, Newman et al. demonstrated benefits of using statistical topic models to enrich document metadata. [Newman et al. 2007] This assumes that a document can be described with sufficient accuracy using a "bag of words" model, i.e. described by the words it contains (instances from a large vocabulary of words), and the frequency with which they occur. [Horster et al. 2007] applied this idea to large-scale image datasets, using SIFT features as data for creating a topic model.