Incorporating domain knowledge in latent topic models

  • Authors:
  • Mark Craven; Xiaojin Zhu; David Michael Andrzejewski

  • Affiliations:
  • The University of Wisconsin - Madison; The University of Wisconsin - Madison; The University of Wisconsin - Madison

  • Venue:
  • Incorporating domain knowledge in latent topic models
  • Year:
  • 2010

Abstract

Latent topic models can be used to automatically decompose a collection of text documents into their constituent topics. This representation is useful for both exploratory browsing and other tasks such as information retrieval. However, learned topics may not necessarily be meaningful to the user or well aligned with modeling goals. In this thesis we develop novel methods for enabling topic models to take advantage of side information, domain knowledge, and user guidance and feedback. These methods are used to enhance topic model analyses across a variety of datasets, including non-text domains.