Analyzing entities and topics in news articles using statistical topic models

  • Authors:
  • David Newman;Chaitanya Chemudugunta;Padhraic Smyth;Mark Steyvers

  • Affiliations:
  • Department of Computer Science, UC Irvine, Irvine, CA;Department of Computer Science, UC Irvine, Irvine, CA;Department of Computer Science, UC Irvine, Irvine, CA;Department of Cognitive Science, UC Irvine, Irvine, CA

  • Venue:
  • ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Statistical language models can learn relationships between topics discussed in a document collection and persons, organizations and places mentioned in each document. We present a novel combination of statistical topic models and named-entity recognizers to jointly analyze entities mentioned (persons, organizations and places) and topics discussed in a collection of 330,000 New York Times news articles. We demonstrate an analytic framework which automatically extracts from a large collection: topics; topic trends; and topics that relate entities.