Says who? Automatic text-based content analysis of television news

  • Authors:
  • Carlos Castillo; Gianmarco De Francisci Morales; Marcelo Mendoza; Nasir Khan

  • Affiliations:
  • QCRI, Doha, Qatar; Yahoo! Research, Barcelona, Spain; Yahoo! Research, Santiago, Chile; Al Jazeera, Doha, Qatar

  • Venue:
  • Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing
  • Year:
  • 2013

Abstract

We perform an automatic analysis of television news programs, based on the closed captions that accompany them. Specifically, we collect all the news broadcast on more than 140 television channels in the US over a period of six months. We start by segmenting, processing, and annotating the closed captions automatically. Next, we analyze their linguistic style and their mentions of people using NLP methods. We present a series of key insights about news providers and people in the news, and we discuss the biases that can be uncovered by automatic means. We contrast these insights by examining the data from multiple points of view, including a qualitative assessment.