Identification of the minimal set of attributes that maximizes the information towards the author of a political discourse: the case of the candidates in the mexican presidential elections

  • Authors:
  • Antonio Neme;Sergio Hernández;Vicente Carrión

  • Affiliations:
  • Complex Systems Group, Universidad Autónoma de la Ciudad de México, México, D.F., México,Institute for Molecular Medicine, Finland;Postgraduation Program in Complex Systems, Universidad Autónoma de la Ciudad de México, México;CINVESTAV IDS, México D.F., México

  • Venue:
  • MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Computational Intelligence - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Authorship attribution has attracted the attention of the natural language processing and machine learning communities in the past few years. Here we are interested in finding a general measure of the style followed in the texts from the three main candidates in the Mexican presidential elections of 2012. We analyzed dozens of texts (discourses) from the three authors. We applied tools from the time series processing field and machine learning community in order to identify the overall attributes that define the writing style of the three authors. Several attributes and time series were extracted from each text. A novel methodology, based in mutual information, was applied on those time series and attributes to explore the relevance of each attribute to linearly separate the texts accordingly to their authorship. We show that less than 20 variables are enough to identify, by means of a linear recognizer, the authorship of a text from within one of the three considered authors.