Distinguishing between positive and negative opinions with complex network features

  • Authors:
  • Diego R. Amancio;Renato Fabbri;Osvaldo N. Oliveira, Jr.;Maria G. V. Nunes;Luciano da F. Costa

  • Affiliations:
  • University of São Paulo, São Carlos, São Paulo, Brazil;University of São Paulo, São Carlos, São Paulo, Brazil;University of São Paulo, São Carlos, São Paulo, Brazil;University of São Paulo, São Carlos, São Paulo, Brazil;University of São Paulo, São Carlos, São Paulo, Brazil

  • Venue:
  • TextGraphs-5 Proceedings of the 2010 Workshop on Graph-based Methods for Natural Language Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topological and dynamic features of complex networks have proven to be suitable for capturing text characteristics in recent years, with various applications in natural language processing. In this article we show that texts with positive and negative opinions can be distinguished from each other when represented as complex networks. The distinction was possible by obtaining several metrics of the networks, including the in-degree, out-degree, shortest paths, clustering coefficient, betweenness and global efficiency. For visualization, the obtained multidimensional dataset was projected into a 2-dimensional space with the canonical variable analysis. The distinction was quantified using machine learning algorithms, which allowed an recall of 70% in the automatic discrimination for the negative opinions, even without attempts to optimize the pattern recognition process.