Identifying automatic posting systems in microblogs

  • Authors:
  • Gustavo Laboreiro;Luís Sarmento;Eugénio Oliveira

  • Affiliations:
  • Faculdade de Engenharia da Universidade do Porto, DEI, LIACC, Portugal;Faculdade de Engenharia da Universidade do Porto, DEI, LIACC and SAPO Labs Porto, Portugal;Faculdade de Engenharia da Universidade do Porto, DEI, LIACC, Portugal

  • Venue:
  • EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we study the problem of identifying systems that automatically inject non-personal messages in micro-blogging message streams, thus potentially biasing results of certain information extraction procedures, such as opinion-mining and trend analysis. We also study several classes of features, namely features based on the time of posting, the client used to post, the presence of links, the user interaction and the writing style. This last class of features, that we introduce here for the first time, is proved to be a top performer, achieving accuracy near the 90%, on par with the best features previously used for this task.