Towards an on-line analysis of tweets processing

  • Authors:
  • Sandra Bringay;Nicolas Béchet;Flavien Bouillot;Pascal Poncelet;Mathieu Roche;Maguelonne Teisseire

  • Affiliations:
  • LIRMM - CNRS, Univ. Montpellier 2, France and Dept MIAp, Univ. Montpellier 3, France;INRIA Rocquencourt - Domaine de Voluceau, France;LIRMM - CNRS, Univ. Montpellier 2, France;LIRMM - CNRS, Univ. Montpellier 2, France;LIRMM - CNRS, Univ. Montpellier 2, France;LIRMM - CNRS, Univ. Montpellier 2, France and CEMAGREF - UMR TETIS, France

  • Venue:
  • DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Tweets exchanged over the Internet represent an important source of information, even if their characteristics make them difficult to analyze (a maximum of 140 characters, etc.). In this paper, we define a data warehouse model to analyze large volumes of tweets by proposing measures relevant in the context of knowledge discovery. The use of data warehouses as a tool for the storage and analysis of textual documents is not new but current measures are not well-suited to the specificities of the manipulated data. We also propose a new way for extracting the context of a concept in a hierarchy. Experiments carried out on real data underline the relevance of our proposal.