Mining Twitter in the Cloud: A Case Study

Authors:
Pieter Noordhuis;Michiel Heijkoop;Alexander Lazovik
Affiliations:
-;-;-
Venue:
CLOUD '10 Proceedings of the 2010 IEEE 3rd International Conference on Cloud Computing
Year:
2010

Citing 0
Cited 4

Automated Twitter data collecting tool for data mining in social network

Proceedings of the 2012 ACM Research in Applied Computation Symposium
Automated Twitter data collecting tool and case study with rule-based analysis

Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
Does social contact matter?: modelling the hidden web of trust underlying twitter

Proceedings of the 22nd international conference on World Wide Web companion
CUVIM: extracting fresh information from social network

WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Mining and analyzing data from social networks can be difficult because of the large amounts of data involved. Such activities are usually very expensive, as they require a lot of computational resources. With the recent success of cloud computing, data analysis is going to be more accessible due to easier access to less expensive computational resources. In this work we propose to use cloud computing services as a possible solution for analysis of large amounts of data. As a source for a large data set, we propose to use Twitter, yielding a graph with 50 million nodes and 1.8 billion edges. In this paper, we use computation of PageRank on Twitter’s social graph to investigate whether or not cloud computing, and Amazon cloud services1 in particular, can make these tasks more feasible and, as a side effect, whether or not PageRank provides a good ranking of Twitter users.