Finding critical blocks of information diffusion in social networks

  • Authors:
  • Ende Zhang;Guoren Wang;Kening Gao;Ge Yu

  • Affiliations:
  • Computing Center, Northeastern University, China,College of Information Science & Engineering, Northeastern University, China;College of Information Science & Engineering, Northeastern University, China;Computing Center, Northeastern University, China;Computing Center, Northeastern University, China,College of Information Science & Engineering, Northeastern University, China

  • Venue:
  • WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The diffusion of information is taking place every place and every time over the Internet. The widely used web applications of online social networks, have many benefits to serve as a medium for fast, widespread information diffusion platforms. While there is a substantial works on how to maximize the diffusion of useful information, there are many misinformation diffusing on social networks. How to control the misinformation diffusing efficiently with the smallest cost is still a big challenge. We tackle this challenge by reducing the problem to finding the critical blocks. The critical blocks are the sets of nodes that partition the whole network evenly at a small cost, and we believe they play a key role during the process of diffusion. We prove such problem of finding critical blocks is NP-complete and therefore an exact solution is infeasible to get. A simple but effective solution is proposed by the following steps: first we convert a social network graph into a Laplacian matrix, then we compute its Fiedler Vector, which has been proved to have good properties, with the help of Fiedler Vector, we develop some heuristic algorithms to find critical blocks. We also perform lots of experiments both on synthetic data and real world datasets of Twitter, the experimental results show that our algorithm is effective and efficient both on synthetic data and real world data.