Computationally efficient link prediction in a variety of social networks

Authors:
Michael Fire;Lena Tenenboim-Chekina;Rami Puzis;Ofrit Lesser;Lior Rokach;Yuval Elovici
Affiliations:
Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel;Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel;Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel;Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel;Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel;Ben-Gurion University of the Negev, Be'er Sheva, 84105, Israel
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
Year:
2014

Citing 17
Cited 0

Propagation of trust and distrust

Proceedings of the 13th international conference on World Wide Web
Editorial: special issue on learning from imbalanced data sets

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Link prediction approach to collaborative filtering

Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Introduction to Data Mining, (First Edition)

Introduction to Data Mining, (First Edition)
The link-prediction problem for social networks

Journal of the American Society for Information Science and Technology
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Measurement and analysis of online social networks

Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
2008 Special Issue: Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance

Neural Networks
On the evolution of user interaction in Facebook

Proceedings of the 2nd ACM workshop on Online social networks
Learning when training data are costly: the effect of class distribution on tree induction

Journal of Artificial Intelligence Research
Scalable proximity estimation and link prediction in online social networks

Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Predicting positive and negative links in online social networks

Proceedings of the 19th international conference on World wide web
New perspectives and methods in link prediction

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Social Network Data Analytics

Social Network Data Analytics
Friendship and mobility: user movement in location-based social networks

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Link prediction via matrix factorization

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online social networking sites have become increasingly popular over the last few years. As a result, new interdisciplinary research directions have emerged in which social network analysis methods are applied to networks containing hundreds of millions of users. Unfortunately, links between individuals may be missing either due to an imperfect acquirement process or because they are not yet reflected in the online network (i.e., friends in the real world did not form a virtual connection). The primary bottleneck in link prediction techniques is extracting the structural features required for classifying links. In this article, we propose a set of simple, easy-to-compute structural features that can be analyzed to identify missing links. We show that by using simple structural features, a machine learning classifier can successfully identify missing links, even when applied to a predicament of classifying links between individuals with at least one common friend. We also present a method for calculating the amount of data needed in order to build more accurate classifiers. The new Friends measure and Same community features we developed are shown to be good predictors for missing links. An evaluation experiment was performed on ten large social networks datasets: Academia.edu, DBLP, Facebook, Flickr, Flixster, Google+, Gowalla, TheMarker, Twitter, and YouTube. Our methods can provide social network site operators with the capability of helping users to find known, offline contacts and to discover new friends online. They may also be used for exposing hidden links in online social networks.