Propagation of trust and distrust
Proceedings of the 13th international conference on World Wide Web
Editorial: special issue on learning from imbalanced data sets
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Link prediction approach to collaborative filtering
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Introduction to Data Mining, (First Edition)
Introduction to Data Mining, (First Edition)
The link-prediction problem for social networks
Journal of the American Society for Information Science and Technology
Statistical Comparisons of Classifiers over Multiple Data Sets
The Journal of Machine Learning Research
Measurement and analysis of online social networks
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
On the evolution of user interaction in Facebook
Proceedings of the 2nd ACM workshop on Online social networks
Learning when training data are costly: the effect of class distribution on tree induction
Journal of Artificial Intelligence Research
Scalable proximity estimation and link prediction in online social networks
Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Predicting positive and negative links in online social networks
Proceedings of the 19th international conference on World wide web
New perspectives and methods in link prediction
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Social Network Data Analytics
Friendship and mobility: user movement in location-based social networks
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Link prediction via matrix factorization
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Hi-index | 0.00 |
Online social networking sites have become increasingly popular over the last few years. As a result, new interdisciplinary research directions have emerged in which social network analysis methods are applied to networks containing hundreds of millions of users. Unfortunately, links between individuals may be missing either due to an imperfect acquirement process or because they are not yet reflected in the online network (i.e., friends in the real world did not form a virtual connection). The primary bottleneck in link prediction techniques is extracting the structural features required for classifying links. In this article, we propose a set of simple, easy-to-compute structural features that can be analyzed to identify missing links. We show that by using simple structural features, a machine learning classifier can successfully identify missing links, even when applied to a predicament of classifying links between individuals with at least one common friend. We also present a method for calculating the amount of data needed in order to build more accurate classifiers. The new Friends measure and Same community features we developed are shown to be good predictors for missing links. An evaluation experiment was performed on ten large social networks datasets: Academia.edu, DBLP, Facebook, Flickr, Flixster, Google+, Gowalla, TheMarker, Twitter, and YouTube. Our methods can provide social network site operators with the capability of helping users to find known, offline contacts and to discover new friends online. They may also be used for exposing hidden links in online social networks.