Fast and exact top-k search for random walk with restart

Authors:
Yasuhiro Fujiwara;Makoto Nakatsuji;Makoto Onizuka;Masaru Kitsuregawa
Affiliations:
NTT Cyber Space Labs, and The University of Tokyo;NTT Cyber Space Labs;NTT Cyber Space Labs;The University of Tokyo
Venue:
Proceedings of the VLDB Endowment
Year:
2012

Abstract

Graphs are fundamental data structures and have been employed for centuries to model real-world systems and phenomena. Random walk with restart (RWR) provides a good proximity score between two nodes in a graph, and it has been used successfully in many applications such as automatic image captioning, recommender systems, and link prediction. The goal of this work is to find the k nodes that have the highest proximities to a given node. Previous approaches to this problem find nodes efficiently at the expense of exactness. The main motivation of this paper is to answer, in the affirmative, the question 'Is it possible to improve the search time without sacrificing exactness?' Our solution, K-dash, is based on two ideas: (1) it computes the proximity of a selected node efficiently using sparse matrices, and (2) it skips unnecessary proximity computations when searching for the top-k nodes. Theoretical analyses show that K-dash guarantees exact results. We perform comprehensive experiments to verify the efficiency of K-dash. The results show that K-dash finds the top-k nodes significantly faster than the previous approaches while guaranteeing exactness.
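
To make the problem statement concrete, the following is a minimal sketch of a baseline top-k RWR search. It is not K-dash: it uses neither the sparse-matrix computation nor the pruning described in the abstract. It iterates the standard RWR recurrence p = (1 - c) A p + c q to convergence for every node and then picks the k largest scores. The function names, the adjacency-dict input, the restart probability c = 0.15, and the assumption that every node has at least one outgoing edge are all illustrative choices, not taken from the paper.

```python
import heapq


def rwr_scores(neighbors, query, restart=0.15, tol=1e-12, max_iter=10_000):
    """Approximate RWR proximities of all nodes with respect to `query`.

    `neighbors` maps each node to a non-empty list of its out-neighbors
    (assumption: no dangling nodes). The walker follows a uniformly random
    out-edge with probability 1 - restart and jumps back to `query` otherwise.
    """
    nodes = list(neighbors)
    p = {v: 0.0 for v in nodes}
    p[query] = 1.0  # start with all probability mass on the query node
    for _ in range(max_iter):
        nxt = {v: 0.0 for v in nodes}
        nxt[query] = restart  # restart term c * q
        for u in nodes:
            # Walk term (1 - c) * A p: u splits its mass evenly among neighbors.
            share = (1.0 - restart) * p[u] / len(neighbors[u])
            for v in neighbors[u]:
                nxt[v] += share
        delta = sum(abs(nxt[v] - p[v]) for v in nodes)
        p = nxt
        if delta < tol:  # stop once the L1 change is negligible
            break
    return p


def top_k(neighbors, query, k, restart=0.15):
    """Return the k nodes with the highest RWR proximity to `query`."""
    scores = rwr_scores(neighbors, query, restart)
    return heapq.nlargest(k, scores, key=scores.get)


# Tiny star graph: "a" is linked to "b" and "c", which link back to "a".
graph = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
scores = rwr_scores(graph, "a")
ranking = top_k(graph, "a", 2)
```

Because this baseline scores every node before ranking, its cost grows with the whole graph on each query; that per-query cost is what the two ideas listed in the abstract aim to reduce while keeping the ranking exact.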