P-Rank: a comprehensive structural similarity measure over information networks

Authors:
Peixiang Zhao;Jiawei Han;Yizhou Sun
Affiliations:
University of Illinois at Urbana-Champaign, Urbana, IL, USA;University of Illinois at Urbana-Champaign, Urbana, IL, USA;University of Illinois at Urbana-Champaign, Urbana, IL, USA
Venue:
Proceedings of the 18th ACM conference on Information and knowledge management
Year:
2009

Abstract

With the ubiquity of information networks and their broad applications, computing the similarity between entities of an information network has drawn extensive research interest. However, effectively and comprehensively measuring "how similar two entities are within an information network" is nontrivial, and the problem becomes even more challenging when the network under examination is massive and diverse. In this paper, we propose a new similarity measure, P-Rank (Penetrating Rank), for effectively computing the structural similarity of entities in real information networks. P-Rank enriches the well-known similarity measure SimRank by jointly encoding both in-link and out-link relationships into structural similarity computation. P-Rank is proven to be a unified structural similarity framework under which the state-of-the-art similarity measures, including CoCitation, Coupling, Amsler and SimRank, are all special cases. Based on the recursive nature of P-Rank, we propose a fixed-point algorithm that reinforces the structural similarity of vertex pairs beyond the localized neighborhood scope, toward the entire information network. Our experimental studies demonstrate the power of P-Rank as an effective similarity measure in different information networks. Moreover, under the same time and space complexity, P-Rank outperforms SimRank as a comprehensive and more meaningful structural similarity measure, especially in large real information networks.
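
The recursive, in-link plus out-link idea described in the abstract can be illustrated with a small sketch. The abstract does not state the recurrence itself, so the update rule below is an assumption based on the form usually given for P-Rank: the similarity of two distinct vertices is a damped, weighted mix of the average similarity of their in-neighbors and the average similarity of their out-neighbors, and every vertex has similarity 1 with itself. The function name `p_rank` and the parameters `lam` (in-link versus out-link weight), `decay` (damping factor), and `iterations` are hypothetical names chosen for illustration, not taken from the paper.

```python
from itertools import product


def p_rank(edges, lam=0.5, decay=0.8, iterations=10):
    """Naive fixed-point iteration of an assumed P-Rank-style recurrence.

    edges: iterable of (src, dst) pairs of a simple directed graph.
    lam:   weight of the in-link term; (1 - lam) weights the out-link term.
    decay: damping factor in (0, 1).
    Returns a dict mapping (a, b) to the similarity score after `iterations` rounds.
    """
    edges = list(edges)
    nodes = sorted({v for edge in edges for v in edge})
    in_nbrs = {v: [] for v in nodes}
    out_nbrs = {v: [] for v in nodes}
    for src, dst in edges:
        out_nbrs[src].append(dst)
        in_nbrs[dst].append(src)

    # Base case: a vertex is fully similar to itself, and to nothing else yet.
    sim = {(a, b): 1.0 if a == b else 0.0 for a in nodes for b in nodes}

    def avg_pair_sim(nbrs_a, nbrs_b, prev):
        # Average previous-round similarity over all neighbor pairs;
        # defined as 0 when either neighbor set is empty.
        if not nbrs_a or not nbrs_b:
            return 0.0
        total = sum(prev[(x, y)] for x, y in product(nbrs_a, nbrs_b))
        return total / (len(nbrs_a) * len(nbrs_b))

    for _ in range(iterations):
        nxt = {}
        for a in nodes:
            for b in nodes:
                if a == b:
                    nxt[(a, b)] = 1.0
                    continue
                in_part = avg_pair_sim(in_nbrs[a], in_nbrs[b], sim)
                out_part = avg_pair_sim(out_nbrs[a], out_nbrs[b], sim)
                nxt[(a, b)] = decay * (lam * in_part + (1.0 - lam) * out_part)
        sim = nxt
    return sim


# Toy graph: u points to a and b; a points to w; b points to x.
toy_edges = [("u", "a"), ("u", "b"), ("a", "w"), ("b", "x")]
scores = p_rank(toy_edges, lam=0.5, decay=0.8, iterations=50)
```

Under this assumed recurrence, setting `lam=1.0` uses only in-links, which corresponds to a SimRank-style update, while `lam=0.0` uses only out-links. The sketch recomputes every vertex pair in every round, so it is only suitable for small graphs and makes no attempt at the efficiency considerations the paper discusses.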