On the Evaluation of Large and Sparse Graph Reachability Queries

  • Authors: Yangjun Chen
  • Affiliations: Department of Applied Computer Science, University of Winnipeg, Winnipeg, Canada R3B 2E9
  • Venue: DEXA '08: Proceedings of the 19th International Conference on Database and Expert Systems Applications
  • Year: 2008

Abstract

Given a directed graph G with n nodes and e edges, it is often necessary to check whether a node v is reachable from another node u through a path. This problem is fundamental to numerous applications, including geographic navigation, Internet routing, ontology queries based on RDF/OWL, metabolic networks, and XML indexing. Some of these applications involve huge but sparse graphs and require fast answers to reachability queries. In this paper, we propose a novel method called core labeling to handle reachability queries for massive, sparse graphs. The goal is to optimize both query time and labeling time. Our method consists of two schemes: Core-I and Core-II. For the Core-I labeling scheme, both the time and space requirements are bounded by O(n + e + s·b), where s is the number of start nodes of all non-tree edges (edges that do not appear in the spanning tree of G), and b is the width of a subgraph of G. The size of that subgraph is bounded by O(t), where t is the number of non-tree edges. The query time of Core-I is bounded by O(log b). The Core-II labeling scheme has constant query time, but its labeling time increases to O(n + e + s·b·log b).
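
The quantities s and t in the bounds above can be made concrete with a small sketch. The code below is not the core labeling method itself (the abstract does not give its details). It only illustrates the setting, under these assumptions: nodes are numbered 0..n-1, a DFS spanning forest stands in for the spanning tree of G, and the function names (spanning_forest_labels, tree_reachable, bfs_reachable) are hypothetical. It assigns interval labels over the forest, collects the non-tree edges, and shows that interval containment alone answers only tree-path queries, so non-tree edges need extra handling.

```python
from collections import deque


def spanning_forest_labels(n, edges):
    """Build a DFS spanning forest of a directed graph on nodes 0..n-1.

    Returns (start, end, non_tree): start[v] is the preorder number of v,
    end[v] is the largest preorder number in v's subtree, and non_tree
    lists every edge that was not used as a forest edge.
    """
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
    start = [-1] * n
    end = [-1] * n
    non_tree = []
    counter = 0
    for root in range(n):
        if start[root] != -1:
            continue
        start[root] = counter
        counter += 1
        stack = [(root, iter(adj[root]))]  # iterative DFS, no recursion limit
        while stack:
            u, it = stack[-1]
            advanced = False
            for v in it:
                if start[v] == -1:          # unvisited: (u, v) is a tree edge
                    start[v] = counter
                    counter += 1
                    stack.append((v, iter(adj[v])))
                    advanced = True
                    break
                non_tree.append((u, v))     # already visited: non-tree edge
            if not advanced:
                end[u] = counter - 1
                stack.pop()
    return start, end, non_tree


def tree_reachable(start, end, u, v):
    """True if v lies in u's subtree, i.e. a path of tree edges leads u to v."""
    return start[u] <= start[v] <= end[u]


def bfs_reachable(n, edges, u, v):
    """Ground-truth reachability by breadth-first search, O(n + e) per query."""
    adj = [[] for _ in range(n)]
    for a, b in edges:
        adj[a].append(b)
    seen = [False] * n
    seen[u] = True
    queue = deque([u])
    while queue:
        x = queue.popleft()
        if x == v:
            return True
        for y in adj[x]:
            if not seen[y]:
                seen[y] = True
                queue.append(y)
    return False


if __name__ == "__main__":
    n = 6
    edges = [(0, 1), (0, 2), (1, 3), (2, 3), (3, 4), (5, 0)]
    start, end, non_tree = spanning_forest_labels(n, edges)
    t = len(non_tree)                      # number of non-tree edges
    s = len({u for u, _ in non_tree})      # distinct start nodes of non-tree edges
    print("non-tree edges:", non_tree, "t =", t, "s =", s)
    print("tree path 0 -> 4:", tree_reachable(start, end, 0, 4))
    print("tree path 2 -> 4:", tree_reachable(start, end, 2, 4))
    print("any path 2 -> 4:", bfs_reachable(n, edges, 2, 4))
```

In this example node 4 is reachable from node 2 only through the non-tree edge (2, 3), so the interval test misses it while the BFS finds it. Closing that gap without paying a full traversal per query is the role of a labeling scheme such as the one the abstract proposes.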