Efficiency and precision trade-offs in graph summary algorithms

  • Authors:
  • Stéphane Campinas;Renaud Delbru;Giovanni Tummarello

  • Affiliations:
  • National University of Ireland, Galway;National University of Ireland, Galway;National University of Ireland, Galway

  • Venue:
  • Proceedings of the 17th International Database Engineering & Applications Symposium
  • Year:
  • 2013

Abstract

In many applications, it is convenient to substitute a large data graph with a smaller homomorphic graph. This paper investigates approaches for summarising massive data graphs. Massive data graphs are generally processed using a shared-nothing infrastructure such as MapReduce. However, accurate graph summarisation algorithms are suboptimal in this kind of environment because they require multiple iterations over the data graph. We investigate approximate graph summarisation algorithms that are efficient to compute in a shared-nothing infrastructure. We define a model for assessing the quality of a summary with regard to a gold standard summary. We evaluate the trade-offs between the efficiency and precision of the algorithms over several datasets. With regard to an application, the experiments highlight the need to trade off the precision and volume of a graph summary against the complexity of the summarisation technique.
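
To make the idea of a smaller homomorphic graph concrete, the following is a minimal Python sketch, not the algorithm from the paper. It assumes a graph given as labelled edge triples and groups nodes by the set of their outgoing edge labels; that grouping rule, the function names `summarise` and `outgoing_labels`, and the toy data are all hypothetical choices made for illustration. A grouping of this kind only needs each node's own edges, so it could plausibly be computed in a single pass, whereas more accurate summaries typically refine the groups over several iterations.

```python
from collections import defaultdict


def outgoing_labels(edges):
    """Return a key function mapping a node to the set of its outgoing labels.

    Assumption: this local grouping rule is illustrative only.
    """
    out = defaultdict(set)
    for source, label, _ in edges:
        out[source].add(label)
    return lambda node: frozenset(out.get(node, ()))


def summarise(edges, key):
    """Collapse nodes sharing the same key into one summary node.

    Returns the node-to-block mapping and the set of summary edges.
    """
    block = {}
    for source, _, target in edges:
        for node in (source, target):
            block.setdefault(node, key(node))
    summary_edges = {(block[s], label, block[t]) for s, label, t in edges}
    return block, summary_edges


# Hypothetical toy data graph as (source, label, target) triples.
edges = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("alice", "worksFor", "acme"),
    ("bob", "worksFor", "acme"),
    ("carol", "knows", "alice"),
]

block, summary = summarise(edges, outgoing_labels(edges))

# Homomorphism check: every data edge must map onto a summary edge.
is_homomorphism = all(
    (block[s], label, block[t]) in summary for s, label, t in edges
)
```

In this toy graph, `alice` and `bob` fall into the same block because both have `knows` and `worksFor` edges, so the five data edges are represented by fewer summary edges. The mapping is a homomorphism by construction, which is what `is_homomorphism` verifies; how precise such a summary is relative to a gold standard is a separate question that this sketch does not address.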