Improved sampling for triangle counting with MapReduce

  • Authors:
  • Jin-Hyun Yoon; Sung-Ryul Kim

  • Affiliations:
  • Division of Internet and Media, Konkuk University, Seoul, Korea; Division of Internet and Media, Konkuk University, Seoul, Korea

  • Venue:
  • ICHIT'11 Proceedings of the 5th international conference on Convergence and hybrid information technology
  • Year:
  • 2011

Abstract

The problem of counting the triangles in a graph has attracted growing attention in the last few years because of its importance in many data mining applications. Recently, Tsourakakis et al. proposed DOULION, which is based on a simple sampling idea but works very well on many important graphs. In this preliminary report, we show that DOULION may not be very accurate on special cases of graphs and argue that it may not fulfill the main purpose of the triangle counting problem for real-world graphs. We then present improvements on DOULION and show that they perform better than the original DOULION, in some cases much better.
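
For readers unfamiliar with the sampling idea mentioned above, the following is a minimal single-machine sketch of DOULION-style estimation as it is commonly described: keep each edge independently with probability p, count the triangles in the sparsified graph exactly, and scale the count by 1/p^3. The function names `count_triangles` and `doulion_estimate` are illustrative assumptions, not taken from the paper, and the sketch covers neither the MapReduce implementation nor the improvements the abstract refers to.

```python
import random
from itertools import combinations


def count_triangles(edges):
    """Exact triangle count of an undirected simple graph given as an edge list."""
    # Normalize edges: drop self-loops and duplicates, ignore orientation.
    unique = {tuple(sorted(e)) for e in edges if e[0] != e[1]}
    adj = {}
    for u, v in unique:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    # Each triangle is seen once per edge, i.e. three times in total.
    per_edge = sum(len(adj[u] & adj[v]) for u, v in unique)
    return per_edge // 3


def doulion_estimate(edges, p, rng=None):
    """Hypothetical helper: sample edges with probability p, rescale by 1/p^3."""
    rng = rng or random.Random()
    kept = [e for e in edges if rng.random() < p]
    return count_triangles(kept) / p ** 3


if __name__ == "__main__":
    # Complete graph on 6 vertices: C(6, 3) = 20 triangles.
    k6 = list(combinations(range(6), 2))
    print("exact:", count_triangles(k6))
    print("estimate (p=0.5):", doulion_estimate(k6, 0.5, random.Random(0)))
```

A triangle survives sampling only if all three of its edges are kept, which happens with probability p^3, so the rescaled count is an unbiased estimate. On a graph with very few triangles the estimate can still be far off in any single run: for a lone triangle it is either 0 or 1/p^3. This is only an illustration of the variance of the basic scheme, not necessarily one of the special cases the authors examine.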