Extracting Research Communities by Improved Maximum Flow Algorithm

  • Authors:
  • Toshihiko Horiike;Youhei Takahashi;Tetsuji Kuboyama;Hiroshi Sakamoto

  • Affiliations:
  • Kyushu Institute of Technology, Iizuka, Japan 820-8502;Kyushu Institute of Technology, Iizuka, Japan 820-8502;Gakushuin University, Tokyo, Japan 171-8588;Kyushu Institute of Technology, Iizuka, Japan 820-8502

  • Venue:
  • KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose an algorithm, which is an improvement of identification of web communities by [1], to extract research communities from bibliography data. Web graph is huge graph structure consisting nodes and edges, which represent web pages and hyperlinks. An web community is considered to be a set of web pages holding a common topic, in other words, it is a dense subgraph of web graph. Such subgraphs obtained by the max-flow algorithm [1] are called max-flow communities . We then improve this algorithm by introducing the strategy for selection of community nodes. The effectiveness of our improvement is shown by experiments on finding research communities from CiteSeer bibliography data.