An improved algorithm for extracting research communities from bibliographic data

  • Authors:
  • Yushi Nakamura;Toshihiko Horiike;Yoshimasa Taira;Hiroshi Sakamoto

  • Affiliations:
  • Kyushu Institute of Technology, Iizuka-shi, Fukuoka, Japan;Kyushu Institute of Technology, Iizuka-shi, Fukuoka, Japan;Kyushu Institute of Technology, Iizuka-shi, Fukuoka, Japan;Kyushu Institute of Technology, Iizuka-shi, Fukuoka, Japan and PRESTO, JST, Kawaguchi-shi, Saitama, Japan

  • Venue:
  • DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we improve the performance of the community extraction algorithm in [1] from bibliographic data, which was originally proposed for web community discovery by [2]. A web community is considered to be a set of web pages holding a common topic, in other words, it is a dense subgraph induced in web graph. Such subgraphs obtained by the max-flow algorithm are called max-flow communities, and this algorithm was improved to obtain research communities from bibliographic data by the strategy for selection of community nodes in [1]. We propose an improvement of this algorithm by carefully selecting initial seed node, and show the performance of this algorithm by experiments for the list of many keywords frequently appearing in data.