Extracting research communities from bibliographic data

  • Authors:
  • Yushi Nakamura;Toshihiko Horiike;Tetsuji Kuboyama;Hiroshi Sakamoto

  • Affiliations:
  • Kyushu Institute of Technology, Fukuoka, Japan;Kyushu Institute of Technology, Fukuoka, Japan;Gakushuin University, Tokyo, Japan;Kyushu Institute of Technology, Fukuoka, Japan and JST PRESTO, 4-1-8 Honcho Kawaguchi, Saitama, Japan

  • Venue:
  • International Journal of Knowledge-based and Intelligent Engineering Systems - Intelligent Information Processing: Techniques and Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We develop a research community extraction algorithm from large bibliographic data, which was preliminarily reported in Horiike et al. [10] and Nakamura et al. [18]. A research community in bibliographic data is considered to be a set of the linked texts holding a common topic, in other words, it is a dense subgraph embedded in the directed graph. Our method is based on the maximum flow algorithm for finding web communities by Flake et al. [5]. We propose improvements of the algorithm to select community nodes and initial seeds taking account of the restriction that any directed graph is acyclic. We examine the improved algorithm for the list of keywords frequently appearing in the bibliographic data. In addition we propose a simple method to extract characteristic keywords for deciding initial seed nodes. This method is also evaluated by experiments.