Extracting research communities from bibliographic data

Authors:
Yushi Nakamura;Toshihiko Horiike;Tetsuji Kuboyama;Hiroshi Sakamoto
Affiliations:
Kyushu Institute of Technology, Fukuoka, Japan;Kyushu Institute of Technology, Fukuoka, Japan;Gakushuin University, Tokyo, Japan;Kyushu Institute of Technology, Fukuoka, Japan and JST PRESTO, 4-1-8 Honcho Kawaguchi, Saitama, Japan
Venue:
International Journal of Knowledge-based and Intelligent Engineering Systems - Intelligent Information Processing: Techniques and Applications
Year:
2012

Citing 19
Cited 0

A new approach to the maximum flow problem

STOC '86 Proceedings of the eighteenth annual ACM symposium on Theory of computing
Inferring Web communities from link topology

Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems
Automatic resource compilation by analyzing hyperlink structure and associated text

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Trawling the Web for emerging cyber-communities

WWW '99 Proceedings of the eighth international conference on World Wide Web
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Theoretical Improvements in Algorithmic Efficiency for Network Flow Problems

Journal of the ACM (JACM)
Efficient identification of Web communities

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Creating a Web community chart for navigating related communities

Proceedings of the 12th ACM conference on Hypertext and Hypermedia
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Effects of maximum flow algorithm on identifying web community

Proceedings of the 4th international workshop on Web information and data management
Self-Organization and Identification of Web Communities

Computer
Extracting Large-Scale Knowledge Bases from the Web

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Partitioning of Web graphs by community topology

WWW '05 Proceedings of the 14th international conference on World Wide Web
Mining Communities on the Web Using a Max-Flow and a Site-Oriented Framework

IEICE - Transactions on Information and Systems
Graph Theory and Its Applications, Second Edition (Discrete Mathematics and Its Applications)

Graph Theory and Its Applications, Second Edition (Discrete Mathematics and Its Applications)
Efficient sequential access pattern mining for web recommendations

International Journal of Knowledge-based and Intelligent Engineering Systems
Extracting Research Communities by Improved Maximum Flow Algorithm

KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II
Empirical comparison of algorithms for network community detection

Proceedings of the 19th international conference on World wide web
An improved algorithm for extracting research communities from bibliographic data

DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop a research community extraction algorithm from large bibliographic data, which was preliminarily reported in Horiike et al. [10] and Nakamura et al. [18]. A research community in bibliographic data is considered to be a set of the linked texts holding a common topic, in other words, it is a dense subgraph embedded in the directed graph. Our method is based on the maximum flow algorithm for finding web communities by Flake et al. [5]. We propose improvements of the algorithm to select community nodes and initial seeds taking account of the restriction that any directed graph is acyclic. We examine the improved algorithm for the list of keywords frequently appearing in the bibliographic data. In addition we propose a simple method to extract characteristic keywords for deciding initial seed nodes. This method is also evaluated by experiments.