Exploiting edge semantics in citation graphs using efficient, vertical ARM

  • Authors:
  • Imad Rahal;Dongmei Ren;Weihua Wu;Anne Denton;Christopher Besemann;William Perrizo

  • Affiliations:
  • Department of Computer Science, Peter Engel Science Center (room 211), Saint John's University, 56321-3000, Collegeville, MN, USA;Computer Science and Operations Research Department, North Dakota State University, 56321-3000, Fargo, ND, USA;Computer Science and Operations Research Department, North Dakota State University, 56321-3000, Fargo, ND, USA;Computer Science and Operations Research Department, North Dakota State University, 56321-3000, Fargo, ND, USA;Computer Science and Operations Research Department, North Dakota State University, 56321-3000, Fargo, ND, USA;Computer Science and Operations Research Department, North Dakota State University, 56321-3000, Fargo, ND, USA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Graphs are increasingly becoming a vital source of information within which a great deal of semantics is embedded. As the size of available graphs increases, our ability to arrive at the embedded semantics grows into a much more complicated task. One form of important hidden semantics is that which is embedded in the edges of directed graphs. Citation graphs serve as a good example in this context. This paper attempts to understand temporal aspects in publication trends through citation graphs, by identifying patterns in the subject matters of scientific publications using an efficient, vertical association rule mining model. Such patterns can (a) indicate subject-matter evolutionary history, (b) highlight subject-matter future extensions, and (c) give insights on the potential effects of current research on future research. We highlight our major differences with previous work in the areas of graph mining, citation mining, and Web-structure mining, propose an efficient vertical data representation model, introduce a new subjective interestingness measure for evaluating patterns with a special focus on those patterns that signify strong associations between properties of cited papers and citing papers, and present an efficient algorithm for the purpose of discovering rules of interest followed by a detailed experimental analysis.