Modelling citation networks for improving scientific paper classification performance

  • Authors:
  • Mengjie Zhang;Xiaoying Gao;Minh Duc Cao;Yuejin Ma

  • Affiliations:
  • School of Mathematics, Statistics and Computer Science, Victoria Univ. of Wellington, Wellington, New Zealand and Artificial Intelligence Research Centre, College of Mechanical and Electrical Eng. ...;School of Mathematics, Statistics and Computer Science, Victoria Univ. of Wellington, Wellington, New Zealand and Artificial Intelligence Research Centre, College of Mechanical and Electrical Eng. ...;School of Mathematics, Statistics and Computer Science, Victoria University of Wellington, Wellington, New Zealand;Artificial Intelligence Research Centre, College of Mechanical and Electrical Engineering, Agricultural University of Hebei, Baoding, China

  • Venue:
  • PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes an approach to the use of citation links to improve the scientific paper classification performance. In this approach, we develop two refinement functions, a linear label refinement (LLR) and a probabilistic label refinement (PLR), to model the citation link structures of the scientific papers for refining the class labels of the documents obtained by the content-based Naive Bayes classification method. The approach with the two new refinement models is examined and compared with the content-based Naive Bayes method on a standard paper classification data set with increasing training set sizes. The results suggest that both refinement models can significantly improve the system performance over the content-based method for all the training set sizes and that PLR is better than LLR when the training examples are sufficient.