Using Abstract Information and Community Alignment Information for Link Prediction

  • Authors:
  • Mrinmaya Sachan;Ryutaro Ichise

  • Affiliations:
  • -;-

  • Venue:
  • ICMLC '10 Proceedings of the 2010 Second International Conference on Machine Learning and Computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Although there have been many recent studies of link prediction in co-authorship networks, few have tried to utilize the Semantic information hidden in abstracts of the research documents. We propose to build a link predictor in a co-authorship network where nodes represent researchers and links represent co-authorship. In this method, we use the structure of the constructed graph, and propose to add a semantic approach using abstract information, research titles and the event information to improve the accuracy of the predictor. Secondly, we make use of the fact that researchers tend to work in close knit communities. The knowledge of a pair of researchers lying in the same dense community can be used to improve the accuracy of our predictor further. Finally, we test out hypothesis on the DBLP database in a reasonable time by under-sampling and balancing the data set using decision trees and the SMOTE technique.