Using complex networks concepts to assess approaches for citations in scientific papers

  • Authors:
  • D. R. Amancio;M. G. Nunes;O. N. Oliveira, Jr.;L. F. Costa

  • Affiliations:
  • Institute of Physics of São Carlos, University of São Paulo, São Carlos, Brazil 13560-970;Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, Brazil 13560-970;Institute of Physics of São Carlos, University of São Paulo, São Carlos, Brazil 13560-970;Institute of Physics of São Carlos, University of São Paulo, São Carlos, Brazil 13560-970

  • Venue:
  • Scientometrics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The number of citations received by authors in scientific journals has become a major parameter to assess individual researchers and the journals themselves through the impact factor. A fair assessment therefore requires that the criteria for selecting references in a given manuscript should be unbiased with regard to the authors or journals cited. In this paper, we assess approaches for citations considering two recommendations for authors to follow while preparing a manuscript: (i) consider similarity of contents with the topics investigated, lest related work should be reproduced or ignored; (ii) perform a systematic search over the network of citations including seminal or very related papers. We use formalisms of complex networks for two datasets of papers from the arXiv and the Web of Science repositories to show that neither of these two criteria is fulfilled in practice. By representing the texts as complex networks we estimated a similarity index between pieces of texts and found that the list of references did not contain the most similar papers in the dataset. This was quantified by calculating a consistency index, whose maximum value is one if the references in a given paper are the most similar in the dataset. For the areas of "complex networks" and "graphenes", the consistency index was only 0.11---0.23 and 0.10---0.25, respectively. To simulate a systematic search in the citation network, we employed a traditional random walk search (i.e. diffusion) and a random walk whose probabilities of transition are proportional to the number of the ingoing edges of the neighbours. The frequency of visits to the nodes (papers) in the network had a very small correlation with either the actual list of references in the papers or with the number of downloads from the arXiv repository. Therefore, apparently the authors and users of the repository did not follow the criterion related to a systematic search over the network of citations. Based on these results, we propose an approach that we believe is fairer for evaluating and complementing citations of a given author, effectively leading to a virtual scientometry.