A new variant of the Pathfinder algorithm to generate large visual science maps in cubic time

  • Authors:
  • A. Quirin;O. Cordón;J. Santamaría;B. Vargas-Quesada;F. Moya-Anegón

  • Affiliations:
  • European Centre for Soft Computing, Edificio Científico Tecnológico, 33600 Mieres, Spain;European Centre for Soft Computing, Edificio Científico Tecnológico, 33600 Mieres, Spain;Department of Software Engineering, University of Cádiz, Cádiz, Spain;SCImago Group, Library and Information Science Faculty, University of Granada, 18071 Granada, Spain;SCImago Group, Library and Information Science Faculty, University of Granada, 18071 Granada, Spain

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the last few years, there is an increasing interest to generate visual representations of very large scientific domains. A methodology based on the combined use of ISI-JCR category cocitation and social networks analysis through the use of the Pathfinder algorithm has demonstrated its ability to achieve high quality, schematic visualizations for these kinds of domains. Now, the next step would be to generate these scientograms in an on-line fashion. To do so, there is a need to significantly decrease the run time of the latter pruning technique when working with category cocitation matrices of a large dimension like the ones handled in these large domains (Pathfinder has a time complexity order of O(n^4), with n being the number of categories in the cocitation matrix, i.e., the number of nodes in the network). Although a previous improvement called Binary Pathfinder has already been proposed to speed up the original algorithm, its significant time complexity reduction is not enough for that aim. In this paper, we make use of a different shortest path computation from classical approaches in computer science graph theory to propose a new variant of the Pathfinder algorithm which allows us to reduce its time complexity in one order of magnitude, O(n^3), and thus to significantly decrease the run time of the implementation when applied to large scientific domains considering the parameter q=n-1. Besides, the new algorithm has a much simpler structure than the Binary Pathfinder as well as it saves a significant amount of memory with respect to the original Pathfinder by reducing the space complexity to the need of just storing two matrices. An experimental comparison will be developed using large networks from real-world domains to show the good performance of the new proposal.