Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database

  • Authors:
  • Xinhai Liu;Shi Yu;Frizo Janssens;Wolfgang Glänzel;Yves Moreau;Bart De Moor

  • Affiliations:
  • Katholieke Universiteit Leuven, ESAT-SCD, Kasteelpark Arenberg 10, B3001, Leuven, Belgium and Wuhan University of Science and Technology (WUST), College of Information Science and Engineering, Hep ...;Katholieke Universiteit Leuven, ESAT-SCD, Kasteelpark Arenberg 10, B3001, Leuven, Belgium;Katholieke Universiteit Leuven, ESAT-SCD, Kasteelpark Arenberg 10, B3001, Leuven, Belgium;Katholieke Universiteit Leuven, Centre for R&D Monitoring, Department of Managerial Economics, Strategy and Innovation, Dekenstraat 2, B3000, Leuven, Belgium and Hungarian Academy of Sciences, IRP ...;Katholieke Universiteit Leuven, ESAT-SCD, Kasteelpark Arenberg 10, B3001, Leuven, Belgium;Katholieke Universiteit Leuven, ESAT-SCD, Kasteelpark Arenberg 10, B3001, Leuven, Belgium

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel-fusion clustering. To improve the flexibility and the efficiency of processing large-scale data, we propose an information-based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross-compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields. © 2010 Wiley Periodicals, Inc.