Do second-order similarities provide added-value in a hybrid approach?

Authors:
Bart Thijs;Edgar Schiebel;Wolfgang Glänzel
Affiliations:
Centre for R&D Monitoring (ECOOM) and Department of MSI, KU Leuven, Leuven, Belgium;AIT Austrian Institute of Technology GmbH, Vienna, Austria;Centre for R&D Monitoring (ECOOM) and Department of MSI, KU Leuven, Leuven, Belgium and Department of Science Policy and Scientometrics, LHAS, Budapest, Hungary
Venue:
Scientometrics
Year:
2013

Citing 9
Cited 0

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Journal of Computational and Applied Mathematics
Science and technology mapping: a new iteration model for representing multidimensional relationships

Journal of the American Society for Information Science - Special issue on science and technology indicators
Finding content-bearing terms using term similarities

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?

Journal of the American Society for Information Science and Technology
Using `core documents' for the representation of clusters and topics

Scientometrics
Experimental comparison of first and second-order similarities in a scientometric context

Scientometrics
Visualization of research fronts and knowledge bases by three-dimensional areal densities of bibliographically coupled publications and co-citations

Scientometrics
Using `core documents' for detecting and labelling new emerging topics

Scientometrics
The role of core documents in bibliometric network analysis and their relation with h-type indices

Scientometrics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent studies on first- and second-order similarities have shown that the latter one outperforms the first one as input for document clustering or partitioning applications. First-order similarities based on bibliographic coupling or on lexical approaches come with specific methodological issues like sparse matrices, sensitive to spelling variances or context differences. Second-order similarities were proposed to tackle these problems and take the lexical context into account. But also a hybrid combination of both types of similarities proved an important improvement which integrates the strengths of the two approaches and diminishes their weaknesses. In this paper we extend the notion of second-order similarity by applying it in the context of the hybrid approach. We conclude that there is no added value for the clearly defined clusters but that the second-order similarity can provide an additional viewpoint for the more general clusters.