Author disambiguation using multi-aspect similarity indicators

Authors:
Thomas Gurney;Edwin Horlings;Peter Van Den Besselaar
Affiliations:
Rathenau Institute, The Hague, The Netherlands 2593 HW;Rathenau Institute, The Hague, The Netherlands 2593 HW;VU University Amsterdam, Amsterdam, The Netherlands 1081HV
Venue:
Scientometrics
Year:
2012

Citing 12
Cited 2

Mapping change in scientific specialties: a scientometric reconstruction of the development of artificial intelligence

Journal of the American Society for Information Science
A technique for computer detection and correction of spelling errors

Communications of the ACM
Comparison of Schema Matching Evaluations

Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems
Search engine driven author disambiguation

Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Handbook of Quantitative Science and Technology Research: The Use of Publication and Patent Statistics in Studies of S&T Systems

Handbook of Quantitative Science and Technology Research: The Use of Publication and Patent Statistics in Studies of S&T Systems
Efficient topic-based unsupervised name disambiguation

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Author Name Disambiguation for Citations Using Topic and Web Correlation

ECDL '08 Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries
On co-authorship for author disambiguation

Information Processing and Management: an International Journal
A Term-Based Driven Clustering Approach for Name Disambiguation

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Citation analysis

Annual Review of Information Science and Technology
A method for eliminating articles by homonymous authors from the large number of articles retrieved by author search

Journal of the American Society for Information Science and Technology
Efficient name disambiguation for large-scale databases

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases

A semi-supervised approach for author disambiguation in KDD CUP 2013

Proceedings of the 2013 KDD Cup 2013 Workshop
Author name disambiguation in scientific collaboration and mobility cases

Scientometrics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that does this disambiguation task with a very high recall and precision. The method addresses the issues of discarded records due to null data fields and their resultant effect on recall, precision and F-measure results. We have implemented a dynamic approach to similarity calculations based on all available data fields. We have also included differences in author contribution and age difference between publications, both of which have meaningful effects on overall similarity measurements, resulting in significantly higher recall and precision of returned records. The results are presented from a test dataset of heterogeneous catalysis publications. Results demonstrate significantly high average F-measure scores and substantial improvements on previous and stand-alone techniques.