Record linkage: similarity measures and algorithms

  • Authors: Nick Koudas; Sunita Sarawagi; Divesh Srivastava
  • Affiliations: University of Toronto; IIT Bombay; AT&T Labs-Research
  • Venue: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data
  • Year: 2006

Abstract

This tutorial provides a comprehensive and cohesive overview of the key research results on record linkage: the methodologies and algorithms for identifying approximate duplicate records, and the tools available for this purpose. It covers techniques introduced in several communities, including databases, information retrieval, statistics, and machine learning. It aims to identify the similarities and differences among these techniques, as well as their merits and limitations.