Identity resolution: 23 years of practical experience and observations at scale

  • Authors:
  • Jeff Jonas

  • Affiliations:
  • IBM Entity Analytic Solutions, Las Vegas, NV

  • Venue:
  • Proceedings of the 2006 ACM SIGMOD international conference on Management of data
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Identity Resolution is a semantic reconciliation activity as applied to people and organizations. Identity resolution is most frequently quantified in terms of accuracy (false positives and false negatives), however, there are additional metrics by which to evaluate identity resolution algorithms including: methodology, persistence, streaming versus batch, data survivorship, operationalizing historical data, transaction/window size, ingestion speed, end-to-end latency, sequence neutrality, handling of ambiguous conditions, reconcilability, scalability, sustainability, and operational characteristics at scale. As well, a technique for "analytics in the anonymized data space" will be presented that makes it possible to resolve identities in a more privacy-preserving manner.