Feature-based entity matching: the FBEM model, implementation, evaluation

Authors:
Heiko Stoermer;Nataliya Rassadko;Nachiket Vaidya
Affiliations:
University of Trento, Dept. of Information and Communication Tech., Trento, Italy;University of Trento, Dept. of Information and Communication Tech., Trento, Italy;University of Trento, Dept. of Information and Communication Tech., Trento, Italy
Venue:
CAiSE'10 Proceedings of the 22nd international conference on Advanced information systems engineering
Year:
2010

Citing 9
Cited 3

The merge/purge problem for large databases

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Learning object identification rules for information integration

Information Systems - Data extraction, cleaning and reconciliation
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
Semantic integration: a survey of ontology-based approaches

ACM SIGMOD Record
Reference reconciliation in complex information spaces

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Pair-Wise entity resolution: overview and challenges

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Duplicate Record Detection: A Survey

IEEE Transactions on Knowledge and Data Engineering
Entity Name System: The Back-Bone of an Open and Scalable Web of Data

ICSC '08 Proceedings of the 2008 IEEE International Conference on Semantic Computing
A novel approach for entity linkage

IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration

Unsupervised learning of link discovery configuration

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
OtO matching system: a multi-strategy approach to instance matching

CAiSE'12 Proceedings of the 24th international conference on Advanced Information Systems Engineering
Evaluation of instance matching tools: The experience of OAEI

Web Semantics: Science, Services and Agents on the World Wide Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Entity matching or resolution is at the heart of many integration tasks in modern information systems. As with any core functionality, good quality of results is vital to ensure that upper-level tasks perform as desired. In this paper we introduce the FBEM algorithm and illustrate its usefulness for general-purpose use cases. We analyze its result quality with a range of experiments on heterogeneous data sources, and show that the approach provides good results for entities of different types, such as persons, organizations or publications, while posing minimal requirements to input data formats and requiring no training.