The merge/purge problem for large databases
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Learning object identification rules for information integration
Information Systems - Data extraction, cleaning and reconciliation
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Semantic integration: a survey of ontology-based approaches
ACM SIGMOD Record
Reference reconciliation in complex information spaces
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Pair-Wise entity resolution: overview and challenges
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Entity Name System: The Back-Bone of an Open and Scalable Web of Data
ICSC '08 Proceedings of the 2008 IEEE International Conference on Semantic Computing
A novel approach for entity linkage
IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration
Unsupervised learning of link discovery configuration
ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
OtO matching system: a multi-strategy approach to instance matching
CAiSE'12 Proceedings of the 24th international conference on Advanced Information Systems Engineering
Evaluation of instance matching tools: The experience of OAEI
Web Semantics: Science, Services and Agents on the World Wide Web
Hi-index | 0.00 |
Entity matching or resolution is at the heart of many integration tasks in modern information systems. As with any core functionality, good quality of results is vital to ensure that upper-level tasks perform as desired. In this paper we introduce the FBEM algorithm and illustrate its usefulness for general-purpose use cases. We analyze its result quality with a range of experiments on heterogeneous data sources, and show that the approach provides good results for entities of different types, such as persons, organizations or publications, while posing minimal requirements to input data formats and requiring no training.