Data integration using similarity joins and a word-based information representation language
ACM Transactions on Information Systems (TOIS)
Measuring similarity between collection of values
Proceedings of the 6th annual ACM international workshop on Web information and data management
Automatic threshold estimation for data matching applications
SBBD '08 Proceedings of the 23rd Brazilian symposium on Databases
Evaluation of entity resolution approaches on real-world match problems
Proceedings of the VLDB Endowment
Automatic threshold estimation for data matching applications
Information Sciences: an International Journal
Hi-index | 0.00 |
Approximate data matching applications typically use similarity functions to quantify the degree of likeness between two data instances. There are several similarity functions available, thus, it is often necessary to evaluate a number of them aiming at choosing the function that is more adequate to a specific application. This paper presents a tool that uses average precision and discernability to evaluate the quality of similarity functions over a data set.