Markov random field modeling in computer vision
Markov random field modeling in computer vision
IEEE Transactions on Pattern Analysis and Machine Intelligence
The String-to-String Correction Problem
Journal of the ACM (JACM)
An Extension of the String-to-String Correction Problem
Journal of the ACM (JACM)
Communications of the ACM
Dynamic Programming
Dynamic Programming: Models and Applications
Dynamic Programming: Models and Applications
IEEE Transactions on Pattern Analysis and Machine Intelligence
Joint deduplication of multiple record types in relational data
Proceedings of the 14th ACM international conference on Information and knowledge management
Learning the structure of Markov logic networks
ICML '05 Proceedings of the 22nd international conference on Machine learning
Adaptive Name Matching in Information Integration
IEEE Intelligent Systems
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Entity Resolution with Markov Logic
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Bioinformatics
Detecting Duplicate Biological Entities Using Markov Random Field-Based Edit Distance
BIBM '08 Proceedings of the 2008 IEEE International Conference on Bioinformatics and Biomedicine
DTMBIO '10 Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informatics
De-duplication of aggregation authority files
International Journal of Metadata, Semantics and Ontologies
De-duplication of aggregation authority files
International Journal of Metadata, Semantics and Ontologies
Hi-index | 0.00 |
Duplicate entity detection in biological data is an important research task. In this paper, we propose a novel and context-sensitive Shortest Path Edit Distance (SPED) extending and supplementing our previous work on Markov Random Field-based Edit Distance (MRFED). SPED transforms the edit distance computational problem to the calculation of the shortest path among two selected vertices of a graph. We produce several modifications of SPED by applying Levenshtein, arithmetic mean, histogram difference and TFIDF techniques to solve subtasks. We compare SPED performance to other well-known distance algorithms for biological entity matching. The experimental results show that SPED produces competitive outcomes.