Resolving semantic heterogeneity across distinct data sources remains an open and highly relevant problem in the GIS domain. Our approach, called GSim, semantically aligns tables from different GIS databases by first choosing attributes for comparison. We then examine the instances of those attributes and calculate a similarity value between them, called entropy-based distribution (EBD), by combining two separate methods. Our primary method discerns the geographic types of the instances of the compared attributes. If it succeeds, EBD is calculated using this method alone. GSim supports geographic type matching in two further ways: it uses latitude/longitude values to disambiguate between multiple candidate types of a given instance, and it applies attribute weighting to quantify the uniqueness of mapped attributes. If geographic type matching is not possible, we fall back on a generic schema matching method, independent of the knowledge domain, which employs normalized Google distance. We evaluate GSim on multi-jurisdictional datasets and show that it is more effective than traditional approaches.
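
As a rough illustration of the fallback step, normalized Google distance (NGD) between two terms is commonly defined from search hit counts as NGD(x, y) = (max(log f(x), log f(y)) - log f(x, y)) / (log N - min(log f(x), log f(y))). The sketch below implements only that standard formula. The function name, its parameters, and the hit counts are hypothetical, and how GSim obtains the counts or applies the distance to attribute instances is not described here, so this should not be read as the authors' implementation.

```python
import math


def normalized_google_distance(fx, fy, fxy, n):
    """Standard NGD formula computed from raw hit counts.

    fx, fy : number of indexed pages containing term x, term y
    fxy    : number of indexed pages containing both terms
    n      : total number of indexed pages (a normalizing constant)

    All four arguments are assumed inputs; this sketch does not query
    any search engine.
    """
    if fx <= 0 or fy <= 0 or fxy <= 0:
        # Terms that never occur (or never co-occur) are treated as
        # maximally distant in this sketch.
        return float("inf")
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))


# Made-up counts, for illustration only.
identical = normalized_google_distance(100, 100, 100, 10_000)
partial = normalized_google_distance(100, 100, 10, 10_000)
print(identical, partial)
```

Smaller values indicate terms that co-occur more often relative to how often each occurs alone. Terms that always appear together score 0, and terms that never co-occur are treated as infinitely distant in this sketch.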