Enhanced geographically typed semantic schema matching

Authors:
Jeffrey Partyka;Pallabi Parveen;Latifur Khan;B. Thuraisingham;Shashi Shekhar
Affiliations:
Department of Computer Science, University of Texas at Dallas, 800 West Campbell Rd., Richardson, TX 75080-3021, USA;Department of Computer Science, University of Texas at Dallas, 800 West Campbell Rd., Richardson, TX 75080-3021, USA;Department of Computer Science, University of Texas at Dallas, 800 West Campbell Rd., Richardson, TX 75080-3021, USA;Department of Computer Science, University of Texas at Dallas, 800 West Campbell Rd., Richardson, TX 75080-3021, USA;Department of Computer Science, University of Minnesota, 4-192 EE/CS Bldg, 200 Union St. SE, Minneapolis, MN, USA
Venue:
Web Semantics: Science, Services and Agents on the World Wide Web
Year:
2011

Citing 30
Cited 0

Towards general measures of comparison of objects

Fuzzy Sets and Systems - Special issue dedicated to the memory of Professor Arnold Kaufmann
SEMINT: a tool for identifying attribute correspondences in heterogeneous databases using neural networks

Data & Knowledge Engineering
Determining Semantic Similarity among Entity Classes from Different Ontologies

IEEE Transactions on Knowledge and Data Engineering
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Autoplex: Automated Discovery of Content for Virtual Databases

CooplS '01 Proceedings of the 9th International Conference on Cooperative Information Systems
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
Binomial coefficient computation: recursion or iteration?

ACM SIGCSE Bulletin
Geographical information recognition and visualization in texts written in various languages

Proceedings of the 2004 ACM symposium on Applied computing
Discovering personal gazetteers: an interactive clustering approach

Proceedings of the 12th annual ACM international workshop on Geographic information systems
Automatic direct and indirect schema mapping: experiences and lessons learned

ACM SIGMOD Record
Introduction to Data Mining, (First Edition)

Introduction to Data Mining, (First Edition)
Putting context into schema matching

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Multi-column substring matching for database schema translation

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Query result ranking over e-commerce web databases

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Ontology Matching

Ontology Matching
A visual tool for ontology alignment to enable geospatial interoperability

Journal of Visual Languages and Computing
Discovering personally meaningful places: An interactive clustering approach

ACM Transactions on Information Systems (TOIS)
The Google Similarity Distance

IEEE Transactions on Knowledge and Data Engineering
The locative web

Proceedings of the first international workshop on Location and the web
Inferring generic activities and events from image content and bags of geo-tags

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Integrating gazetteers and remote sensed imagery

Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems
Validating Multi-column Schema Matchings by Type

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
SIM-DLA: A Novel Semantic Similarity Measure for Description Logics Reducing Inter-concept to Inter-instance Similarity

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
The Geolocation of Web Logs from Textual Clues

CSE '09 Proceedings of the 2009 International Conference on Computational Science and Engineering - Volume 04
Improving binary classification on text problems using differential word features

Proceedings of the 18th ACM conference on Information and knowledge management
AgreementMaker: efficient matching for large real-world schemas and ontologies

Proceedings of the VLDB Endowment
LinkedGeoData: Adding a Spatial Dimension to the Web of Data

ISWC '09 Proceedings of the 8th International Semantic Web Conference
Semantic similarity of ontology instances tailored on the application context

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I
Geospatial semantics: why, of what, and how?

Journal on Data Semantics III
Comparing representations of geographic knowledge expressed as conceptual graphs

GeoS'05 Proceedings of the First international conference on GeoSpatial Semantics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Resolving semantic heterogeneity across distinct data sources remains a highly relevant problem in the GIS domain requiring innovative solutions. Our approach, called GSim, semantically aligns tables from respective GIS databases by first choosing attributes for comparison. We then examine their instances and calculate a similarity value between them called entropy-based distribution (EBD) by combining two separate methods. Our primary method discerns the geographic types from instances of compared attributes. If successful, EBD is calculated using only this method. GSim further facilitates geographic type matching by using latlong values to further disambiguate between multiple types of a given instance and applying attribute weighting to quantify the uniqueness of mapped attributes. If geographic type matching is not possible, we then apply a generic schema matching method, independent of the knowledge domain, which employs normalized Google distance. We show the effectiveness of our approach over the traditional approaches across multi-jurisdictional datasets by generating impressive results.