A large dataset for the evaluation of ontology matching

Authors:
Fausto Giunchiglia;Mikalai Yatskevich;Paolo Avesani;Pavel Shivaiko
Affiliations:
Department of information engineering and computer science (disi), university of trento, 38050 povo, trento, italy;Department of information engineering and computer science (disi), university of trento, 38050 povo, trento, italy;Fondazione bruno kessler, via sommarive 18, 38050 povo, trento, italy;Department of information engineering and computer science (disi), university of trento, 38050 povo, trento, italy
Venue:
The Knowledge Engineering Review
Year:
2009

Citing 32
Cited 11

Semantic integration of semistructured and structured data sources

ACM SIGMOD Record
Information Retrieval

Information Retrieval
Generic Schema Matching with Cupid

Proceedings of the 27th International Conference on Very Large Data Bases
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
On schema matching with opaque column names and data values

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Learning to match ontologies on the Semantic Web

The VLDB Journal — The International Journal on Very Large Data Bases
The PROMPT suite: interactive tools for ontology merging and mapping

International Journal of Human-Computer Studies
Ontology mapping: the state of the art

The Knowledge Engineering Review
Semantic matching

The Knowledge Engineering Review
iMAP: discovering complex semantic matches between database schemas

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Semantic integration: a survey of ontology-based approaches

ACM SIGMOD Record
A framework for modeling and evaluating automatic semantic reconciliation

The VLDB Journal — The International Journal on Very Large Data Bases
Supporting user-subjective categorization with self-organizing maps and learning vector quantization

Journal of the American Society for Information Science and Technology
Semantic-integration research in the database community

AI Magazine - Special issue on semantic integration
Automatic complex schema matching across Web query interfaces: A correlation mining approach

ACM Transactions on Database Systems (TODS)
Constructing virtual documents for ontology matching

Proceedings of the 15th international conference on World Wide Web
Ontology Matching

Ontology Matching
SAMBO-A system for aligning and merging biomedical ontologies

Web Semantics: Science, Services and Agents on the World Wide Web
Using Bayesian decision for ontology mapping

Web Semantics: Science, Services and Agents on the World Wide Web
COMA: a system for flexible combination of schema matching approaches

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Discovering Missing Background Knowledge in Ontology Matching

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Semantic precision and recall for ontology alignment evaluation

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Integrating multiple internet directories by instance-based learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Encoding classifications into lightweight ontologies

Journal on data semantics VIII
A large scale taxonomy mapping evaluation

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
Bootstrapping ontology alignment methods with APFEL

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
Guidelines for benchmarking the performance of ontology management APIs

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
Detecting similarities in ontologies with the SOQA-SimPack toolkit

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Holistic schema matching for web query interfaces

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Efficient semantic matching

ESWC'05 Proceedings of the Second European conference on The Semantic Web: research and Applications
A survey of schema-based matching approaches

Journal on Data Semantics IV

Ten Challenges for Ontology Matching

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
Approximate Structure-Preserving Semantic Matching

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
Scaling alignment of large ontologies

International Journal of Bioinformatics Research and Applications
Lightweight parsing of classifications into lightweight ontologies

ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Ontology alignment evaluation initiative: six years of experience

Journal on data semantics XV
An evaluation of ontology matching in geo-service applications

Geoinformatica
Automatically structuring domain knowledge from text: An overview of current research

Information Processing and Management: an International Journal
A Query-based Approach for Semi-Automatic Annotation of Web Services

International Journal of Information Systems and Social Change
S-match: an open source framework for matching lightweight ontologies

Semantic Web
Ontology matching benchmarks: Generation, stability, and discriminability

Web Semantics: Science, Services and Agents on the World Wide Web
S-Match: An open source framework for matching lightweight ontologies

Semantic Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recently, the number of ontology matching techniques and systems has increased significantly. This makes the issue of their evaluation and comparison more severe. One of the challenges of the ontology matching evaluation is in building large-scale evaluation datasets. In fact, the number of possible correspondences between two ontologies grows quadratically with respect to the numbers of entities in these ontologies. This often makes the manual construction of the evaluation datasets demanding to the point of being infeasible for large-scale matching tasks. In this paper, we present an ontology matching evaluation dataset composed of thousands of matching tasks, called TaxME2. It was built semi-automatically out of the Google, Yahoo, and Looksmart web directories. We evaluated TaxME2 by exploiting the results of almost two-dozen of state-of-the-art ontology matching systems. The experiments indicate that the dataset possesses the desired key properties, namely it is error-free, incremental, discriminative, monotonic, and hard for the state-of-the-art ontology matching systems.