A matching algorithm for electronic data interchange

Authors:
Rami Rifaieh;Uddam Chukmol;Nabila Benharkat
Affiliations:
San Diego Supercomputer Center, University of California San Diego, La Jolla, CA;Computer Science Department, Combodia Technological Institute, Phnom Penh, Cambodia;LIRIS, National Institute of Applied Science of Lyon, Villeurbanne, France
Venue:
TES'05 Proceedings of the 6th international conference on Technologies for E-Services
Year:
2005

Citing 16
Cited 2

The Clio project: managing heterogeneity

ACM SIGMOD Record
Reconciling schemas of disparate data sources: a machine-learning approach

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Learning to map between ontologies on the semantic web

Proceedings of the 11th international conference on World Wide Web
Information Retrieval: Algorithms and Heuristics

Information Retrieval: Algorithms and Heuristics
Query-based data warehousing tool

Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP
Generic Schema Matching with Cupid

Proceedings of the 27th International Conference on Very Large Data Bases
Database Schema Matching Using Machine Learning with Feature Selection

CAiSE '02 Proceedings of the 14th International Conference on Advanced Information Systems Engineering
Comparison of Schema Matching Evaluations

Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems
Discovering Direct and Indirect Matches for Schema Elements

DASFAA '03 Proceedings of the Eighth International Conference on Database Systems for Advanced Applications
Rondo: a programming platform for generic model management

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
On schema matching with opaque column names and data values

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
ebXML: Status, Research Issues, and Obstacles

RIDE '02 Proceedings of the 12th International Workshop on Research Issues in Data Engineering: Engineering E-Commerce/E-Business Systems (RIDE'02)
EXSMAL: EDI/XML Semi-Automatic Schema Matching ALgorithm

CEC '05 Proceedings of the Seventh IEEE International Conference on E-Commerce Technology
Information Integration with Ontologies: Experiences from an Industrial Showcase

Information Integration with Ontologies: Experiences from an Industrial Showcase
COMA: a system for flexible combination of schema matching approaches

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

Extension of Schema Matching Platform ASMADE to Constraints and Mapping Expression

Advanced Internet Based Systems and Applications
NDT-merge: a future tool for conciliating software requirements in MDE environments

Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services

Quantified Score

Hi-index	0.00

Visualization

Abstract

One of the problems in the actual electronic commerce is laid on the data heterogeneity (i.e. format and vocabulary). This representation incompatibility, particularly in the EDI (Electronic Data Interchange), is managed manually with help from a human expert consulting the usage guideline of each message to translate. This manual work is tedious, error-prone and expensive. The goal of this work is to partially automate the semantic correspondence discovery between the EDI messages of various standards by using XML Schema as the pivot format. This semi-automatic schema matching algorithm take two schemata of EDI messages as the input, compute the basic similarity between each pair of elements by comparing their textual description and data type. Then, it computes the structural similarity value basing on the structural neighbors of each element (ancestor, sibling, immediate children and leaf elements) with an aggregation function. The basic similarity and structural similarity values are used in the pair wise element similarity computing which is the final similarity value between two elements. The paper shows as well some implementation issues and a scenario of test for EX-SMAL with messages coming from EDIFACT and SWIFT standards.