Graph transformation to infer schemata from XML documents

Authors:
Luciano Baresi;Elisa Quintarelli
Affiliations:
Politecnico di Milano, Milano, Italy;Politecnico di Milano, Milano, Italy
Venue:
Proceedings of the 2005 ACM symposium on Applied computing
Year:
2005

Citing 9
Cited 1

Extracting schema from semistructured data

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
DTD inference for views of XML data

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
XTRACT: Learning Document Type Descriptors from XML Document Collections

Data Mining and Knowledge Discovery
Optimizing Regular Path Expressions Using Graph Schemas

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Querying Semi-Structured Data

ICDT '97 Proceedings of the 6th International Conference on Database Theory
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites

Proceedings of the 27th International Conference on Very Large Data Bases
The XML web: a first study

WWW '03 Proceedings of the 12th international conference on World Wide Web
A Graphical Environment to Query XML Data with XQuery

WISE '03 Proceedings of the Fourth International Conference on Web Information Systems Engineering

A graph-based approach to transform XML documents

FASE'06 Proceedings of the 9th international conference on Fundamental Approaches to Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Semi-structured data are characterized by the lack of a predefined schema. This heterogeneity simplifies the management of such data, but analysis and queries become more difficult and demand for schemata that describe these data. Super-imposed structures cannot be as general as predefined ones, but ease the retrieval of the information embedded in such data. The paper adopts XML as the language to render semi-structured data and proposes an approach - based on graph transformation techniques - to infer the schemata of XML documents.