Retrieval of semistructured Web data

  • Authors:
  • Elisa Bertino; Mohand-Saïd Hacid; Farouk Toumani

  • Affiliations:
  • Dipartimento di Scienze dell'Informazione, University of Milano, Italy; Computer Science Department, University Claude Bernard Lyon 1, France; Laboratoire LIMOS, ISIMA, Clermont-Ferrand, France

  • Venue:
  • Intelligent exploration of the web
  • Year:
  • 2003

Abstract

The ability to manage data whose structure is less rigid and strict than in conventional databases is important in many new application areas, such as biological databases, digital libraries, data integration and Web databases. Such data is called semistructured, since it cannot be constrained by a fixed predefined schema: the information that is normally associated with a schema is contained within the data, which is therefore sometimes called self-describing. Semistructured data has recently emerged as a particularly interesting research topic in which new data modelling and querying techniques are investigated.

In this paper, we consider how constraint-based technology can be used to query and reason about semistructured data. The constraint system FT≤ [37] provides information ordering constraints interpreted over feature trees. Here, we show how a generalization of FT≤ combined with path constraints allows one to formally represent semistructured data, state constraints on it, and reason about it. The constraint languages we propose make it straightforward to capture, for example, what it means for a tree to be a subtree of, or subsumed by, another, or what it means for two paths to be divergent. We establish a logical semantics for our constraints by means of axiom schemes that present our constraint system as a first-order theory. We propose using the constraint systems for querying semistructured Web data.
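
As a rough illustration of the notions named in the abstract, the following Python sketch encodes a feature tree as a mapping from paths to node labels and checks an information ordering and a subtree relation. The encoding, the function names, and the exact reading of the ordering (every path of the smaller tree occurs in the larger one with the same label) are assumptions made for this sketch; they are not taken from the paper, whose constraint system generalizes FT≤ and may define these relations differently.

```python
from typing import Dict, Tuple

# Hypothetical encoding (not from the paper): a feature tree maps each path,
# written as a tuple of feature names, to the label of the node it reaches.
# The empty tuple () denotes the root.
Path = Tuple[str, ...]
FeatureTree = Dict[Path, str]


def is_feature_tree(tree: FeatureTree) -> bool:
    """Check that the set of paths contains the root and is prefix-closed."""
    return () in tree and all(path[:-1] in tree for path in tree if path)


def less_informative(small: FeatureTree, large: FeatureTree) -> bool:
    """Assumed information ordering: every labelled path of `small`
    also occurs in `large` with the same label."""
    return all(path in large and large[path] == label
               for path, label in small.items())


def subtree_at(tree: FeatureTree, prefix: Path) -> FeatureTree:
    """Return the tree rooted at the node reached by `prefix`."""
    n = len(prefix)
    return {path[n:]: label for path, label in tree.items()
            if path[:n] == prefix}


def is_subtree(candidate: FeatureTree, tree: FeatureTree) -> bool:
    """True if `candidate` equals the tree rooted at some node of `tree`."""
    return any(subtree_at(tree, path) == candidate for path in tree)


# Illustrative data only: a partial and a fuller description of one record.
partial = {(): "paper", ("author",): "Bertino"}
full = {(): "paper", ("author",): "Bertino", ("year",): "2003"}
```

Under this reading, `less_informative(partial, full)` holds while the converse does not, which mirrors the intuition that a semistructured record may carry more or fewer features than another without violating any fixed schema.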