XML with incomplete information

  • Authors:
  • Pablo Barceló;Leonid Libkin;Antonella Poggi;Cristina Sirangelo

  • Affiliations:
  • University of Chile, Santiago, Chile;University of Edinburgh, Edinburgh, UK;Sapienza Università di Roma, Rome, Italy;ENS-Cachan and INRIA, Cedex, France

  • Venue:
  • Journal of the ACM (JACM)
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study models of incomplete information for XML, their computational properties, and query answering. While our approach is motivated by the study of relational incompleteness, incomplete information in XML documents may appear not only as null values but also as missing structural information. Our goal is to provide a classification of incomplete descriptions of XML documents, and separate features—or groups of features—that lead to hard computational problems from those that admit efficient algorithms. Our classification of incomplete information is based on the combination of null values with partial structural descriptions of documents. The key computational problems we consider are consistency of partial descriptions, representability of complete documents by incomplete ones, and query answering. We show how factors such as schema information, the presence of node ids, and missing structural information affect the complexity of these main computational problems, and find robust classes of incomplete XML descriptions that permit tractable query evaluation.