Cooperative query answering for semistructured data

Authors:
Michael Barg;Raymond K. Wong
Affiliations:
School of Computer Science & Engineering, University of New South Wales, Sydney, NSW 2052, Australia;School of Computer Science & Engineering, University of New South Wales, Sydney, NSW 2052, Australia
Venue:
ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Year:
2003

Abstract

Semistructured data, in particular XML, has emerged as one of the primary means for information exchange and content management. The power of XML allows authors to structure a document in a way that precisely captures the semantics of the data. This, however, poses a substantial barrier to casual and non-expert users who wish to query such data, because the structure of the data forms the basis of all XML query languages. Without an accurate understanding of how the data is structured, users are unable to issue meaningful queries. The problem is compounded by the fact that data adhering to different schemas are likely to be contained within the same data warehouse or federated database. This paper proposes a method that enables users to meaningfully query semistructured data with no prior knowledge of its structure. We describe a mechanism for returning approximate answers to a database query when the structure of the underlying data is unknown. Our mechanism also returns useful results to the user if a specific value in the query cannot be matched. We discuss a number of novel query processing and optimisation techniques that enable us to perform our cooperative query answering in an efficient and scalable manner.
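To make the problem setting concrete, the following is a minimal illustrative sketch and not the mechanism proposed in the paper. It assumes a toy XML document containing two differently structured records, and a hypothetical helper `find_anywhere` that looks up an element by name at any depth, so the user does not need to know the path. If the requested value cannot be matched, it falls back to returning every element with that name, which is one simple way of still giving the user something useful.

```python
import xml.etree.ElementTree as ET

# Toy data (an assumption for illustration): two records with different
# structure, as might coexist in a warehouse or federated database.
DOC = """
<library>
  <book><title>Dune</title><writer><name>Frank Herbert</name></writer></book>
  <article><heading><title>XML Search</title></heading><author>Ann Lee</author></article>
</library>
"""


def find_anywhere(root, tag, value=None):
    """Find elements named `tag` at any depth, without knowing the schema.

    Returns (hits, exact), where hits is a list of (path, text) pairs.
    If `value` is given and some element text equals it, only those
    elements are returned and exact is True. If nothing equals `value`,
    all elements named `tag` are returned and exact is False.
    """
    hits = []

    def walk(node, path):
        here = path + "/" + node.tag
        if node.tag == tag:
            hits.append((here, (node.text or "").strip()))
        for child in node:
            walk(child, here)

    walk(root, "")
    if value is None:
        return hits, True
    matching = [hit for hit in hits if hit[1] == value]
    if matching:
        return matching, True
    # Fallback: the value was not found, so show what values do exist.
    return hits, False


if __name__ == "__main__":
    root = ET.fromstring(DOC)
    for wanted in ("Dune", "Emma"):
        results, exact = find_anywhere(root, "title", wanted)
        label = "exact" if exact else "approximate"
        print(wanted, label, results)
```

A query for the title "Dune" is answered exactly even though the user never states where titles live in the tree, while a query for a title that does not exist returns the titles that do, flagged as approximate. A full traversal per query, as in this sketch, would not scale to large collections, which is the kind of cost the query processing and optimisation techniques mentioned in the abstract are meant to address.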