Semistructured data, in particular XML, has emerged as one of the primary means of information exchange and content management. XML lets authors structure a document so that it precisely captures the semantics of the data. This flexibility, however, poses a substantial barrier to casual and non-expert users who wish to query such data, because the structure of the data forms the basis of all XML query languages. Without an accurate understanding of how the data is structured, users cannot issue meaningful queries. The problem is compounded when data adhering to different schemas is likely to be held within the same data warehouse or federated database. This paper proposes a method that enables users to query semistructured data meaningfully with no prior knowledge of its structure. We describe a mechanism for returning approximate answers to a database query when the structure of the underlying data is unknown. The mechanism also returns useful results to the user when a specific value in the query cannot be matched. We discuss a number of novel query processing and optimisation techniques that allow this cooperative query answering to be performed efficiently and scalably.