Structural proximity searching for large collections of semi-structured data

Authors:
Michael Barg;Raymond K. Wong
Affiliations:
University of New South Wales, Sydney, Australia;University of New South Wales, Sydney, Australia
Venue:
Proceedings of the tenth international conference on Information and knowledge management
Year:
2001

Citing 6
Cited 13

Lore: a database management system for semistructured data

ACM SIGMOD Record
Proximal nodes: a model to query document databases by content and structure

ACM Transactions on Information Systems (TOIS)
Integrating keyword search into XML query processing

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Querying Semi-Structured Data

ICDT '97 Proceedings of the 6th International Conference on Database Theory
Proximity Search in Databases

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Answering XML Queries on Heterogeneous Data Sources

Proceedings of the 27th International Conference on Very Large Data Bases

Cooperative query answering for semistructured data

ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Interconnection semantics for keyword search in XML

Proceedings of the 14th ACM international conference on Information and knowledge management
Identifying meaningful return information for XML keyword search

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
XSEarch: a semantic search engine for XML

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Query biased snippet generation in XML search

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Reasoning and identifying relevant matches for XML keyword search

Proceedings of the VLDB Endowment
Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Return specification inference and result clustering for keyword search on XML

ACM Transactions on Database Systems (TODS)
An efficient path index for querying semi-structured data

APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
A novel XML keyword query approach using entity subtree

Journal of Systems and Software
Improving XML search by generating and utilizing informative result snippets

ACM Transactions on Database Systems (TODS)
Differentiating search results on structured data

ACM Transactions on Database Systems (TODS)
Semantic relevance ranking for XML keyword search

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

The richness of the XML data format allows data to be structured in a way which precisely captures the semantics required by the author. It is the structure of the data, however, which forms the basis of all XML query languages. Without at least some notion of the structure, a user cannot meaningfully query the data. This problem is compounded when one considers that heterogeneous data adhering to different schema are likely to exist in the database(s) being queried. This paper proposes a solution based on an efficient proximity index. In particular, we describe a family of encoding and compression schemes which enable us to build an index to efficiently implement the proximity search. Our index is extremely small, and can reflect updates in the underlying database in modest time. Experiments show that our algorithm and implementation are fast and scale well.