Value-based predicate filtering of XML documents

Authors:
Joonho Kwon;Praveen Rao;Bongki Moon;Sukho Lee
Affiliations:
School of Electrical Engineering and Computer Science, Seoul National University, Seoul 151-742, Republic of Korea;Department of Computer Science and Electrical Engineering, and University of Missouri-Kansas City, Kansas City, MO 64110, USA;Department of Computer Science, University of Arizona, Tucson, AZ 85721, USA;School of Electrical Engineering and Computer Science, Seoul National University, Seoul 151-742, Republic of Korea
Venue:
Data & Knowledge Engineering
Year:
2008

Citing 26
Cited 6

Holistic twig joins: optimal XML pattern matching

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Efficient Filtering of XML Documents for Selective Dissemination of Information

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Stream processing of XPath queries with predicates

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
XPath queries on streaming data

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient Filtering of XML Documents with XPath Expressions

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Path sharing and predicate evaluation for high-performance XML filtering

ACM Transactions on Database Systems (TODS)
PRIX: Indexing And Querying XML Using Prüfer Sequences

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Replacement strategies for XQuery caching systems

Data & Knowledge Engineering - Special issue: WIDM 2002
Efficient processing of XML twig queries with OR-predicates

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
FleXPath: flexible structure and full-text querying for XML

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Implementing a scalable XML publish/subscribe system using relational database systems

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
From region encoding to extended dewey: on efficient processing of XML twig pattern matching

VLDB '05 Proceedings of the 31st international conference on Very large data bases
FiST: scalable XML document filtering by sequencing twig patterns

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Predicate-based Filtering of XPath Expressions

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Sequencing XML data and query twigs for fast pattern matching

ACM Transactions on Database Systems (TODS)
Twig2Stack: bottom-up processing of generalized-tree-pattern queries over XML documents

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
AFilter: adaptable XML filtering with prefix-caching suffix-clustering

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Efficient algorithms for evaluating xpath over streams

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Efficient xml data dissemination with piggybacking

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Boosting topic-based publish-subscribe systems with dynamic clustering

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Massively multi-query join processing in publish/subscribe systems

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
A transducer-based XML query processor

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
A framework for using materialized XPath views in XML query processing

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Early profile pruning on XML-aware publish-subscribe systems

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Value-based notification conditions in large-scale publish/subscribe systems?

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
XML Prefiltering as a String Matching Problem

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering

Improving XML schema matching performance using Prüfer sequences

Data & Knowledge Engineering
Fast XML document filtering by sequencing twig patterns

ACM Transactions on Internet Technology (TOIT)
Distributed structural and value XML filtering

Proceedings of the Fourth ACM International Conference on Distributed Event-Based Systems
GPX-matcher: a generic boolean predicate-based XPath expression matcher

Proceedings of the 14th International Conference on Extending Database Technology
A syntactic approach to twig-query matching on XML streams

Journal of Systems and Software
An efficient algorithm of frequent XML query pattern mining for ebXML applications in e-commerce

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, publish-subscribe systems based on XML filtering have received much attention in ubiquitous computing environments and Internet applications. The main challenge is to process a large number of content against millions of user subscriptions. Several XML filtering systems focus on the efficient processing of structural matching of user subscriptions represented as XPath twig patterns. However, existing techniques provide limited or no support for twig patterns that contain various operators in the value-based predicates. In this paper, we present the pFiST system that filters XML documents by transforming twig patterns into sequences based on Prufer's method. This sequencing idea for XML filtering was first demonstrated by FiST [J. Kwon, P. Rao, B. Moon, S. Lee, FiST: scalable XML document filtering by sequencing twig patterns, in: Proceedings of the 31st VLDB Conference, Trondheim, Norway, 2005, pp. 217-228]. The focus of pFiST is to support value-based predicates in twig patterns in addition to matching their structure. The pFiST system supports equality and non-equality operators, and in addition can handle logical operators such as AND and OR in the value-based predicates. Extensive experimental results show that pFiST provides good performance over data sets with different characteristics.