Worst-case optimal algorithm for XPath evaluation over XML streams

Authors:
Prakash Ramanan
Affiliations:
EECS Department, Wichita State University, 1845 N Fairmount Ave, Wichita, KS 67260-0083, United States
Venue:
Journal of Computer and System Sciences
Year:
2009

Citing 21
Cited 7

Models and issues in data stream systems

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Holistic twig joins: optimal XML pattern matching

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Efficient Filtering of XML Documents for Selective Dissemination of Information

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Stream processing of XPath queries with predicates

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient Filtering of XML Documents with XPath Expressions

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Characterizing memory requirements for queries over continuous data streams

ACM Transactions on Database Systems (TODS)
Processing XML streams with deterministic automata and stream indexes

ACM Transactions on Database Systems (TODS)
Querying XML streams

The VLDB Journal — The International Journal on Very Large Data Bases
The complexity of XPath query evaluation and XML typing

Journal of the ACM (JACM)
Buffering in query evaluation over XML streams

Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient algorithms for processing XPath queries

ACM Transactions on Database Systems (TODS)
XSQ: A streaming XPath engine

ACM Transactions on Database Systems (TODS)
An Efficient XPath Query Processor for XML Streams

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
On the memory requirements of XPath evaluation over XML streams

Journal of Computer and System Sciences
Efficient algorithms for evaluating xpath over streams

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Holistic Join for Generalized Tree Patterns

Information Systems
A transducer-based XML query processor

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Covering indexes for XML queries: bisimulation - simulation = negation

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
The BEA/XQRL streaming XQuery processor

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Schema-based scheduling of event processors and buffer minimization for queries on structured data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Worst-case optimal algorithm for XPath evaluation over XML streams

Journal of Computer and System Sciences

Worst-case optimal algorithm for XPath evaluation over XML streams

Journal of Computer and System Sciences
Reducing redundancy of XPath query over networks by transmitting XML views

WAIM'10 Proceedings of the 2010 international conference on Web-age information management
Queries on Xml streams with bounded delay and concurrency

Information and Computation
Memory lower bounds for XPath evaluation over XML streams

Journal of Computer and System Sciences
Streamable fragments of forward XPath

CIAA'11 Proceedings of the 16th international conference on Implementation and application of automata
Eager XPath evaluation over XML streams

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
A survey on XML streaming evaluation techniques

The VLDB Journal — The International Journal on Very Large Data Bases

Quantified Score

Hi-index	0.00

Visualization

Abstract

We consider the XPath evaluation problem: Evaluate an XPath query Q on a streaming XML document D; i.e., determine the set Q(D) of document elements selected by Q. We mainly consider Conjunctive XPath queries that involve only the child and descendant axes. Previously known in-memory algorithms for this problem use O(|D|) space and O(|Q||D|) time. Several previously known algorithms for the streaming version use @W(d^n) space and @W(d^n|D|) time in the worst case; d denotes the depth of D, and n denotes the number of location steps in Q. Their exponential space requirement could well exceed the O(|D|) space used by the in-memory algorithms. We present an efficient algorithm that uses O(d|Q|+nc) space and O((|Q|+dn)|D|) time in the worst case; c denotes the maximum number of elements of D that can be candidates for output, at any one instant. For some worst case Q and D, the memory space used by our algorithm matches our lower bound proved in a different paper; so, our algorithm uses optimal memory space in the worst case.