AFilter: adaptable XML filtering with prefix-caching suffix-clustering

Authors:
K. Selçuk Candan;Wang-Pin Hsiung;Songting Chen;Junichi Tatemura;Divyakant Agrawal
Affiliations:
NEC Laboratories America, Cupertino, CA;NEC Laboratories America, Cupertino, CA;NEC Laboratories America, Cupertino, CA;NEC Laboratories America, Cupertino, CA;NEC Laboratories America, Cupertino, CA
Venue:
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Year:
2006

Citing 20
Cited 26

NiagaraCQ: a scalable continuous query system for Internet databases

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Holistic twig joins: optimal XML pattern matching

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
ToXgene: a template-based data generator for XML

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Efficient Filtering of XML Documents for Selective Dissemination of Information

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Efficient filtering of XML documents with XPath expressions

The VLDB Journal — The International Journal on Very Large Data Bases
An XML query engine for network-bound data

The VLDB Journal — The International Journal on Very Large Data Bases
Stream processing of XPath queries with predicates

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
XPath queries on streaming data

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Path sharing and predicate evaluation for high-performance XML filtering

ACM Transactions on Database Systems (TODS)
Implementing a scalable XML publish/subscribe system using relational database systems

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
The BEA streaming XQuery processor

The VLDB Journal — The International Journal on Very Large Data Bases
Processing XML streams with deterministic automata and stream indexes

ACM Transactions on Database Systems (TODS)
On the memory requirements of XPath evaluation over XML streams

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Querying XML streams

The VLDB Journal — The International Journal on Very Large Data Bases
Buffering in query evaluation over XML streams

Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
From region encoding to extended dewey: on efficient processing of XML twig pattern matching

VLDB '05 Proceedings of the 31st international conference on Very large data bases
FiST: scalable XML document filtering by sequencing twig patterns

VLDB '05 Proceedings of the 31st international conference on Very large data bases
An Efficient XPath Query Processor for XML Streams

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Query processing for high-volume XML message brokering

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Schema-based scheduling of event processors and buffer minimization for queries on structured data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30

Sum-max monotonic ranked joins for evaluating top-k twig queries on weighted data graphs

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
XFIS: an XML filtering system based on string representation and matching

International Journal of Web Engineering and Technology
Efficient processing of branch queries for high-performance XML filtering

Proceedings of the 2nd international conference on Scalable information systems
Distributed XML processing: Theory and applications

Journal of Parallel and Distributed Computing
Value-based predicate filtering of XML documents

Data & Knowledge Engineering
XML Filtering Using Dynamic Hierarchical Clustering of User Profiles

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
FMware: middleware for efficient filtering and matching of XML messages with local data

Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware
An Efficient Bottom-up Filtering of XML Messages by Exploiting the Postfix Commonality of XPath Queries

IEICE - Transactions on Information and Systems
Fast XML document filtering by sequencing twig patterns

ACM Transactions on Internet Technology (TOIT)
Ordered Backward XPath Axis Processing against XML Streams

XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies
Processing XPath queries with forward and downward axes over XML streams

Proceedings of the 13th International Conference on Extending Database Technology
Posfilter: an efficient filtering technique of XML documents based on postfix sharing

BNCOD'07 Proceedings of the 24th British national conference on Databases
Evaluating xpath queries on XML data streams

BNCOD'07 Proceedings of the 24th British national conference on Databases
XML filtering system based on ontology

Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India
Efficient evaluation of generalized tree-pattern queries on XML streams

The VLDB Journal — The International Journal on Very Large Data Bases
GPX-matcher: a generic boolean predicate-based XPath expression matcher

Proceedings of the 14th International Conference on Extending Database Technology
E-Cube: multi-dimensional event sequence analysis using hierarchical pattern query sharing

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
High-performance composite event monitoring system supporting large numbers of queries and sources

Proceedings of the 5th ACM international conference on Distributed event-based system
Mixing bottom-up and top-down XPath query evaluation

ADBIS'11 Proceedings of the 15th international conference on Advances in databases and information systems
FMware: middleware for efficient filtering and matching of XML messages with local data

Middleware'06 Proceedings of the 7th ACM/IFIP/USENIX international conference on Middleware
An automaton-based index scheme for on-demand XML data broadcast

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part II
Energy and Latency Efficient Access of Wireless XML Stream

Journal of Database Management
A survey on XML streaming evaluation techniques

The VLDB Journal — The International Journal on Very Large Data Bases
Optimized XPath evaluation for schema-compressed XML data

ADC '12 Proceedings of the Twenty-Third Australasian Database Conference - Volume 124
Analysis and optimization for boolean expression indexing

ACM Transactions on Database Systems (TODS)
A study on parallelizing XML path filtering using accelerators

ACM Transactions on Embedded Computing Systems (TECS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

XML message filtering problem involves searching for instances of a given, potentially large, set of patterns in a continuous stream of XML messages. Since the messages arrive continuously, it is essential that the filtering rate matches the data arrival rate. Therefore, the given set of filter patterns needs to be indexed appropriately to enable real-time processing of the streaming XML data. In this paper, we propose AFilter, an adaptable, and thus scalable, path expression filtering approach. AFilter has a base memory requirement linear in filter expression and data size. Furthermore, when additional memory is available, AFilter can exploit prefix commonalities in the set of filter expressions using a loosely-coupled prefix caching mechanism as opposed to tightly-coupled active state representation of alternative approaches. Unlike existing systems, AFilter can also exploit suffix-commonalities across filter expressions, while simultaneously leveraging the prefix-commonalities through the cache. Finally, AFilter uses a triggering mechanism to prevent excessive consumption of resources by delaying processing until a trigger condition is observed. Experiment results show that AFilter provides significantly better scalability and runtime performance when compared to state of the art filtering systems.