Introduction to algorithms
Data on the Web: from relations to semistructured data and XML
Data on the Web: from relations to semistructured data and XML
XMill: an efficient compressor for XML data
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
NiagaraCQ: a scalable continuous query system for Internet databases
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Efficient string matching: an aid to bibliographic search
Communications of the ACM
Monitoring XML data on the Web
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Mesh-based content routing using XML
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Handbook of Formal Languages
Optimizing Regular Path Expressions Using Graph Schemas
ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Adding Structure to Unstructured Data
ICDT '97 Proceedings of the 6th International Conference on Database Theory
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Efficient Filtering of XML Documents for Selective Dissemination of Information
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Everything You Ever Wanted to Know About DTDs, But Were Afraid to Ask (Extended Abstract)
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
An XML query engine for network-bound data
The VLDB Journal — The International Journal on Very Large Data Bases
YFilter: Efficient and Scalable Filtering of XML Documents
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Efficient Filtering of XML Documents with XPath Expressions
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
The view selection problem for XML content based routing
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Techniques for the evaluation of XML queries: a survey
Data & Knowledge Engineering
Stream processing of XPath queries with predicates
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
XPath queries on streaming data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Light-weight xPath processing of XML stream with deterministic automata
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Path sharing and predicate evaluation for high-performance XML filtering
ACM Transactions on Database Systems (TODS)
From searching text to querying XML streams
Journal of Discrete Algorithms - SPIRE 2002
Implementing a scalable XML publish/subscribe system using relational database systems
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Filtering algorithms for information retrieval models with named attributes and proximity operators
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Optimizing the lazy DFA approach for XML stream processing
ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
EXPedite: a system for encoded XML processing
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Processing XML streams with deterministic automata and stream indexes
ACM Transactions on Database Systems (TODS)
XML stream transformer generation through program composition and dependency analysis
Science of Computer Programming
Cache-Conscious Automata for XML Filtering
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Bloom Filter-Based XML Packets Filtering for Millions of Path Queries
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Distributed XML Stream Filtering System with High Scalability
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Frontiers of tractability for typechecking simple XML transformations
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On the memory requirements of XPath evaluation over XML streams
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
The VLDB Journal — The International Journal on Very Large Data Bases
Incremental maintenance for materialized XPath/XSLT views
WWW '05 Proceedings of the 14th international conference on World Wide Web
Buffering in query evaluation over XML streams
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient algorithms for processing XPath queries
ACM Transactions on Database Systems (TODS)
ACM Transactions on Database Systems (TODS)
Tree-pattern queries on a lightweight XML processor
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Semantic query optimization for XQuery over XML streams
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Processing queries on tree-structured data efficiently
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Filtering XML documents using XPath expressions and aspect-oriented programming
Proceedings of the 2006 ACM symposium on Document engineering
XML access control using static analysis
ACM Transactions on Information and System Security (TISSEC)
Journal of Computer and System Sciences
Frontiers of tractability for typechecking simple XML transformations
Journal of Computer and System Sciences
On the memory requirements of XPath evaluation over XML streams
Journal of Computer and System Sciences
Automaton meets algebra: a hybrid paradigm for XML stream processing
Data & Knowledge Engineering - Special issue: ER 2003
Visibly pushdown automata for streaming XML
Proceedings of the 16th international conference on World Wide Web
Tight lower bounds for query processing on streaming and external memory data
Theoretical Computer Science
Efficient xml data dissemination with piggybacking
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Query-based monitoring of BPEL business processes
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Attribute grammars for scalable query processing on XML streams
The VLDB Journal — The International Journal on Very Large Data Bases
Reasoning about XML update constraints
Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
SPEX: Streamed and Progressive Evaluation of XPath
IEEE Transactions on Knowledge and Data Engineering
Optimizing XPath queries on streaming XML data
ADC '07 Proceedings of the eighteenth conference on Australasian database - Volume 63
RRXS: redundancy reducing XML storage in relations
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Query processing for high-volume XML message brokering
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Towards an internet-scale XML dissemination service
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
FluXQuery: an optimizing XQuery processor for streaming XML data
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Online evaluation of regular tree queries
Nordic Journal of Computing
Monitoring business processes with queries
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Early profile pruning on XML-aware publish-subscribe systems
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Bulk data in main memory-based XQuery evaluation
XIME-P '07 Proceedings of the 4th international workshop on XQuery implementation, experience and perspectives
An efficient mechanism for matching multiple patterns with streamed XML data
SE'07 Proceedings of the 25th conference on IASTED International Multi-Conference: Software Engineering
BP-Mon: query-based monitoring of BPEL business processes
ACM SIGMOD Record
Stream firewalling of xml constraints
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Automaton in or out: run-time plan optimization for XML stream processing
SSPS '08 Proceedings of the 2nd international workshop on Scalable stream processing system
Runtime-Efficient Approach for Multiple Continuous Filtering in XML Message Brokers
SOFSEM '07 Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science
ACM Computing Surveys (CSUR)
Information Systems
Information filtering and query indexing for an information retrieval model
ACM Transactions on Information Systems (TOIS)
Efficient maintenance techniques for views over active documents
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Adding nesting structure to words
Journal of the ACM (JACM)
IEICE - Transactions on Information and Systems
Selectivity-sensitive shared evaluation of multiple continuous XPath queries over XML streams
Information Sciences: an International Journal
Fast XML document filtering by sequencing twig patterns
ACM Transactions on Internet Technology (TOIT)
Knowledge and Information Systems
An XML publish/subscribe algorithm implemented by relational operators
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Posfilter: an efficient filtering technique of XML documents based on postfix sharing
BNCOD'07 Proceedings of the 24th British national conference on Databases
Efficient and expressive tree filters
FSTTCS'07 Proceedings of the 27th international conference on Foundations of software technology and theoretical computer science
A development of hash-lookup trees to support querying streaming XML
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Selective regular expression matching
ISC'10 Proceedings of the 13th international conference on Information security
GPX-matcher: a generic boolean predicate-based XPath expression matcher
Proceedings of the 14th International Conference on Extending Database Technology
Efficient computation of queries on feature streams
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Database query processing using finite cursor machines
ICDT'07 Proceedings of the 11th international conference on Database Theory
Tight lower bounds for query processing on streaming and external memory data
ICALP'05 Proceedings of the 32nd international conference on Automata, Languages and Programming
Relaxing result accuracy for performance in publish/subscribe systems
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
An effective XML filtering method for high-performance publish/subscribe system
APWeb'06 Proceedings of the 2006 international conference on Advanced Web and Network Technologies, and Applications
Towards expressive publish/subscribe systems
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
An extensible light-weight XML-Based monitoring system for sequence databases
DILS'06 Proceedings of the Third international conference on Data Integration in the Life Sciences
Network layer XML routing using lazy DFA
SERA'04 Proceedings of the Second international conference on Software Engineering Research, Management and Applications
Foundations of XML based on logic and automata: a snapshot
FoIKS'12 Proceedings of the 7th international conference on Foundations of Information and Knowledge Systems
XML filtering with XPath expressions containing parent and ancestor axes
Information Sciences: an International Journal
RW'07 Proceedings of the Third international summer school conference on Reasoning Web
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
JetXSLT: a resource-conscious XSLT processor
ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
Scalable XML query processing using parallel pushdown transducers
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
We consider the problem of evaluating a large number of XPath expressions on an XML stream. Our main contribution consists in showing that Deterministic Finite Automata (DFA) can be used effectively for this problem: in our experiments we achieve a throughput of about 5.4MB/s, independent of the number of XPath expressions (up to 1,000,000 in our tests). The major problem we face is that of the size of the DFA. Since the number of states grows exponentially with the number of XPath expressions, it was previously believed that DFAs cannot be used to process large sets of expressions. We make a theoretical analysis of the number of states in the DFA resulting from XPath expressions, and consider both the case when it is constructed eagerly, and when it is constructed lazily. Our analysis indicates that, when the automaton is constructed lazily, and under certain assumptions about the structure of the input XML data, the number of states in the lazy DFA is manageable. We also validate experimentally our findings, on both synthetic and real XML data sets.