Optimal Semijoins for Distributed Database Systems
IEEE Transactions on Software Engineering
PERF join: an alternative to two-way semijoin and bloomjoin
CIKM '95 Proceedings of the fourth international conference on Information and knowledge management
Summary cache: a scalable wide-area Web cache sharing protocol
Proceedings of the ACM SIGCOMM '98 conference on Applications, technologies, architectures, and protocols for computer communication
Space/time trade-offs in hash coding with allowable errors
Communications of the ACM
Mesh-based content routing using XML
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Processing XML Streams with Deterministic Automata
ICDT '03 Proceedings of the 9th International Conference on Database Theory
Efficient Filtering of XML Documents for Selective Dissemination of Information
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
The view selection problem for XML content based routing
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Stream processing of XPath queries with predicates
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
XPath queries on streaming data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
YFilter: Efficient and Scalable Filtering of XML Documents
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Efficient Filtering of XML Documents with XPath Expressions
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Path sharing and predicate evaluation for high-performance XML filtering
ACM Transactions on Database Systems (TODS)
Storing and retrieving XPath fragments in structured P2P networks
Data & Knowledge Engineering - Special issue: WIDM 2004
Efficient xml data dissemination with piggybacking
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Massively multi-query join processing in publish/subscribe systems
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Efficient processing of branch queries for high-performance XML filtering
Proceedings of the 2nd international conference on Scalable information systems
Xml data dissemination using automata on top of structured overlay networks
Proceedings of the 17th international conference on World Wide Web
Fast XML document filtering by sequencing twig patterns
ACM Transactions on Internet Technology (TOIT)
EC-Web'07 Proceedings of the 8th international conference on E-commerce and web technologies
An approach for XML similarity join using tree serialization
DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Distributed structural and value XML filtering
Proceedings of the Fourth ACM International Conference on Distributed Event-Based Systems
Towards bipartite graph data management
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Distributed SLCA-based XML keyword search by map-reduce
DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
FVC: a feature-vector-based classification for XML dissemination
DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
A data mining approach to XML dissemination
WISE'10 Proceedings of the 11th international conference on Web information systems engineering
CLAP: Collaborative pattern mining for distributed information systems
Decision Support Systems
An efficient co-operative framework for multi-query processing over compressed XML data
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
XML filtering with XPath expressions containing parent and ancestor axes
Information Sciences: an International Journal
FoXtrot: Distributed structural and value XML filtering
ACM Transactions on the Web (TWEB)
XKFitler: A Keyword Filter on XML Stream
International Journal of Information Retrieval Research
A survey on XML streaming evaluation techniques
The VLDB Journal — The International Journal on Very Large Data Bases
Hi-index | 0.00 |
The filtering of XML data is the basis of many complex applications. Lots of algorithms have been proposed to solve this problem [2, 3, 5, 6, 7, 8, 9, 11, 12, 13, 18]. One important challenge is that the number of path queries is huge. It is necessary to take an efficient data structure representing path queries. Another challenge is that these path queries usually vary with time. The maintenance of path queries determines the flexibility and capacity of a filtering system. In this paper, we introduce a novel approximate method for XML data filtering, which uses Bloom filters representing path queries. In this method, millions of path queries can be stored efficiently. At the same time, it is easy to deal with the change of these path queries. To improve the filtering performance, we introduce a new data structure, Prefix Filters, to decrease the number of candidate paths. Experiments show that our Bloom filter-based method takes less time to build routing table than automaton-based method. And our method has a good performance with acceptable false positive when filtering XML packets of relatively small depth with millions of path queries.