Index structures for selective dissemination of information under the Boolean model
ACM Transactions on Database Systems (TODS)
Matching events in a content-based subscription system
Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing
The SIFT information dissemination system
ACM Transactions on Database Systems (TODS)
An Efficient Method for Generating Discrete Random Variables with General Distributions
ACM Transactions on Mathematical Software (TOMS)
Filtering algorithms and implementation for very fast publish/subscribe systems
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Publish/Subscribe on the Web at Extreme Speed
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Forwarding in a content-based network
Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications
Hourly analysis of a very large topically categorized web query log
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Analysis and Algorithms for Content-Based Event Matching
ICDCSW '05 Proceedings of the Fourth International Workshop on Distributed Event-Based Systems (DEBS) (ICDCSW'05) - Volume 04
Inverted files for text search engines
ACM Computing Surveys (CSUR)
A trie-based APRIORI implementation for mining frequent item sequences
Proceedings of the 1st international workshop on open source data mining: frequent pattern mining implementations
Efficient query subscription processing for prospective search engines
ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
Optimizing Frequency Queries for Data Mining Applications
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Scalable ranked publish/subscribe
Proceedings of the VLDB Endowment
A Data Structure for Sponsored Search
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Proceedings of the VLDB Endowment
Characterizing web syndication behavior and content
WISE'11 Proceedings of the 12th international conference on Web information system engineering
Efficient filtering in micro-blogging systems: we won't get flooded again
SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Hi-index | 0.00 |
The explosion of published information on the Web leads to the emergence of a Web syndication paradigm, which transforms the passive reader into an active information collector. Information consumers subscribe to RSS/Atom feeds and are notified whenever a piece of news (item) is published. The success of this Web syndication now offered on Web sites, blogs, and social media, however raises scalability issues. There is a vital need for efficient real-time filtering methods across feeds, to allow users to follow effectively personally interesting information. We investigate in this paper three indexing techniques for users' subscriptions based on inverted lists or on an ordered trie. We present analytical models for memory requirements and matching time and we conduct a thorough experimental evaluation to exhibit the impact of critical workload parameters on these structures.