Index structures for selective dissemination of information under the Boolean model
ACM Transactions on Database Systems (TODS)
The design of a high performance information filtering system
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Document filtering with inference networks
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
On-line new event detection and tracking
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Influence sets based on reverse nearest neighbor queries
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Models and issues in data stream systems
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Combining fuzzy information: an overview
ACM SIGMOD Record
Index Structures for Information Filtering Under the Vector Space Model
Proceedings of the Tenth International Conference on Data Engineering
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
High dimensional reverse nearest neighbor queries
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
On the Bursty Evolution of Blogspace
World Wide Web
Continuous monitoring of top-k queries over sliding windows
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Data streams: algorithms and applications
Foundations and Trends® in Theoretical Computer Science
Analyzing feature trajectories for event detection
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Approximate NN queries on streams with guaranteed error/performance bounds
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Ad-hoc top-k query answering for data streams
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Sliding-window top-k queries on uncertain streams
Proceedings of the VLDB Endowment
Information filtering and query indexing for an information retrieval model
ACM Transactions on Information Systems (TOIS)
Mining common topics from multiple asynchronous text streams
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Efficient identification of starters and followers in social media
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
An Incremental Threshold Method for Continuous Text Search Queries
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Evaluating top-k queries over incomplete data streams
Proceedings of the 18th ACM conference on Information and knowledge management
Trend detection in folksonomies
SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
Complex pattern ranking (CPR): evaluating top-k pattern queries over event streams
Proceedings of the 5th ACM international conference on Distributed event-based system
Characterizing web syndication behavior and content
WISE'11 Proceedings of the 12th international conference on Web information system engineering
Efficient monitoring of personalized hot news over Web 2.0 streams
Computer Science - Research and Development
Distributed top-k full-text content dissemination
Distributed and Parallel Databases
Processing continuous text queries featuring non-homogeneous scoring functions
Proceedings of the 21st ACM international conference on Information and knowledge management
A thin monitoring layer for top-k aggregation queries over a database
Proceedings of the 7th International Workshop on Ranking in Databases
Top-k publish-subscribe for social annotation of news
Proceedings of the VLDB Endowment
Evaluating continuous top-k queries over document streams
World Wide Web
Hi-index | 0.00 |
Web 2.0 portals have made content generation easier than ever with millions of users contributing news stories in form of posts in weblogs or short textual snippets as in Twitter. Efficient and effective filtering solutions are key to allow users stay tuned to this ever-growing ocean of information, releasing only relevant trickles of personal interest. In classical information filtering systems, user interests are formulated using standard IR techniques and data from all available information sources is filtered based on a predefined absolute quality-based threshold. In contrast to this restrictive approach which may still overwhelm the user with the returned stream of data, we envision a system which continuously keeps the user updated with only the top-k relevant new information. Freshness of data is guaranteed by considering it valid for a particular time interval, controlled by a sliding window. Considering relevance as relative to the existing pool of new information creates a highly dynamic setting. We present POL-filter which together with our maintenance module constitute an efficient solution to this kind of problem. We show by comprehensive performance evaluations using real world data, obtained from a weblog crawl, that our approach brings performance gains compared to state-of-the-art.