Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Synchronizing a database to improve freshness
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Managing periodically updated data in relational databases: a stochastic modeling approach
Journal of the ACM (JACM)
Novelty and redundancy detection in adaptive filtering
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic query wefinement using lexical affinities with maximal information gain
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Estimating frequency of change
ACM Transactions on Internet Technology (TOIT)
A System for new event detection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Adaptive Coherency Maintenance Techniques for Time-Varying Data
RTSS '03 Proceedings of the 24th IEEE International Real-Time Systems Symposium
Effective page refresh policies for Web crawlers
ACM Transactions on Database Systems (TODS)
What's new on the web?: the evolution of the web from a search engine perspective
Proceedings of the 13th international conference on World Wide Web
Newsjunkie: providing personalized newsfeeds via analysis of information novelty
Proceedings of the 13th international conference on World Wide Web
Web-CAM: monitoring the dynamic Web to respond to continual queries
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
WWW '05 Proceedings of the 14th international conference on World Wide Web
NewsInEssence: summarizing online news topics
Communications of the ACM - The digital society
Adaptive pull-based policies for wide area data delivery
ACM Transactions on Database Systems (TODS)
Load shedding in stream databases: a control-based approach
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Resource-adaptive real-time new event detection
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Cayuga: a high-performance event processing engine
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Client behavior and feed characteristics of RSS, a publish-subscribe system for web micronews
IMC '05 Proceedings of the 5th ACM SIGCOMM conference on Internet Measurement
Efficient Monitoring Algorithm for Fast News Alerts
IEEE Transactions on Knowledge and Data Engineering
Tracking multiple topics for finding interesting articles
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
WIC: a general-purpose algorithm for monitoring web information sources
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Monitoring an Information Source Under a Politeness Constraint
INFORMS Journal on Computing
Satisfying Complex Data Needs using Pull-Based Online Monitoring of Volatile Data Sources
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Capturing Approximated Data Delivery Tradeoffs
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
Extracting user profiles from large scale data
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
Increasing patient safety using explanation-driven personalized content recommendation
Proceedings of the 1st ACM International Health Informatics Symposium
Characterizing web syndication behavior and content
WISE'11 Proceedings of the 12th international conference on Web information system engineering
On the Relationship between Novelty and Popularity of User-Generated Content
ACM Transactions on Intelligent Systems and Technology (TIST)
Distributed top-k full-text content dissemination
Distributed and Parallel Databases
Hi-index | 0.00 |
This work addresses a novel problem of maintaining channel proflies on the Web. Such channel maintenance is essential for next generation of Web 2.0 applications that provide sophisticated search and discovery services over Web information channels. Maintaining a fresh channel profile is extremely difficult due to the the dynamic nature of the channel, especially under the constraint of a limited monitoring budget. We propose a novel monitoring scheme that learns the channels' monitoring rates. The monitoring scheme is further extended to consider the content that is published on the channels. We describe a novelty detection filter that refines the monitoring rate according to the expected rate of novel content published on the channels. We further show how inter-channel profile similarities can be utilized to refine the channel monitoring rates. Using real-world data of Web feeds we study the performance of the monitoring scheme. We experiment with several monitoring policies over a large set of Web feeds and show that a policy based on learning the monitoring rate of the channels, combined with novelty detection, outperforms alternative channel monitoring policies. Our results show that the suggested content-based policy is able to maintain high quality channel profiles under limited monitoring resources.