Efficient crawling through URL ordering
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Similarity estimation techniques from rounding algorithms
STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Estimating frequency of change
ACM Transactions on Internet Technology (TOIT)
Hi-index | 0.00 |
We track a large set of "rapidly" changing web pages and examine the assumption that the arrival of content changes follows a Poisson process on a microscale. We demonstrate that there are significant differences in the behavior of pages that can be exploited to maintain freshness in a web corpus.