Syntactic clustering of the Web
Selected papers from the sixth international conference on World Wide Web
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Latent semantic linking over homogeneous repositories
DocEng '01 Proceedings of the 2001 ACM Symposium on Document engineering
xlinkit: a consistency checking and smart link generation service
ACM Transactions on Internet Technology (TOIT)
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
A large-scale study of the evolution of web pages
WWW '03 Proceedings of the 12th international conference on World Wide Web
APSEC '03 Proceedings of the Tenth Asia-Pacific Software Engineering Conference Software Engineering Conference
What's new on the web?: the evolution of the web from a search engine perspective
Proceedings of the 13th international conference on World Wide Web
Managing versions of web documents in a transaction-time web server
Proceedings of the 13th international conference on World Wide Web
Managing inconsistent repositories via prioritized repairs
Proceedings of the 2004 ACM symposium on Document engineering
Hi-index | 0.00 |
One of the most important aspects of a Web document is its up-to-dateness or recency. Up-to-dateness is particularly relevant to Web documents because they usually contain content origining from different sources and being refreshed at different dates. Whether a Web document is relevant for a reader depends on the history of its contents and so-called external factors, i.e., the up-to-dateness of semantically related documents. In this paper, we approach automatic management of up-to-dateness of Web documents that are managed by an XML-centric Web content management system. First, the freshness for a single document is computed, taking into account its change history. A document metric estimates the distance between different versions of a document. Second, up-to-dateness of a document is determined based on its own history and the historical evolutions of semantically related documents.