Temporal Evolution of the UK Web

Authors:
Ilaria Bordino;Paolo Boldi;Debora Donato;Massimo Santini;Sebastiano Vigna
Venue:
ICDMW '08 Proceedings of the 2008 IEEE International Conference on Data Mining Workshops
Year:
2008

Citing 0
Cited 5

Web spam filtering in internet archives
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web

Scalable manipulation of archival web graphs
Proceedings of the 9th workshop on Large-scale and distributed informational retrieval

As time goes by: discovering eras in evolving social networks
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I

PowerGraph: distributed graph-parallel computation on natural graphs
OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation

Evolving networks: Eras and turning points
Intelligent Data Analysis - Dynamic Networks and Knowledge Discovery

Abstract

Recently, a new temporal dataset has been made public: a series of twelve 100M-page snapshots of the .uk domain [BSVLTAG]. The Web graphs of the twelve snapshots have been merged into a single time-aware graph that provides constant-time access to temporal information. In this paper we present the first statistical analysis performed on this graph, with the goal of checking whether the information contained in the graph is reliable (i.e., whether it depends essentially on the appearance and disappearance of pages and links, or on the crawler's behaviour). We perform a number of tests showing that the graph is indeed reliable, and provide the first public data on the evolution of the Web that combine a large scale with a significant diversity in the sites considered.
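
The abstract does not say how the time-aware graph is stored. As a rough illustration only, one way to merge several snapshots into a single graph with constant-time temporal lookups is to label each arc with a bitmask whose bit t is set when the arc appears in snapshot t. The sketch below assumes that encoding; the names merge_snapshots, present and lifetime are hypothetical, and the toy data are invented for the example.

```python
from typing import Dict, Iterable, List, Tuple

Arc = Tuple[int, int]  # (source node, target node); hypothetical representation


def merge_snapshots(snapshots: List[Iterable[Arc]]) -> Dict[Arc, int]:
    """Merge per-snapshot arc lists into one arc -> bitmask mapping.

    Assumption: bit t of the label is set when the arc occurs in snapshot t.
    """
    labels: Dict[Arc, int] = {}
    for t, arcs in enumerate(snapshots):
        for arc in arcs:
            labels[arc] = labels.get(arc, 0) | (1 << t)
    return labels


def present(labels: Dict[Arc, int], arc: Arc, t: int) -> bool:
    """Check whether `arc` exists in snapshot `t` (one lookup, one bit test)."""
    return bool((labels.get(arc, 0) >> t) & 1)


def lifetime(labels: Dict[Arc, int], arc: Arc) -> int:
    """Count the snapshots in which `arc` appears."""
    return bin(labels.get(arc, 0)).count("1")


# Toy data: three tiny snapshots instead of twelve large ones.
toy = [[(0, 1), (1, 2)], [(0, 1)], [(0, 1), (2, 0)]]
labels = merge_snapshots(toy)
```

With twelve snapshots each label fits in 12 bits, so a temporal query costs one dictionary lookup (constant time on average) plus a shift and a mask, independent of the number of snapshots.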