On the bursty evolution of blogspace
WWW '03 Proceedings of the 12th international conference on World Wide Web
Extracting evolution of web communities from a series of web archives
Proceedings of the fourteenth ACM conference on Hypertext and hypermedia
The connectivity sonar: detecting site functionality by structural patterns
Proceedings of the fourteenth ACM conference on Hypertext and hypermedia
Discovery of ads web hosts through traffic data analysis
Proceedings of the 9th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
How valuable is external link evidence when searching enterprise Webs?
ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
Spam, damn spam, and statistics: using statistical analysis to locate spam web pages
Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004
Search engine coverage bias: evidence and possible causes
Information Processing and Management: an International Journal
On the Bursty Evolution of Blogspace
World Wide Web
User Centric Walk: An Integrated Approach for Modeling the Browsing Behavior of Users on the Web
ANSS '05 Proceedings of the 38th annual Symposium on Simulation
Scientific web intelligence: finding relationships in university webs
Communications of the ACM - Designing for the mobile device
Detecting phrase-level duplication on the world wide web
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Exploiting the hierarchical structure for link analysis
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Mining tree queries in a graph
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Discovering large dense subgraphs in massive graphs
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Characterizing a national community web
ACM Transactions on Internet Technology (TOIT)
Efficient PageRank approximation via graph aggregation
Information Retrieval
What's really new on the web?: identifying new pages from a series of unstable web snapshots
Proceedings of the 15th international conference on World Wide Web
The web structure of e-government - developing a methodology for quantitative evaluation
Proceedings of the 15th international conference on World Wide Web
Relationship between web links and trade
Proceedings of the 15th international conference on World Wide Web
AggregateRank: bringing order to web sites
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Dynamics of the Chilean web structure
Computer Networks: The International Journal of Computer and Telecommunications Networking - Web dynamics
Characterization of national Web domains
ACM Transactions on Internet Technology (TOIT)
Computer
Computing pagerank in a distributed internet search system
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Ranking web sites with real user traffic
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Characterization of the Thai hostgraph
Proceedings of the 2nd international conference on Ubiquitous information management and communication
Guanxi in the chinese web - a study of mutual linking
Proceedings of the 17th international conference on World Wide Web
ICSOC '07 Proceedings of the 5th international conference on Service-Oriented Computing
Using argumentation to retrieve articles with similar citations from MEDLINE
JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
The Geographical Life of Search
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Nonlinear static-rank computation
Proceedings of the 18th ACM conference on Information and knowledge management
Data mining using links in open hypermedia
MIS'02 Proceedings of the 2002 international conference on Metainformatics
Modeling the web as a hypergraph to compute page reputation
Information Systems
Web mediators for accessible browsing
ERCIM'06 Proceedings of the 9th conference on User interfaces for all
Connectivity of the Thai web graph
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Structural approach to design user interface
Computers in Industry
ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Contracted webgraphs: structure mining and scale-freeness
FAW-AAIM'11 Proceedings of the 5th joint international frontiers in algorithmics, and 7th international conference on Algorithmic aspects in information and management
Incremental web-site boundary detection using random walks
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Query-Sets++: a scalable approach for modeling web sites
SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
China web graph measurements and evolution
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
A framework for relational link discovery
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Mining communities on the web using a max-flow and a site-oriented framework
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
A hierarchical model of web graph
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Thwarting the nigritude ultramarine: learning to identify link spam
ECML'05 Proceedings of the 16th European conference on Machine Learning
Maximum rooted spanning trees for the web
OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part II
Efficient parallel computation of pagerank
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Teaching of web information retrieval: web first or IR first?
TLIR'07 Proceedings of the First international conference on Teaching and Learning of Information Retrieval
NCDawareRank: a novel ranking method that exploits the decomposable structure of the web
Proceedings of the sixth ACM international conference on Web search and data mining
Hi-index | 0.00 |
Previous studies of the web graph structure have focused on the graph structure at the level of individual pages. In actuality the web is a hierarchically nested graph, with domains, hosts and web sites introducing intermediate levels of affiliation and administrativecontrol. To better understand the growth of the web we need to understand its macro-structure, in terms of the linkage between web sites. In this paper e approximate this by studying the graph of the linkage between hosts on the web. This as done based on snapshots of the web taken by Google in Oct 1999,Aug 2000 and Jun 2001.The connectivity between hosts is represented by a directed graph, with hosts as nodes and weighted edges representingthe count of hyperlinks between pages on the corresponding hosts. We demonstrate how such a "hostgraph" an be used to study connectivity properties of hosts and domains over time, anddiscuss a modified "copy model" too explain observed link eight distributions as a function of subgraph size. We discuss changes in the web over time in the size and connectivity of web sites and country domains. We also describe a data mining application of the hostgraph: a related host finding algorithm which achieves a precision of 0.65 at rank 3.