The celebrated PageRank algorithm has proved to be a very effective paradigm for ranking the results of web search algorithms. In this paper we refine this basic paradigm to take into account several evolving prominent features of the web, and propose several algorithmic innovations. First, we analyze features of the rapidly growing "frontier" of the web, namely the part of the web that crawlers are unable to cover for one reason or another. We analyze the effect of these pages on ranking and find it to be significant. Second, we suggest ways to improve the quality of ranking by modeling the growing presence of "link rot" on the web as more sites and pages fall out of maintenance. Finally, we suggest new methods of ranking that are motivated by the hierarchical structure of the web, are more efficient than PageRank, and may be more resistant to direct manipulation.
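To make the role of frontier pages concrete, the following is a minimal sketch of textbook power-iteration PageRank, not the refined methods this paper proposes. It assumes the common convention that a page with no known outlinks (a "dangling" page, such as an uncrawled frontier page) spreads its rank uniformly over all pages. The function name `pagerank`, the damping value 0.85, the stopping tolerance, and the toy graph are all illustrative assumptions.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Textbook power-iteration PageRank over a dict {page: [outlinked pages]}.

    Assumption: pages with no outlinks (dangling pages, e.g. uncrawled
    frontier pages) spread their rank uniformly over all pages each step.
    """
    # Collect every page that appears as a link source or a link target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}

    for _ in range(max_iter):
        # Total rank currently held by pages with no known outlinks.
        dangling_mass = sum(rank[p] for p in pages if not links.get(p))
        # Every page receives the teleportation share plus an equal
        # share of the dangling mass.
        base = (1.0 - damping) / n + damping * dangling_mass / n
        new_rank = {p: base for p in pages}

        # Each page with outlinks splits its damped rank among its targets.
        for src in pages:
            targets = links.get(src, [])
            if targets:
                share = damping * rank[src] / len(targets)
                for t in targets:
                    new_rank[t] += share

        # Stop once the L1 change between iterations is negligible.
        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical toy graph: "a" and "b" are crawled pages; "f1" and "f2"
# are frontier pages that are linked to but were never crawled.
toy_links = {"a": ["b", "f1"], "b": ["a", "f2"]}
ranks = pagerank(toy_links)
frontier_share = ranks["f1"] + ranks["f2"]
```

In this toy graph the two uncrawled pages end up holding half of the total rank, which illustrates why the treatment of the frontier can matter. Redistributing dangling rank uniformly is only one possible convention.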