Freshness matters: in flowers, food, and web authority

Authors:
Na Dai; Brian D. Davison
Affiliations:
Lehigh University, Bethlehem, PA, USA; Lehigh University, Bethlehem, PA, USA
Venue:
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Year:
2010

Abstract

The collective contributions of billions of users across the globe each day result in an ever-changing web. In verticals like news and real-time search, recency is an obviously significant ranking factor. However, traditional link-based web ranking algorithms typically run on a single web snapshot and ignore the user activity behind changes to web pages and links. As a result, a stale page that was popular many years ago may still achieve a high authority score because of its accumulated in-links. To remedy this, we propose a temporal link-based web ranking scheme that incorporates features from historical author activities. We quantify web page freshness over time from page and in-link activity, and design a web surfer model that incorporates web freshness, based on a temporal web graph composed of multiple web snapshots at different time points. The model includes authority propagation among snapshots, enabling link structures at distinct time points to influence each other when estimating web page authority. Experiments on a real-world archival web corpus show that our approach improves upon PageRank in both the relevance and the freshness of search results.
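
To make the idea of a freshness-aware web surfer concrete, here is a minimal Python sketch. It is not the model from the paper: it works on a single snapshot, omits authority propagation among snapshots, and takes freshness scores as given rather than deriving them from page and in-link activity. The function name freshness_pagerank, its parameters, and the toy graph are all assumptions made for illustration. The only change from plain PageRank is that the surfer prefers fresher pages both when following links and when jumping at random.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # (source page, target page)


def freshness_pagerank(
    edges: List[Edge],
    freshness: Dict[str, float],
    damping: float = 0.85,
    iterations: int = 50,
) -> Dict[str, float]:
    """Illustrative PageRank variant biased toward fresh pages.

    Assumption: every page appears as a key in `freshness` with a
    strictly positive score. How such scores are computed is outside
    the scope of this sketch.
    """
    nodes = sorted(freshness)
    total = sum(freshness.values())
    # Random jumps land on a page with probability proportional to freshness.
    teleport = {n: freshness[n] / total for n in nodes}

    out_links: Dict[str, List[str]] = {n: [] for n in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    rank = dict(teleport)
    for _ in range(iterations):
        new_rank = {n: (1.0 - damping) * teleport[n] for n in nodes}
        for src in nodes:
            targets = out_links[src]
            if not targets:
                # Dangling page: spread its score like a random jump.
                for n in nodes:
                    new_rank[n] += damping * rank[src] * teleport[n]
                continue
            # Followed links are chosen in proportion to target freshness.
            weight = sum(freshness[t] for t in targets)
            for t in targets:
                new_rank[t] += damping * rank[src] * freshness[t] / weight
        rank = new_rank
    return rank


# Toy graph: "old" and "new" have identical link structure.
edges = [("hub", "old"), ("hub", "new"), ("old", "hub"), ("new", "hub")]
scores = freshness_pagerank(edges, {"hub": 1.0, "old": 0.2, "new": 1.0})
print(sorted(scores.items(), key=lambda kv: -kv[1]))
```

In the toy graph the two leaf pages are structurally identical, so plain PageRank would score them equally. With the stale page given a lower freshness score, the fresh page ends up ranked higher, which is the qualitative effect the abstract describes.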