Large scale query log analysis of re-finding

Authors:
Sarah K. Tyler;Jaime Teevan
Affiliations:
University of California, Santa Cruz, Santa Cruz, CA, USA;Microsoft Research, Redmond, WA, USA
Venue:
Proceedings of the third ACM international conference on Web search and data mining
Year:
2010

Citing 24
Cited 34

Characterizing browsing strategies in the World-Wide Web

Proceedings of the Third International World-Wide Web conference on Technology, tools and applications
On the reuse of past optimal queries

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
How people revisit web pages: empirical findings and implications for the design of history systems

International Journal of Human-Computer Studies - Special issue: World Wide Web usability
Information archiving with bookmarks: personal Web space construction and organization

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Patterns of search: analyzing and modeling Web query refinement

UM '99 Proceedings of the seventh international conference on User modeling
Query word deletion prediction

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Mining longitudinal web queries: trends and patterns

Journal of the American Society for Information Science and Technology
Hourly analysis of a very large topically categorized web query log

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Information search and re-access strategies of experienced web users

WWW '05 Proceedings of the 14th international conference on World Wide Web
Personalizing search via automated analysis of interests and activities

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Using Web Search Engines to Find and Refind Information

Computer
Search histories for user support in user interfaces

Journal of the American Society for Information Science and Technology
A large-scale analysis of query logs for assessing personalization opportunities

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Interest-based personalized search

ACM Transactions on Information Systems (TOIS)
Web page revisitation revisited: implications of a long-term click-stream study of browser usage

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
A large-scale evaluation and analysis of personalized search strategies

Proceedings of the 16th international conference on World Wide Web
Information re-retrieval: repeat queries in Yahoo's logs

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Studying the use of popular destinations to enhance web search interaction

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Large scale analysis of web revisitation patterns

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
SearchBar: a search-centric web history for task resumption and information re-finding

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
To personalize or not to personalize: modeling queries with variation in user intent

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
How people recall, recognize, and reuse search results

ACM Transactions on Information Systems (TOIS)
Learning about the world through long-term query logs

ACM Transactions on the Web (TWEB)
Examining repetition in user search behavior

ECIR'07 Proceedings of the 29th European conference on IR research

Assessing the scenic route: measuring the value of search trails in web logs

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Utilizing re-finding for personalized information retrieval

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Understanding and predicting personal navigation

Proceedings of the fourth ACM international conference on Web search and data mining
A comparison of how users search on web finding and re-finding tasks

Proceedings of the 2011 iConference
Addressing people's information needs directly in a web search result page

Proceedings of the 20th international conference on World wide web
Context-sensitive query auto-completion

Proceedings of the 20th international conference on World wide web
Beyond the usual suspects: context-aware revisitation support

Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Classification of user interest patterns using a virtual folksonomy

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Supporting revisitation with contextual suggestions

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Seeding simulated queries with user-study data for personal search evaluation

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Understanding re-finding behavior in naturalistic email interaction logs

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Persistence in the ephemeral: utilizing repeat behaviors for multi-session personalized search

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
A layered approach to revisitation prediction

ICWE'11 Proceedings of the 11th international conference on Web engineering
What and how children search on the web

Proceedings of the 20th ACM international conference on Information and knowledge management
Fusing different information retrieval systems according to query-topics: a study based on correlation in information retrieval systems and TREC topics

Information Retrieval
Exploring query patterns in email search

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Visual metaphors to model metacognitive strategies that support memory during the process of refinding information

Proceedings of the 4th Information Interaction in Context Symposium
Towards realistic known-item topics for the ClueWeb

Proceedings of the 4th Information Interaction in Context Symposium
Analyzing query logs of USPTO examiners to identify useful query terms in patent documents for query expansion in patent searching: a preliminary study

IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Human-centred workplace: re-finding physical documents in an office environment

Proceedings of the 13th International Conference of the NZ Chapter of the ACM's Special Interest Group on Human-Computer Interaction
Multi-session re-search: in pursuit of repetition and diversification

Proceedings of the 21st ACM international conference on Information and knowledge management
Re-finding physical documents: extending a digital library into a human-centred workplace

TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
FindAll: a local search engine for mobile phones

Proceedings of the 8th international conference on Emerging networking experiments and technologies
From machu_picchu to "rafting the urubamba river": anticipating information needs via the entity-query graph

Proceedings of the sixth ACM international conference on Web search and data mining
Fighting search engine amnesia: reranking repeated results

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Semantic query answering in digital repositories: Semantic Search v2 for DSpace

International Journal of Metadata, Semantics and Ontologies
Online multitasking and user engagement

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Mining search and browse logs for web search: A Survey

ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Enhancing web revisitation by contextual keywords

ICWE'13 Proceedings of the 13th international conference on Web Engineering
Slow Search: Information Retrieval without Time Constraints

Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval
User analytics with UbeOne: insights into web printing

Proceedings of the VLDB Endowment
Efficient error-tolerant query autocompletion

Proceedings of the VLDB Endowment
Analysis of Search and Browsing Behavior of Young Users on the Web

ACM Transactions on the Web (TWEB)
The dynamics of repeat consumption

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Although Web search engines are targeted towards helping people find new information, people regularly use them to re-find Web pages they have seen before. Researchers have noted the existence of this phenomenon, but relatively little is understood about how re-finding behavior differs from the finding of new information. This paper dives deeply into the differences via analysis of three large-scale data sources: 1) query logs (queries, clicks, result impressions), 2) Web browsing logs (URL visits), and 3) a daily Web crawl (page content). It appears that people learn valuable information about the pages they find that helps them re-find what they are looking for later; compared to the initial finding query, re-finding queries are typically shorter, and rank the re-found URL higher. While many instances of re-finding probably serve as a type of bookmark for a known URL, others seem to represent the resumption of a previous task; results clicked at the end of a session are more likely than those at the beginning to be re-found during a later session, while re-finding is more likely to happen at the beginning of a session than at the end. Additionally, we observe differences in cross-session and intra-session re-finding that may indicate different types of re-finding tasks. Our findings suggest there is a rich opportunity for search engines to take advantage of re-finding behavior as a means to improve the search experience.