A local search approximation algorithm for k-means clustering
Proceedings of the eighteenth annual symposium on Computational geometry
Subject categorization of query terms for exploring Web users' search interests
Journal of the American Society for Information Science and Technology
An Efficient k-Means Clustering Algorithm: Analysis and Implementation
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Philosophy of Information Retrieval Evaluation
CLEF '01 Revised Papers from the Second Workshop of the Cross-Language Evaluation Forum on Evaluation of Cross-Language Information Retrieval Systems
ACM SIGIR Forum
Probabilistic User Behavior Models
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Hourly analysis of a very large topically categorized web query log
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Investigating behavioral variability in web search
Proceedings of the 16th international conference on World Wide Web
Demographic prediction based on user's browsing behavior
Proceedings of the 16th international conference on World Wide Web
Information re-retrieval: repeat queries in Yahoo's logs
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Determining the informational, navigational, and transactional intent of Web queries
Information Processing and Management: an International Journal
Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs
Proceedings of the 17th ACM conference on Information and knowledge management
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Inferring search behaviors using partially observable Markov (POM) model
Proceedings of the third ACM international conference on Web search and data mining
A characterization of online browsing behavior
Proceedings of the 19th international conference on World wide web
Predicting searcher frustration
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Ready to buy or just browsing?: detecting web searcher goals from interaction data
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
The demographics of web search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Predicting query performance using query, result, and user interaction features
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
The intention behind web queries
SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
People searching for people: analysis of a people search engine log
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
What and how children search on the web
Proceedings of the 20th ACM international conference on Information and knowledge management
Proceedings of the 21st international conference companion on World Wide Web
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Mining web query logs to analyze political issues
Proceedings of the 3rd Annual ACM Web Science Conference
Demographic context in web search re-ranking
Proceedings of the 21st ACM international conference on Information and knowledge management
From republicans to teenagers --- group membership and search (GRUMPS)
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Drawing a data-driven portrait of Wikipedia editors
Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration
Inferring the demographics of search users: social data meets search queries
Proceedings of the 22nd international conference on World Wide Web
Mining search and browse logs for web search: A Survey
ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Similar or not similar: this is a parameter question
HCI International'13 Proceedings of the 15th international conference on Human Interface and the Management of Information: information and interaction design - Volume Part I
Analysis of Search and Browsing Behavior of Young Users on the Web
ACM Transactions on the Web (TWEB)
Who watches (and shares) what on youtube? and when?: using twitter to understand youtube viewership
Proceedings of the 7th ACM international conference on Web search and data mining
Hi-index | 0.01 |
We analyze a large query log of 2.3 million anonymous registered users from a web-scale U.S. search engine in order to jointly analyze their on-line behavior in terms of who they might be (demographics), what they search for (query topics), and how they search (session analysis). We examine basic demographics from registration information provided by the users, augmented with U.S. census data, analyze basic session statistics, classify queries into types (navigational, informational, transactional) based on click entropy, classify queries into topic categories, and cluster users based on the queries they issued. We then examine the resulting clusters in terms of demographics and search behavior. Our analysis of the data suggests that there are important differences in search behavior across different demographic groups in terms of the topics they search for, and how they search (e.g., white conservatives are those likely to have voted republican, mostly white males, who search for business, home, and gardening related topics; Baby Boomers tend to be primarily interested in Finance and a large fraction of their sessions consist of simple navigational queries related to online banking, etc.). Finally, we examine regional search differences, which seem to correlate with differences in local industries (e.g., gambling related queries are highest in Las Vegas and lowest in Salt Lake City; searches related to actors are about three times higher in L.A. than in any other region).