We examine the properties of all HTTP requests generated by a thousand undergraduates over a span of two months. Preserving user identity in the data set allows us to discover novel properties of Web traffic that directly affect models of hypertext navigation. We find that the popularity of Web sites (the number of users who contribute to their traffic) lacks any intrinsic mean and may be unbounded. Further, many aspects of the browsing behavior of individual users can be approximated by log-normal distributions even though their aggregate behavior is scale-free. Finally, we show that users' click streams cannot be cleanly segmented into sessions using timeouts, affecting any attempt to model hypertext navigation using statistics of individual sessions. We propose a strictly logical definition of sessions based on browsing activity as revealed by referrer URLs; a user may have several active sessions in their click stream at any one time. We demonstrate that applying a timeout to these logical sessions affects their statistics to a lesser extent than a purely timeout-based mechanism.
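The contrast between timeout-based and referrer-based segmentation can be illustrated with a minimal sketch. The abstract does not specify the exact construction, so the rules below are assumptions made for illustration: a request with no referrer, or with a referrer the user has not previously requested, starts a new logical session, and any other request joins the session that most recently requested its referrer URL. The `Request` type and both function names are hypothetical.

```python
from typing import Dict, List, NamedTuple, Optional


class Request(NamedTuple):
    timestamp: float          # seconds since an arbitrary epoch
    url: str                  # requested URL
    referrer: Optional[str]   # referrer URL, or None if absent


def timeout_sessions(requests: List[Request], timeout: float) -> List[List[Request]]:
    """Split one user's time-ordered requests wherever the gap exceeds `timeout`."""
    sessions: List[List[Request]] = []
    for req in requests:
        if not sessions or req.timestamp - sessions[-1][-1].timestamp > timeout:
            sessions.append([])
        sessions[-1].append(req)
    return sessions


def logical_sessions(requests: List[Request]) -> List[List[Request]]:
    """Group one user's time-ordered requests by referrer links (assumed rule)."""
    sessions: List[List[Request]] = []
    # Maps each URL to the index of the session that most recently requested it.
    last_session_for_url: Dict[str, int] = {}
    for req in requests:
        idx = last_session_for_url.get(req.referrer) if req.referrer else None
        if idx is None:
            # No referrer, or an unseen referrer: start a new session.
            sessions.append([])
            idx = len(sessions) - 1
        sessions[idx].append(req)
        last_session_for_url[req.url] = idx
    return sessions


# Two interleaved browsing threads, e.g. two tabs (made-up data).
clicks = [
    Request(0, "http://a.example/", None),
    Request(10, "http://b.example/", None),
    Request(20, "http://a.example/1", "http://a.example/"),
    Request(5000, "http://b.example/1", "http://b.example/"),
]
by_timeout = timeout_sessions(clicks, timeout=1800)
by_referrer = logical_sessions(clicks)
```

On this made-up click stream, the timeout rule merges the two threads and cuts the second one at the long pause, while the referrer rule keeps both threads intact and concurrently active. A timeout could additionally be applied within each logical session, in the spirit of the final sentence of the abstract.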