Analysis of a very large web search engine query log
ACM SIGIR Forum
Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Vox populi: the public searching of the Web
Journal of the American Society for Information Science and Technology
Characteristics of question format web queries: an exploratory study
Information Processing and Management: an International Journal
IEEE Internet Computing
A Study of Multitasking Web Search
ITCC '03 Proceedings of the International Conference on Information Technology: Computers and Communications
Improving Search in Peer-to-Peer Networks
ICDCS '02 Proceedings of the 22 nd International Conference on Distributed Computing Systems (ICDCS'02)
Tracing a Large-Scale Peer to Peer System: An Hour in the Life of Gnutella
CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
Study of the relationship between peer-to-peer systems and IP multicasting
IEEE Communications Magazine
A study of a weighting scheme for information retrieval in hierarchical peer-to-peer networks
ECIR'07 Proceedings of the 29th European conference on IR research
Multi-objective peer-to-peer neighbor-selection strategy using genetic algorithm
HiPC'07 Proceedings of the 14th international conference on High performance computing
Hi-index | 0.00 |
This paper presents the trends of searching queries by users from peer-to-peer (P2P) networks over an 18-month period from July 2002 to January 2004. Four data sets of search queries collected from Gnutella were studied to describe the searching trends. Major findings include (1) the percentage of duplicate queries ranging from 34% to 68% of total queries; (2) an increase in non-English queries; (3) an approximately half of searching queries specified for video or audio file types; (4) the stop word ''the'' accounting for one-third of total stop words; (5) the shift of queries from audio to video; and (6) P2P users demanding for timely entertainment and porn materials. Based on the findings, it is worthwhile for P2P developers to consider (1) system design that allows effective searching using multiple languages; and (2) techniques that eliminate stop words for faster searching.