Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
A case study in web search using TREC algorithms
Proceedings of the 10th international conference on World Wide Web
Engineering a multi-purpose test collection for web retrieval experiments
Information Processing and Management: an International Journal
Replicating Web Structure in Small-Scale Test Collections
Information Retrieval
Index compression using fixed binary codewords
ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
Improved robustness of signature-based near-replica detection via lexicon randomization
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Fast phrase querying with combined indexes
ACM Transactions on Information Systems (TOIS)
Inverted Index Compression Using Word-Aligned Binary Codes
Information Retrieval
Single-pass clustering for peer-to-peer information retrieval: the effect of document ordering
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Computing customized page ranks
ACM Transactions on Internet Technology (TOIT)
Lexicon randomization for near-duplicate detection with I-Match
The Journal of Supercomputing
Improved query difficulty prediction for the web
Proceedings of the 17th ACM conference on Information and knowledge management
Efficiency trade-offs in two-tier web search systems
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Improving the evaluation of web search systems
ECIR'03 Proceedings of the 25th European conference on IR research
Improving the evaluation of web search systems
ECIR'03 Proceedings of the 25th European conference on IR research
P2PIRB: benchmarking framework for P2PIR
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Exploiting locality in searching the web
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
A suite of testbeds for the realistic evaluation of peer-to-peer information retrieval systems
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Hi-index | 0.00 |
We measure the WT10g test collection, used in the TREC-9 and TREC 2001 Web Tracks, with common measures used in the web topology community, in order to see if WT10g "looks like" the web. This is not an idle question; characteristics of the web, such as power law relationships, diameter, and connected components have all been observed within the scope of general web crawls, constructed by blindly following links. In contrast, WT10g was carved out from a larger crawl specifically to be a web search test collection within the reach of university researchers. Does such a collection retain the properties of the larger web? In the case of WT10g, yes.