The Cranfield tests on index language devices
Readings in information retrieval
Readings in information retrieval
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Finding information on the World Wide Web: the retrieval effectiveness of search engines
Information Processing and Management: an International Journal
Measuring index quality using random walks on the Web
WWW '99 Proceedings of the eighth international conference on World Wide Web
On power-law relationships of the Internet topology
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Evaluation by highly relevant documents
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Effective site finding using link anchor information
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Measuring Search Engine Quality
Information Retrieval
Searcher performance in question answering
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
The Importance of Prior Probabilities for Entry Page Search
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Improving pseudo-relevance feedback in web information retrieval using web page segmentation
WWW '03 Proceedings of the 12th international conference on World Wide Web
Do TREC web collections look like the web?
ACM SIGIR Forum
Query-independent evidence in home page finding
ACM Transactions on Information Systems (TOIS)
Query type classification for web document retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Evaluating database selection algorithms for distributed search
Proceedings of the 2003 ACM symposium on Applied computing
Query expansion using associated queries
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Replicating Web Structure in Small-Scale Test Collections
Information Retrieval
Integration of multiple evidences based on a query type for web search
Information Processing and Management: an International Journal
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Index compression using fixed binary codewords
ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
Improved robustness of signature-based near-replica detection via lexicon randomization
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Fast phrase querying with combined indexes
ACM Transactions on Information Systems (TOIS)
Inverted Index Compression Using Word-Aligned Binary Codes
Information Retrieval
Dempster-Shafer Theory for a Query-Biased Combination of Evidence on the Web
Information Retrieval
Text characteristics of English language university Web sites: Research Articles
Journal of the American Society for Information Science and Technology
Journal of the American Society for Information Science and Technology - Special issue: Webometrics
Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
A case study of distributed information retrieval architectures to index one terabyte of text
Information Processing and Management: an International Journal
Techniques for improving web retrieval effectiveness
Information Processing and Management: an International Journal
User performance versus precision measures for simple search tasks
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Capturing collection size for distributed non-cooperative retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Computing customized page ranks
ACM Transactions on Internet Technology (TOIT)
Efficient query expansion with auxiliary data structures
Information Systems
A pipelined architecture for distributed text query evaluation
Information Retrieval
Using query logs to establish vocabularies in distributed information retrieval
Information Processing and Management: an International Journal
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Evaluation of phrasal query suggestions
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Information Processing and Management: an International Journal
A comparative study of probabilistic and language models for information retrieval
ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Efficient online index construction for text databases
ACM Transactions on Database Systems (TODS)
Affective feedback: an investigation into the role of emotions in the information seeking process
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Lexicon randomization for near-duplicate detection with I-Match
The Journal of Supercomputing
Integral based source selection for uncooperative distributed information retrieval environments
Proceedings of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval
Simple Adaptations of Data Fusion Algorithms for Source Selection
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Learning spectral graph transformations for link prediction
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
SUSHI: scoring scaled samples for server selection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Implementing and evaluating phrasal query suggestions for proximity search
Information Systems
Implementing and evaluating phrasal query suggestions for proximity search
Information Systems
MM '09 Proceedings of the 17th ACM international conference on Multimedia
A case study of distributed information retrieval architectures to index one terabyte of text
Information Processing and Management: an International Journal
Techniques for improving web retrieval effectiveness
Information Processing and Management: an International Journal
Exploring features for the automatic identification of user goals in web search
Information Processing and Management: an International Journal
Improving the evaluation of web search systems
ECIR'03 Proceedings of the 25th European conference on IR research
When are links useful? experiments in text classification
ECIR'03 Proceedings of the 25th European conference on IR research
Improving the evaluation of web search systems
ECIR'03 Proceedings of the 25th European conference on IR research
A study of a weighting scheme for information retrieval in hierarchical peer-to-peer networks
ECIR'07 Proceedings of the 29th European conference on IR research
Extracting content structure for web pages based on visual representation
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
Modeling the web as a hypergraph to compute page reputation
Information Systems
The adaptive web
On the construction of a large scale Chinese web test collection
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Using clicks as implicit judgments: expectations versus observations
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Effective pre-retrieval query performance prediction using similarity and variability evidence
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Investigating the effectiveness of clickthrough data for document reordering
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Information Sciences: an International Journal
The importance of anchor text for ad hoc search revisited
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Search log analysis of user stereotypes, information seeking behavior, and contextual evaluation
Proceedings of the third symposium on Information interaction in context
Modeling information sources as integrals for effective and efficient source selection
Information Processing and Management: an International Journal
A search log-based approach to evaluation
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Homepage finding in hybrid peer-to-peer networks
Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Foundations and Trends in Information Retrieval
Semantically enhanced Information Retrieval: An ontology-based approach
Web Semantics: Science, Services and Agents on the World Wide Web
Transactional query identification in web search
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Sample sizes for query probing in uncooperative distributed information retrieval
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Comparative evaluation of cross-language information retrieval systems
From Integrated Publication and Information Systems to Virtual Information and Knowledge Environments
ECDL'05 Proceedings of the 9th European conference on Research and Advanced Technology for Digital Libraries
Using anchor text for homepage and topic distillation search tasks
Journal of the American Society for Information Science and Technology
Reference-based search strategies in systematic reviews
EASE'09 Proceedings of the 13th international conference on Evaluation and Assessment in Software Engineering
PROMISE'12 Proceedings of the 2012 international conference on Information Retrieval Meets Information Visualization
Hi-index | 0.00 |
Past research into text retrieval methods for the Web has been restricted by the lack of a test collection capable of supporting experiments which are both realistic and reproducible. The 1.69 million document WT10g collection is proposed as a multi-purpose testbed for experiments with these attributes, in distributed IR, hyperlink algorithms and conventional ad hoc retrieval.WT10g was constructed by selecting from a superset of documents in such a way that desirable corpus properties were preserved or optimised. These properties include: a high degree of inter-server connectivity, integrity of server holdings, inclusion of documents related to a very wide spread of likely queries, and a realistic distribution of server holding sizes. We confirm that WT10g contains exploitable link information using a site (homepage) finding experiment. Our results show that, on this task, Okapi BM25 works better on propagated link anchor text than on full text.WT10g was used in TREC-9 and TREC-2000 and both topic relevance and homepage finding queries and judgments are available.