Creating a test collection: relevance judgements of cited & non-cited papers

Authors:
Anna Ritchie;Stephen Robertson;Simone Teufel
Affiliations:
University of Cambridge, Cambridge;Microsoft Research Ltd, Cambridge;University of Cambridge, Cambridge
Venue:
Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Year:
2007

Citing 8
Cited 3

Hypermedia and free text retrieval

Information Processing and Management: an International Journal - Special issue on hypertext and information retrieval
The Cranfield tests on index language devices

Readings in information retrieval
Variations in relevance judgments and the measurement of retrieval effectiveness

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Topical locality in the Web

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval evaluation with incomplete information

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
The text retrieval conferences (TRECS)

TIPSTER '98 Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998
Creating a test collection for citation-based IR experiments

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics

Comparing citation contexts for information retrieval

Proceedings of the 17th ACM conference on Information and knowledge management
Using terms from citations for IR: some first results

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Aspects and analysis of patent test collections

PaIR '10 Proceedings of the 3rd international workshop on Patent information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

We investigate the effect of different sources of relevant documents in the creation of a test collection in the scientific domain. Based on the Cranfield 2 design, paper authors are asked to judge their cited papers for relevance in the first stage. In a second stage, documents outside the reference list are judged. In this paper, we use the test collection with standard IR engines to compare the information contained in the judgements of the first vs second stage. Using different correlation studies, we found that the judgements of the cited papers do not predict those from the non-cited papers, which means that the combination of sources results in a higher quality collection.