Tasks, topics and relevance judging for the TREC Genomics Track: five years of experience evaluating biomedical text information retrieval systems

  • Authors:
  • Phoebe M. Roberts; Aaron M. Cohen; William R. Hersh

  • Affiliations:
  • Pfizer Research Technology Center, Cambridge, MA 02139, USA
  • Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University, Portland, OR 97239-3098, USA

  • Venue:
  • Information Retrieval
  • Year:
  • 2009

Abstract

With the help of a team of expert biologist judges, the TREC Genomics track has generated four large "gold standard" test collections, comprising more than a hundred unique topics, two kinds of ad hoc retrieval tasks, and their corresponding relevance judgments. Over the years of the track, increasingly complex tasks necessitated the creation of judging tools and training guidelines to accommodate teams of part-time, short-term workers from a variety of specialized biological backgrounds, and to address the consistency and reproducibility of the assessment process. Important lessons were learned about factors that influenced the utility of the test collections, including topic design, annotations provided by judges, methods used for identifying and training judges, and the use of a central moderating "meta-judge".