Variations in relevance judgments and the measurement of retrieval effectiveness
Information Processing and Management: an International Journal
Rutabaga by any other name: extracting biological names
Journal of Biomedical Informatics - Special issue: Sublanguage
The concept of relevance in IR
Journal of the American Society for Information Science and Technology
Relevance judgment: What do information users consider beyond topicality?
Journal of the American Society for Information Science and Technology - Research Articles
Exploring hedge identification in biomedical literature
Journal of Biomedical Informatics
TREC genomics special issue overview
Information Retrieval
Extracting Protein Interactions from Text with the Unified AkaneRE Event Extraction System
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Are self-assessments reliable indicators of topic knowledge?
Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47
Protocol-driven searches for medical and health-sciences systematic reviews
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
Quantifying the impact of concept recognition on biomedical information retrieval
Information Processing and Management: an International Journal
Exploring and predicting search task difficulty
Proceedings of the 21st ACM international conference on Information and knowledge management
Inferring user knowledge level from eye movement patterns
Information Processing and Management: an International Journal
Hi-index | 0.00 |
With the help of a team of expert biologist judges, the TREC Genomics track has generated four large sets of "gold standard" test collections, comprised of over a hundred unique topics, two kinds of ad hoc retrieval tasks, and their corresponding relevance judgments. Over the years of the track, increasingly complex tasks necessitated the creation of judging tools and training guidelines to accommodate teams of part-time short-term workers from a variety of specialized biological scientific backgrounds, and to address consistency and reproducibility of the assessment process. Important lessons were learned about factors that influenced the utility of the test collections including topic design, annotations provided by judges, methods used for identifying and training judges, and providing a central moderator "meta-judge".