Supporting biomedical information retrieval: the BioTracer approach

  • Authors:
  • Heri Ramampiaro; Chen Li

  • Affiliations:
  • Department of Computer and Information Science, Norwegian University of Science and Technology (NTNU), Trondheim, Norway; Dept. of Computer Science, University of California, Irvine, CA

  • Venue:
  • Transactions on large-scale data- and knowledge-centered systems IV
  • Year:
  • 2011

Abstract

The large volume and diversity of available biomedical information place high demands on existing search systems. Such a system should not only retrieve the sought information but also filter out irrelevant documents and rank the relevant ones highest. Focusing on biomedical information, this work investigates how to improve a system's ability to find and rank relevant documents. To this end, we apply a series of information retrieval (IR) techniques to biomedical search and combine them in an optimal manner. These techniques include extending and using well-established IR similarity models, such as the Vector Space Model (VSM) and BM25, together with their underlying scoring schemes. They also allow users to influence the ranking according to their own view of relevance. The techniques have been implemented and tested in a proof-of-concept prototype called BioTracer, which extends a Java-based open source search engine library. The results of our experiments using the TREC 2004 Genomics Track collection are promising. Our investigation has also revealed that involving the user in the search process has a positive effect on the ranking of search results, and that the approaches used in BioTracer can help meet the user's information needs.
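
As a rough illustration of the kind of scoring the abstract refers to, the following is a minimal sketch of Okapi BM25 ranking with an optional per-term multiplier. It is not BioTracer's implementation: the function name `bm25_scores`, the `term_boosts` parameter (a hypothetical stand-in for user-supplied relevance preferences), the toy documents, and the parameter defaults `k1=1.2` and `b=0.75` are all assumptions made for this example. The IDF uses the common non-negative variant `log(1 + (N - df + 0.5) / (df + 0.5))`.

```python
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.2, b=0.75, term_boosts=None):
    """Score each tokenized document in `docs` against `query_terms` with BM25.

    `term_boosts` is a hypothetical mapping term -> multiplier that stands in
    for user-supplied relevance preferences (an assumption for illustration).
    `docs` must be a non-empty list of non-empty token lists.
    """
    term_boosts = term_boosts or {}
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for t in query_terms:
            if tf[t] == 0:
                continue  # term absent from this document contributes nothing
            idf = math.log(1 + (n_docs - df[t] + 0.5) / (df[t] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(d) / avg_len))
            score += term_boosts.get(t, 1.0) * idf * norm
        scores.append(score)
    return scores


# Toy corpus (invented for this example).
docs = [
    "brca1 mutation breast cancer risk".split(),
    "protein folding in yeast".split(),
    "breast cancer screening guidelines".split(),
]
query = ["brca1", "cancer"]
plain = bm25_scores(query, docs)
boosted = bm25_scores(query, docs, term_boosts={"brca1": 2.0})
```

In this sketch, doubling the weight of `brca1` raises the score of the first document only, which is one simple way a user's view of relevance could shift a ranking. How BioTracer actually incorporates user input is not specified in the abstract.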