Relation-based document retrieval for biomedical IR

Authors:
Xiaohua Zhou;Xiaohua Hu;Guangren Li;Xia Lin;Xiaodan Zhang
Affiliations:
College of Information Science & Technology, Drexel University, Philadelphia, PA;College of Information Science & Technology, Drexel University, Philadelphia, PA;Faculty of Economy, Hunan University, Changsha, China;College of Information Science & Technology, Drexel University, Philadelphia, PA;College of Information Science & Technology, Drexel University, Philadelphia, PA
Venue:
Transactions on Computational Systems Biology V
Year:
2006

Abstract

In this paper, we explore the use of term relations in information retrieval for precision-focused biomedical literature search. A relation is defined as a pair of terms that are semantically and syntactically related to each other. Unlike the traditional “bag-of-words” document model, our model represents a document by a set of sense-disambiguated terms and their binary relations. Because document-level co-occurrence of two terms often does not mean that the document addresses their relationship, using relations directly may improve the precision of very specific searches, e.g., searching for documents that mention genes regulated by Smad4. For this purpose, we develop a generic ontology-based approach to extract terms and their relations, and present a betweenness-centrality-based approach to rank retrieved documents. A prototype IR system supporting relation-based search is then built for Medline abstract search. We use this novel IR system to improve the retrieval results of all official runs in the TREC 2004 Genomics Track. The experiment shows promising performance for relation-based IR: the average P@100 (the precision of the top 100 documents) over 50 topics rises significantly from 26.37% (the P@100 of the best run is 42.10%) to 53.69%, while the MAP (mean average precision) is kept at an above-average level of 26.59%. The experiment also shows the expressiveness of relations for representing information needs, especially in biomedical literature, which is full of various biological relations.
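
The two measures reported in the abstract, P@100 and MAP, follow standard definitions. The Python sketch below illustrates those definitions only; it is not the authors' evaluation code, and the function names and toy document IDs are assumptions introduced for the example.

```python
from typing import Iterable, Sequence, Set, Tuple


def precision_at_k(ranked: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k ranked documents that are relevant (P@k)."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for doc_id in ranked[:k] if doc_id in relevant)
    return hits / k


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average of P@rank taken at each rank where a relevant document appears,
    divided by the total number of relevant documents for the topic."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def mean_average_precision(topics: Iterable[Tuple[Sequence[str], Set[str]]]) -> float:
    """Mean of average precision over topics (MAP)."""
    topics = list(topics)
    if not topics:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in topics) / len(topics)


# Toy example with hypothetical document IDs (not data from the paper).
ranked_docs = ["d1", "d2", "d3", "d4"]
relevant_docs = {"d1", "d3"}
p_at_2 = precision_at_k(ranked_docs, relevant_docs, 2)
ap = average_precision(ranked_docs, relevant_docs)
```

In this toy ranking, one of the top two documents is relevant, so P@2 is 0.5. A system can raise P@k by pushing relevant documents toward the top of the list without retrieving more of them, which is why the abstract reports MAP alongside P@100.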