BioDR: Semantic indexing networks for biomedical document retrieval

Authors:
Anália Lourenço;Rafael Carreira;Daniel Glez-Peña;José R. Méndez;Sónia Carneiro;Luis M. Rocha;Fernando Díaz;Eugénio C. Ferreira;Isabel Rocha;Florentino Fdez-Riverola;Miguel Rocha
Affiliations:
IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;Department of Informatics/CCTC, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;Computer Science Department, University of Vigo, ESEI: Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain;Computer Science Department, University of Vigo, ESEI: Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;School of Informatics and Computing, Indiana University, 919 East Tenth Street, Bloomington, IN 47406, United States and Instituto Gulbenkian de a Ciência, Apartado 14, 2781-901 Oeiras, Portu ...;Computer Science Department, University of Valladolid, Escuela Universitaria de Informática, Plaza Santa Eulalia, 9-11, 40005 Segovia, Spain;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;Computer Science Department, University of Vigo, ESEI: Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain;Department of Informatics/CCTC, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal
Venue:
Expert Systems with Applications: An International Journal
Year:
2010

Citing 9
Cited 1

Performance standards and evaluations in IR test collections: cluster-based retrieval models

Information Processing and Management: an International Journal
Automatic classification using supervised learning in a medical document filtering application

Information Processing and Management: an International Journal
Diagnosis and Decision Support

Case-Based Reasoning Technology, From Foundations to Applications
Rule-based extraction of experimental evidence in the biomedical domain: the KDD Cup 2002 (task 1)

ACM SIGKDD Explorations Newsletter
Automatic scientific text classification using local patterns: KDD CUP 2002 (task 1)

ACM SIGKDD Explorations Newsletter
Applying lazy learning algorithms to tackle concept drift in spam filtering

Expert Systems with Applications: An International Journal
SpamHunting: An instance-based reasoning system for spam labelling and filtering

Decision Support Systems
@Note: A workbench for Biomedical Text Mining

Journal of Biomedical Informatics
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

IdentityRank: Named entity disambiguation in the news domain

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.06

Visualization

Abstract

In Biomedical research, retrieving documents that match an interesting query is a task performed quite frequently. Typically, the set of obtained results is extensive containing many non-interesting documents and consists in a flat list, i.e., not organized or indexed in any way. This work proposes BioDR, a novel approach that allows the semantic indexing of the results of a query, by identifying relevant terms in the documents. These terms emerge from a process of Named Entity Recognition that annotates occurrences of biological terms (e.g. genes or proteins) in abstracts or full-texts. The system is based on a learning process that builds an Enhanced Instance Retrieval Network (EIRN) from a set of manually classified documents, regarding their relevance to a given problem. The resulting EIRN implements the semantic indexing of documents and terms, allowing for enhanced navigation and visualization tools, as well as the assessment of relevance for new documents.