Biomedical Text Mining Applied to Document Retrieval and Semantic Indexing

Authors:
Anália Lourenço;Sónia Carneiro;Eugénio C. Ferreira;Rafael Carreira;Luis M. Rocha;Daniel Glez-Peña;José R. Méndez;Florentino Fdez-Riverola;Fernando Diaz;Isabel Rocha;Miguel Rocha
Affiliations:
IBB/CEB, University of Minho, Campus Gualtar, Braga, Portugal;IBB/CEB, University of Minho, Campus Gualtar, Braga, Portugal;IBB/CEB, University of Minho, Campus Gualtar, Braga, Portugal;IBB/CEB, University of Minho, Campus Gualtar, Braga, Portugal and CCTC, University of Minho, Braga, Portugal;School of Informatics, Indiana University, Bloomington, USA;Computer Science Dept., Univ. Vigo, Ourense, Spain;Computer Science Dept., Univ. Vigo, Ourense, Spain;Computer Science Dept., Univ. Vigo, Ourense, Spain;Computer Science Department, University of Valladolid, Segóvia, Spain;IBB/CEB, University of Minho, Campus Gualtar, Braga, Portugal;CCTC, University of Minho, Braga, Portugal
Venue:
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living
Year:
2009

Citing 3
Cited 0

Automatic classification using supervised learning in a medical document filtering application

Information Processing and Management: an International Journal
Diagnosis and Decision Support

Case-Based Reasoning Technology, From Foundations to Applications
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

In Biomedical research, the ability to retrieve the adequate information from the ever growing literature is an extremely important asset. This work provides an enhanced and general purpose approach to the process of document retrieval that enables the filtering of PubMed query results. The system is based on semantic indexing providing, for each set of retrieved documents, a network that links documents and relevant terms obtained by the annotation of biological entities (e.g. genes or proteins). This network provides distinct user perspectives and allows navigation over documents with similar terms and is also used to assess document relevance. A network learning procedure, based on previous work from e-mail spam filtering, is proposed, receiving as input a training set of manually classified documents.