@Note: A workbench for Biomedical Text Mining

Authors:
Anália Lourenço;Rafael Carreira;Sónia Carneiro;Paulo Maia;Daniel Glez-Peña;Florentino Fdez-Riverola;Eugénio C. Ferreira;Isabel Rocha;Miguel Rocha
Affiliations:
IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal and Department of Informatics/CCTC, Univers ...;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal and Department of Informatics/CCTC, Univers ...;Dept. Informática, University of Vigo, Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain;Dept. Informática, University of Vigo, Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal;Department of Informatics/CCTC, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal
Venue:
Journal of Biomedical Informatics
Year:
2009

Citing 20
Cited 6

Rule-based extraction of experimental evidence in the biomedical domain: the KDD Cup 2002 (task 1)

ACM SIGKDD Explorations Newsletter
A Multi-Level Text Mining Method to Extract Biological Relationships

CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Improving the performance of dictionary-based approaches in protein name recognition

Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Text Mining Biomedical Literature for Discovering Gene-to-Gene Relationships: A Comparative Study of Algorithms

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Notions of correctness when evaluating protein name taggers

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Enhancing a biomedical information extraction system with dictionary mining and context disambiguation

IBM Journal of Research and Development
MedPost: a part-of-speech tagger for bioMedical text

Bioinformatics
Data mining in bioinformatics using Weka

Bioinformatics
Gene name ambiguity of eukaryotic nomenclatures

Bioinformatics
Literature mining and database annotation of protein phosphorylation using a rule-based system

Bioinformatics
Implementing the iHOP concept for navigation of biomedical literature

Bioinformatics
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
EBIMed---text crunching to gather facts for proteins from Medline

Bioinformatics
Hierarchically SVM classification based on support vector clustering method and its application to document categorization

Expert Systems with Applications: An International Journal
Discovering gene-gene relations from sequential sentence patterns in biomedical literature

Expert Systems with Applications: An International Journal
Summarization from medical documents: a survey

Artificial Intelligence in Medicine
Biomedical named entities recognition using conditional random fields model

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Recognizing biomedical named entities using SVMs: improving recognition performance with a minimal set of features

KDLL'06 Proceedings of the 2006 international conference on Knowledge Discovery in Life Science Literature
Developing a robust part-of-speech tagger for biomedical text

PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Identification of related gene/protein names based on an HMM of name variations

Computational Biology and Chemistry

BioDR: Semantic indexing networks for biomedical document retrieval

Expert Systems with Applications: An International Journal
AIBench: A rapid application development framework for translational research in biomedicine

Computer Methods and Programs in Biomedicine
Development and application of a keyword-based knowledge map for effective R&D planning

Scientometrics
A clustering study of a 7000 EU document inventory using MDS and SOM

Expert Systems with Applications: An International Journal
A JAVA application framework for scientific software development

Software—Practice & Experience
BioAnnote: A software platform for annotating biomedical documents with application in medical learning environments

Computer Methods and Programs in Biomedicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Biomedical Text Mining (BioTM) is providing valuable approaches to the automated curation of scientific literature. However, most efforts have addressed the benchmarking of new algorithms rather than user operational needs. Bridging the gap between BioTM researchers and biologists' needs is crucial to solve real-world problems and promote further research. We present @Note, a platform for BioTM that aims at the effective translation of the advances between three distinct classes of users: biologists, text miners and software developers. Its main functional contributions are the ability to process abstracts and full-texts; an information retrieval module enabling PubMed search and journal crawling; a pre-processing module with PDF-to-text conversion, tokenisation and stopword removal; a semantic annotation schema; a lexicon-based annotator; a user-friendly annotation view that allows to correct annotations and a Text Mining Module supporting dataset preparation and algorithm evaluation. @Note improves the interoperability, modularity and flexibility when integrating in-home and open-source third-party components. Its component-based architecture allows the rapid development of new applications, emphasizing the principles of transparency and simplicity of use. Although it is still on-going, it has already allowed the development of applications that are currently being used.