Utilizing text mining results: the PastaWeb system
BioMed '02 Proceedings of the ACL-02 workshop on Natural language processing in the biomedical domain - Volume 3
Expert Systems with Applications: An International Journal
Manual curation of biological databases is an expensive and labor-intensive process in genomics and systems biology. We report the implementation of a state-of-the-art, rule-based natural language processing system that creates computer-readable networks of regulatory interactions directly from abstracts and full-text papers. We evaluate its output against a manually curated standard database and test the possibilities and limitations of automatic and semi-automatic curation of the so-called biobibliome. We also propose a novel Regulatory Interaction Mining Markup Language suited to representing this data, useful both for biologists and for text-mining specialists.