An architecture for biological information extraction and representation

Authors:
Aditya Vailaya;Peter Bluvas;Robert Kincaid;Allan Kuchinsky;Michael Creech;Annette Adler
Affiliations:
Agilent Laboratories, Palo Alto, CA;Agilent Laboratories, Palo Alto, CA;Agilent Laboratories, Palo Alto, CA;Agilent Laboratories, Palo Alto, CA;Agilent Laboratories, Palo Alto, CA;Agilent Laboratories, Palo Alto, CA
Venue:
Proceedings of the 2004 ACM symposium on Applied computing
Year:
2004

Citing 3
Cited 0

Algorithms for clustering data

Algorithms for clustering data
Extracting the names of genes and gene products with a hidden Markov model

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
When worlds collide: molecular biology as interdisciplinary collaboration

ECSCW'01 Proceedings of the seventh conference on European Conference on Computer Supported Cooperative Work

Quantified Score

Hi-index	0.00

Visualization

Abstract

Technological advances in biomedical research are generating a plethora of heterogeneous data at a high rate. There is a critical need for extraction, integration and management tools for information discovery and synthesis from these heterogeneous data. In this paper, we present a general architecture, called ALFA, for information extraction and representation from diverse biological data. The ALFA architecture consists of: (i) a networked, hierarchical object model for representing information from heterogeneous data sources in a standardized, structured format; and (ii) a suite of integrated, interactive software tools for information extraction and representation from diverse biological data sources. As part of our research efforts to explore this space, we have currently prototyped the ALFA object model and a set of interactive software tools for searching, filtering, and extracting information from scientific text. In particular, we describe BioFerret, a meta-search tool for searching and filtering relevant information from the web, and ALFA Text Viewer, an interactive tool for user-guided extraction, disambiguation, and representation of information from scientific text. We further demonstrate the potential of our tools in integrating the extracted information with experimental data and diagrammatic biological models via the common underlying ALFA representation.