EBIMed---text crunching to gather facts for proteins from Medline

  • Authors:
  • Dietrich Rebholz-Schuhmann;Harald Kirsch;Miguel Arregui;Sylvain Gaudan;Mark Riethoven;Peter Stoehr

  • Affiliations:
  • European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK;European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK;European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK;European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK;European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK;European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK

  • Venue:
  • Bioinformatics
  • Year:
  • 2007

Quantified Score

Hi-index 3.84

Visualization

Abstract

Summary: To allow efficient and systematic retrieval of statements from Medline we have developed EBIMed, a service that combines document retrieval with co-occurrence-based analysis of Medline abstracts. Upon keyword query, EBIMed retrieves the abstracts from EMBL-EBI's installation of Medline and filters for sentences that contain biomedical terminology maintained in public bioinformatics resources. The extracted sentences and terminology are used to generate an overview table on proteins, Gene Ontology (GO) annotations, drugs and species used in the same biological context. All terms in retrieved abstracts and extracted sentences are linked to their entries in biomedical databases. We assessed the quality of the identification of terms and relations in the retrieved sentences. More than 90% of the protein names found indeed represented a protein. According to the analysis of four protein--protein pairs from the Wnt pathway we estimated that 37% of the statements containing such a pair mentioned a meaningful interaction and clarified the interaction of Dkk with LRP. We conclude that EBIMed improves access to information where proteins and drugs are involved in the same biological process, e.g. statements with GO annotations of proteins, protein--protein interactions and effects of drugs on proteins. Availability: Available at http://www.ebi.ac.uk/Rebholz-srv/ebimed Supplementary Data: Supplementary Data are available at Bioinformatics online. Contact: Rebholz@ebi.ac.uk