High-Throughput identification of chemistry in life science texts

  • Authors:
  • Peter Corbett;Peter Murray-Rust

  • Affiliations:
  • Unilever center for Moleclular Sciences Informatics, Cambridge;Unilever center for Moleclular Sciences Informatics, Cambridge

  • Venue:
  • CompLife'06 Proceedings of the Second international conference on Computational Life Sciences
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

OSCAR3 is an open extensible system for the automated annotation of chemistry in scientific articles, which can process thousands of articles per hour. This XML annotation supports applications such as interactive browsing and chemically-aware searching, and has been designed for integration with larger text-analysis systems. We report its application to the high-throughput analysis of the small-molecule chemistry content of texts in life sciences, such as PubMed abstracts.