ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text

  • Authors:
  • Burr Settles

  • Affiliations:
  • Department of Computer Sciences and Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison Madison, WI 52706, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.85

Visualization

Abstract

Summary: ABNER (A Biomedical Named Entity Recognizer) is an open source software tool for molecular biology text mining. At its core is a machine learning system using conditional random fields with a variety of orthographic and contextual features. The latest version is 1.5, which has an intuitive graphical interface and includes two modules for tagging entities (e.g. protein and cell line) trained on standard corpora, for which performance is roughly state of the art. It also includes a Java application programming interface allowing users to incorporate ABNER into their own systems and train models on new corpora. Availability: ABNER is available as an executable Java archive and source code from http://www.cs.wisc.edu/~bsettles/abner/ Contact: bsettles@cs.wisc.edu