GROBID: combining automatic bibliographic data recognition and term extraction for scholarship publications

  • Authors:
  • Patrice Lopez

  • Affiliations:
  • European Patent Office, Berlin, Germany

  • Venue:
  • ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Based on state of the art machine learning techniques, GROBID (GeneRation Of BIbliographic Data) performs reliable bibliographic data extractions from scholar articles combined with multi-level term extractions. These two types of extraction present synergies and correspond to complementary descriptions of an article. This tool is viewed as a component for enhancing the existing and the future large repositories of technical and scientific publications.