The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers

  • Authors:
  • Nigel Collier;Hyun Seok Park;Norihiro Ogata;Yuka Tateishi;Chikashi Nobata;Tomoko Ohta;Tateshi Sekimizu;Hisao Imai;Katsutoshi Ibushi;Jun-ichi Tsujii

  • Affiliations:
  • University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan

  • Venue:
  • EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an outline of the genome information acquisition (GENIA) project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction, retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet.