The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers

Authors:
Nigel Collier;Hyun Seok Park;Norihiro Ogata;Yuka Tateishi;Chikashi Nobata;Tomoko Ohta;Tateshi Sekimizu;Hisao Imai;Katsutoshi Ibushi;Jun-ichi Tsujii
Affiliations:
University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan (all authors)
Venue:
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Year:
1999

Abstract

We present an outline of the genome information acquisition (GENIA) project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction, retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet.