Information Extraction as an Ontology Population Task and Its Application to Genic Interactions

Authors:
Alain-Pierre Manine;Erick Alphonse;Philippe Bessières
Affiliations:
-;-;-
Venue:
ICTAI '08 Proceedings of the 2008 20th IEEE International Conference on Tools with Artificial Intelligence - Volume 02
Year:
2008

Citing 0
Cited 4

Learning the ontological theory of an information extraction system in the multi-predicate ILP setting

Proceedings of the 2009 ACM symposium on Applied Computing
BioNLP Shared Task 2011: bacteria gene interactions and renaming

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
BioNLP 2011 task bacteria biotope: the Alvis system

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Extraction of genic interactions with the recursive logical theory of an ontology

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ontologies are a well-motivated formal representation to model knowledge needed to extract and encode data from text. Yet, their tight integration with Information Extraction (IE) systems is still a research issue, a fortiori with complex ones that go beyond hierarchies. In this paper, we introduce an original architecture where IE is specified by designing an ontology, and the extraction process is seen as an Ontology Population (OP) task. Concepts and relations of the ontology define a normalized text representation. As their abstraction level is irrelevant for text extraction, we introduced a Lexical Layer (LL) along with the ontology, i.e. relations and classes at an intermediate level of normalization between raw text and concepts. On the contrary to previous IE systems, the extraction process only involves normalizing the outputs of Natural Language Processing (NLP) modules with instances of the ontology and the LL. All the remaining reasoning is left to a query module, which uses the inference rules of the ontology to derive new instances by deduction. In this context, these inference rules subsume classical extraction rules or patterns by providing access to appropriate abstraction level and domain knowledge. To acquire those rules, we adopt an Ontology Learning (OL) perspective, and automatically acquire the inference rules with relational Machine Learning (ML). Our approach is validated on a genic interaction extraction task from a Bacillus subtilis bacterium text corpus. We reach a global recall of 89.3% and a precision of 89.6%, with high scores for the ten conceptual relations in the ontology.