Inductive logic programming for corpus-based acquisition of semantic lexicons

Authors:
Pascale Sébillot;Pierrette Bouillon;Cécile Fabre
Affiliations:
IRISA - Campus de Beaulieu - Rennes cedex - France;TIM/ISSCO - ETI - Université de Genève, Geneva - Switzerland;ERSS - Université de Toulouse II, Toulouse cedex - France
Venue:
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Year:
2000

Citing 8
Cited 3

Semantic feature extraction from technical texts with limited human intervention

Semantic feature extraction from technical texts with limited human intervention
Explorations in Automatic Thesaurus Discovery

Explorations in Automatic Thesaurus Discovery
Machine Learning

Machine Learning
Short Query Linguistic Expansion Techniques: Palliating One-Word Queries by Providing Intermediate Structure to Text

SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
A Comparison of ILP and Propositional Systems on Propositional Traffic Data

ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Lexical semantic techniques for corpus analysis

Computational Linguistics - Special issue on using large corpora: II
Automatic extraction of subcategorization from corpora

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2

'NAIL': Artificial Intelligence Software for Learning Natural Language

ICGI '02 Proceedings of the 6th International Colloquium on Grammatical Inference: Algorithms and Applications
Learning semantic lexicons from a part-of-speech and semantically tagged corpus using inductive logic programming

The Journal of Machine Learning Research
Acquiring word-meaning mappings for natural language interfaces

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose an Inductive Logic Programming learning method which aims at automatically extracting special Noun-Verb (N-V) pairs from a corpus in order to build up semantic lexicons based on Pustejovsky's Generative Lexicon (GL) principles (Pustejovsky, 1995). In one of the components of this lexical model, called the qualia structure, words are described in terms of semantic roles. For example, the telic role indicates the purpose or function of an item (cut for knife), the agentive role its creation mode (build for house), etc. The qualia structure of a noun is mainly made up of verbal associations, encoding relational information. The Inductive Logic Programming learning method that we have developed enables us to automatically extract from a corpus N-V pairs whose elements are linked by one of the semantic relations defined in the qualia structure in GL, and to distinguish them, in terms of surrounding categorial context from N-V pairs also present in sentences of the corpus but not relevant. This method has been theoretically and empirically validated, on a technical corpus. The N-V pairs that have been extracted will further be used in information retrieval applications for index expansion.