Semantic feature extraction from technical texts with limited human intervention
Semantic feature extraction from technical texts with limited human intervention
Explorations in Automatic Thesaurus Discovery
Explorations in Automatic Thesaurus Discovery
Machine Learning
SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
A Comparison of ILP and Propositional Systems on Propositional Traffic Data
ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Lexical semantic techniques for corpus analysis
Computational Linguistics - Special issue on using large corpora: II
Automatic extraction of subcategorization from corpora
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
'NAIL': Artificial Intelligence Software for Learning Natural Language
ICGI '02 Proceedings of the 6th International Colloquium on Grammatical Inference: Algorithms and Applications
The Journal of Machine Learning Research
Acquiring word-meaning mappings for natural language interfaces
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
In this paper, we propose an Inductive Logic Programming learning method which aims at automatically extracting special Noun-Verb (N-V) pairs from a corpus in order to build up semantic lexicons based on Pustejovsky's Generative Lexicon (GL) principles (Pustejovsky, 1995). In one of the components of this lexical model, called the qualia structure, words are described in terms of semantic roles. For example, the telic role indicates the purpose or function of an item (cut for knife), the agentive role its creation mode (build for house), etc. The qualia structure of a noun is mainly made up of verbal associations, encoding relational information. The Inductive Logic Programming learning method that we have developed enables us to automatically extract from a corpus N-V pairs whose elements are linked by one of the semantic relations defined in the qualia structure in GL, and to distinguish them, in terms of surrounding categorial context from N-V pairs also present in sentences of the corpus but not relevant. This method has been theoretically and empirically validated, on a technical corpus. The N-V pairs that have been extracted will further be used in information retrieval applications for index expansion.