Learning Logical Descriptions for Document Understanding: A Rough Sets-Based Approach

Authors:
Emmanuelle Martienne;Mohamed Quafafou
Affiliations:
-;-
Venue:
RSCTC '98 Proceedings of the First International Conference on Rough Sets and Current Trends in Computing
Year:
1998

Citing 8
Cited 0

Inductive logic programming

New Generation Computing - Selected papers from the international workshop on algorithmic learning theory,1990
The Utility of Knowledge in Inductive Learning

Machine Learning
Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Sets: Theoretical Aspects of Reasoning about Data
Learning Logical Definitions from Relations

Machine Learning
A Guided Tour Through Hypothesis Spaces in ILP

ECML '95 Proceedings of the 8th European Conference on Machine Learning
Using Empirical Subsumption to Reduce the Search Space in Learning

ICCS '95 Proceedings of the Third International Conference on Conceptual Structures: Applications, Implementation and Theory
Learning First Order Theories

ISMIS '94 Proceedings of the 8th International Symposium on Methodologies for Intelligent Systems
Learning Flexible Concepts from Uncertain Data

ISMIS '97 Proceedings of the 10th International Symposium on Foundations of Intelligent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Inductive learning systems in a logical framework are prone to difficulties when dealing with huge amount of information. In particular, the learning cost is greatly increased, and it becomes difficult to find descriptions of concepts in a reasonable time. In this paper, we present a learning approach based on Rough Set Theory, and more especially on its basic notion of concept approximation. In accordance with RST, a learning process is splitted into three steps, namely (1) partitioning of knowledge, (2) approximation of the target concept, and finally (3) induction of a logical description of this concept. The second step of approximation reduces the volume of the learning data, by computing well-chosen portions of the background knowledge which represent approximations of the concept to learn. Then, only one of these portions is used during the induction of the description, which allows for reducing the learning cost. In the first part of this paper, we report how RST's basic notions namely indiscernibility, as well as lower and upper approximations of a concept have been adapted in order to cope with a logical framework. In the remainder of the paper, some empirical results obtained with a concrete implementation of the approach, i.e., the EAGLE system, are given. These results show the relevance of the approach, in terms of learning cost gain, on a learning problem related to the document understanding.