Combining evidence with logic and preferences to learn relations from structured few sparse textual data

Authors:
Nadia Zerida;Khaldoun Zreik
Affiliations:
Paragraph Laboratory, Paris 8 University, Saint Denis, France;Paragraph Laboratory, Paris 8 University, Saint Denis, France
Venue:
ISIICT'09 Proceedings of the Third international conference on Innovation and Information and Communication Technology
Year:
2009

Citing 5
Cited 0

Efficient mining of emerging patterns: discovering trends and differences

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
An associative classifier based on positive and negative rules

Proceedings of the 9th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Hypothesis Visualization

INFOVIS '04 Proceedings of the IEEE Symposium on Information Visualization
Evaluating Six Candidate Solutions for the Small-Disjunct Problem and Choosing the Best Solution via Meta-Learning

Artificial Intelligence Review
Combining linguistic and structural descriptors for mining biomedical literature

Proceedings of the 2006 ACM symposium on Document engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

In the literature, it is commonly believed that learning from few data problem can be resolved by using classifiers that consider interclass relationships. In this work, we will adopt this point of view in learning from few sparse textual data, essentially, by considering the sparseness of the latter as a good support for inducing theories about generalization. Therefore, we opt for an inductive approach based on combining: evidence-based analysis of patterns, logic and preferences. More precisely, we are interested in supervised learning of biomedical articles by exploiting a multi-scale hybrid description and constrained pattern-based data mining techniques. Unlike existing works, we will highlight the relevance of the absence/weakness of patterns and we will associate to their absence a semantic value compared to their presence. The main characteristic of our approach is that of considering local and global contexts, which connect textual data by introducing regret ratio measures and generalized exclusive patterns in order to avoid a crisp effect between the absence and presence of patterns. Experimental results show the effectiveness of our approach.