A Language-Based Similarity Measure

Authors:
Lionel Martin;Frédéric Moal
Affiliations:
-;-
Venue:
ECML '01 Proceedings of the 12th European Conference on Machine Learning
Year:
2001

Citing 10
Cited 0

Toward memory-based reasoning

Communications of the ACM - Special issue on parallelism
Incremental, instance-based learning of independent and graded concept descriptions

Proceedings of the sixth international workshop on Machine learning
A Weighted Nearest Neighbor Algorithm for Learning with Symbolic Features

Machine Learning
Relational instance-based learning with lists and terms

Machine Learning - Special issue on inductive logic programming
Scope Classification: An Instance-Based Learning Algorithm with a Rule-Based Characterisation

ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Rule-Based Similarity Measure

EWCBR '93 Selected papers from the First European Workshop on Topics in Case-Based Reasoning
Cases as terms: A feature term approach to the structured representation of cases

ICCBR '95 Proceedings of the First International Conference on Case-Based Reasoning Research and Development
Distance Induction in First Order Logic

ILP '97 Proceedings of the 7th International Workshop on Inductive Logic Programming
An Efficient Metric for Heterogeneous Inductive Learning Applications in the Attribute-Value Language
Rule induction and instance-based learning: a unified approach

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Abstract

This paper presents a unified framework for defining similarity measures over various formalisms (attribute-value, first-order logic, ...). The underlying idea is that the similarity between two objects depends not only on their attribute values but, more importantly, on the set of potentially relevant concept definitions for the problem at hand. In our framework, the user defines a language, by means of a grammar, to specify the similarity measure. Each term of the language represents a property of the objects. The similarity between two objects is the probability that the two objects either both satisfy or both reject the properties of the given language. When this probability cannot be computed exactly, we approximate it with a stochastic generation procedure. The measure can be applied to both clustering and classification tasks. An empirical evaluation on common classification problems shows very good accuracy.
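
The stochastic approximation described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's grammar or its generation procedure, so everything here is an assumption made for illustration: the attribute domains in DOMAINS, the toy grammar (an equality test, or a conjunction of two such tests), the uniform sampling, and the names sample_test, sample_property and estimate_similarity. The sketch only shows the general idea of estimating the similarity as the fraction of randomly generated properties on which two objects agree.

```python
import random

# Hypothetical attribute domains, chosen for illustration (not from the paper).
DOMAINS = {
    "color": ["red", "green", "blue"],
    "shape": ["round", "square"],
    "size": ["small", "large"],
}


def sample_test(rng):
    """Draw an atomic test 'attribute == value' uniformly at random."""
    attr = rng.choice(sorted(DOMAINS))
    value = rng.choice(DOMAINS[attr])
    return lambda obj: obj[attr] == value


def sample_property(rng):
    """Draw a term from an assumed toy grammar: test | test AND test."""
    first = sample_test(rng)
    if rng.random() < 0.5:
        return first
    second = sample_test(rng)
    return lambda obj: first(obj) and second(obj)


def estimate_similarity(a, b, n_samples=2000, seed=0):
    """Fraction of sampled properties that a and b both satisfy or both reject."""
    rng = random.Random(seed)
    agreements = 0
    for _ in range(n_samples):
        prop = sample_property(rng)
        if prop(a) == prop(b):
            agreements += 1
    return agreements / n_samples


# Hypothetical objects described as attribute-value dictionaries.
apple = {"color": "red", "shape": "round", "size": "small"}
brick = {"color": "red", "shape": "square", "size": "large"}

print(estimate_similarity(apple, apple))
print(estimate_similarity(apple, brick))
```

In this sketch, changing the grammar changes which properties are generated and therefore changes the measure, which mirrors the abstract's point that the user specifies the similarity through the language. A fixed seed keeps the estimate reproducible; a larger n_samples would be expected to reduce its variance.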