Evaluation of analogical proportions through Kolmogorov complexity

Authors:
Meriam Bayoudh;Henri Prade;Gilles Richard
Affiliations:
Centre IRD de Guyane, Route de Montabo BP165, 97323 Cayenne CEDEX, France;IRIT, 118 Route de Narbonne, 31062 Toulouse Cedex 9, United Kingdom;IRIT, 118 Route de Narbonne, 31062 Toulouse Cedex 9, United Kingdom
Venue:
Knowledge-Based Systems
Year:
2012

Citing 25
Cited 0

The structure-mapping engine: algorithm and examples

Artificial Intelligence
The mechanisms of analogical learning

Similarity and analogical reasoning
Experiment on linguistically-based term associations

Information Processing and Management: an International Journal
An introduction to Kolmogorov complexity and its applications (2nd ed.)

An introduction to Kolmogorov complexity and its applications (2nd ed.)
Knowledge Representation and Metaphor

Knowledge Representation and Metaphor
Design, Analogy, and Creativity

IEEE Expert: Intelligent Systems and Their Applications
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Some complexity questions related to distributive computing(Preliminary Report)

STOC '79 Proceedings of the eleventh annual ACM symposium on Theory of computing
An information statistics approach to data stream and communication complexity

Journal of Computer and System Sciences - Special issue on FOCS 2002
Frequency estimates for statistical word similarity measures

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Corpus-based Learning of Analogies and Semantic Relations

Machine Learning
An analogy-oriented type hierarchy for linguistic creativity

Knowledge-Based Systems
An Introduction to Kolmogorov Complexity and Its Applications

An Introduction to Kolmogorov Complexity and Its Applications
Handling Analogical Proportions in Classical Logic and Fuzzy Logics Settings

ECSQARU '09 Proceedings of the 10th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
A uniform approach to analogies, synonyms, antonyms, and associations

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
CogSketch

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
The latent relation mapping engine: algorithm and experiments

Journal of Artificial Intelligence Research
A logical approach to reasoning by analogy

IJCAI'87 Proceedings of the 10th international joint conference on Artificial intelligence - Volume 1
Discovering pattern-based subspace clusters by pattern tree

Knowledge-Based Systems
An analogical learner for morphological analysis

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Nonapproximability of the normalized information distance

Journal of Computer and System Sciences
Kolmogorov complexity and combinatorial methods in communication complexity

Theoretical Computer Science
Information distance

IEEE Transactions on Information Theory
The similarity metric

IEEE Transactions on Information Theory
Clustering by compression

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we try to identify analogical proportions, i.e., statements of the form ''a is to b as c is to d'', expressed in linguistic terms. While it is conceivable to use an algebraic model for testing proportions such as ''2 is to 4 as 5 is to 10'', or even such as ''read is to reader as lecture is to lecturer'', there is no algebraic framework to support statements such as ''engine is to car as heart is to human'' or ''wine is to France as beer is to England'', helping to recognize them as meaningful analogical proportions. The idea is then to rely on text corpora, or even on the Web itself, where one may expect to find the pragmatics and the semantics of the words, in their common use. In that context, in order to attach a numerical value to the ''analogical ratio'' corresponding to the phrase ''a is to b'', we start from the works of Kolmogorov on complexity theory. This is the basis for a universal measure of the information content of a word a, or of a word a with respect to another one b, which, in practice, is estimated in a statistical manner. We investigate the link between a purely logical, recently introduced view of analogical proportions and its counterpart based on Kolmogorov theory. The criteria proposed for testing candidate proportions fit with the expected properties (symmetry, central permutation) of analogical proportions. This leads to a new computational method to define, and ultimately to try to detect, analogical proportions in natural language. Experiments with classifiers based on these ideas are reported, and results are rather encouraging with respect to the recognition of common sense linguistic analogies. The approach is also compared with existing works on similar problems.