Using semantic distance in a content-based heterogeneous information retrieval system

Authors:
Ahmad El Sayed;Hakim Hacid;Djamel Zighed
Affiliations:
University of Lyon 2, ERIC Laboratory, Bron cedex, France;University of Lyon 2, ERIC Laboratory, Bron cedex, France;University of Lyon 2, ERIC Laboratory, Bron cedex, France
Venue:
MCD'07 Proceedings of the 3rd ECML/PKDD international conference on Mining complex data
Year:
2007

Abstract

This paper makes two contributions to semantic retrieval of heterogeneous information, that is, documents composed of both text and images: (1) a new context-based semantic distance measure for textual data, and (2) an IR system that indexes documents conceptually and automatically, taking their heterogeneous content into account through a domain-specific ontology. The proposed semantic distance measure is used to fuzzify the domain ontology automatically. Both proposals are evaluated. On a set of word pairs, the semantic distance measure reaches a correlation ratio of 0.89 with human judgments, outperforming all the other measures considered. Preliminary results for the combined approach, obtained on a specialized corpus of web pages, are also reported.
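
The evaluation against human judgments mentioned above can be illustrated with a minimal sketch. It assumes the reported correlation is a Pearson coefficient between human ratings and the scores of the measure, and that distances lie in [0, 1] so they can be turned into similarities as 1 - d; neither assumption is confirmed by the abstract. The function name `pearson` and all numbers below are hypothetical and are not data from the paper.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: one entry per word pair (NOT taken from the paper).
human_ratings = [3.9, 3.1, 1.2, 0.4]      # higher = judged more similar
measure_distances = [0.1, 0.3, 0.7, 0.9]  # assumed normalized to [0, 1]

# Assumption: convert distance to similarity so both scales point the same way.
measure_similarities = [1 - d for d in measure_distances]

r = pearson(human_ratings, measure_similarities)
print(f"correlation with human judgments: {r:.2f}")
```

A value of r close to 1 would indicate that the measure ranks and spaces word pairs much as human annotators do; comparing several measures on the same word pairs then reduces to comparing their coefficients.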