Strong similarity measures for ordered sets of documents in information retrieval

Authors:
L. Egghe;C. Michel
Affiliations:
LUC, Universitaire Campus, B-3590 Diepenbeek, Belgium. UIA, Universiteitsplein 1, B-2610 Antwerpen (Wilrijk), Belgium;CEM-GRESIC, MSHA, D.U. Bordeaux III, Esplanade des Antilles, F-33607, Pessac Cedex, France
Venue:
Information Processing and Management: an International Journal
Year:
2002

Citing 7
Cited 11

A theory of continuous rates and applications to the theory of growth and obsolescence rates

Information Processing and Management: an International Journal
Improving information retrieval by combining user profile and document segmentation

Information Processing and Management: an International Journal
Text retrieval and filtering: analytic models of performance

Text retrieval and filtering: analytic models of performance
Ordered similarity measures taking into account the rank of documents

Information Processing and Management: an International Journal
Information Retrieval

Information Retrieval
Information Retrieval: Algorithms and Heuristics

Information Retrieval: Algorithms and Heuristics
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval

Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques

Information Processing and Management: an International Journal
Editorial: expansion of the field of informetrics: Origins and consequences

Information Processing and Management: an International Journal - Special issue: Infometrics
Fuzzy semantic tagging and flexible querying of XML documents extracted from the Web

Journal of Intelligent Information Systems
Classical retrieval and overlap measures satisfy the requirements for rankings based on a Lorenz curve

Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Identifying synonymous concepts in preparation for technology mining

Journal of Information Science
Description and classification of complex structured objects by applying similarity measures

International Journal of Approximate Reasoning
Editorial: Expansion of the field of informetrics: Origins and consequences

Information Processing and Management: an International Journal - Special issue: Infometrics
Classical retrieval and overlap measures satisfy the requirements for rankings based on a Lorenz curve

Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Similarity-Based Classification in Relational Databases

Fundamenta Informaticae
Feature selection strategies for automated classification of digital media content

Journal of Information Science
Using fuzzy conceptual graphs to map ontologies

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

A general method is presented to construct ordered similarity measures (OS-measures), i.e., similarity measures for ordered sets of documents (as, e.g., being the result of an IR-process), based on classical, well-known similarity measures for ordinary sets (measures such as Jaccard, Dice, Cosine or overlap measures). To this extent, we first present a review of these measures and their relationships.The method given here to construct OS-measures extends the one given by Michel in a previous paper so that it becomes applicable on any pair of ordered sets. Concrete expressions of this method, applied to the classical similarity measures, are given.Some of these measures are then tested in the IR-system Profil-Doc. The engine SPIRIT© extracts ranked document sets in three different contexts, each for 550 requests. The practical usability of the OS-measures is then discussed based on these experiments.