A theory of continuous rates and applications to the theory of growth and obsolescence rates
Information Processing and Management: an International Journal
Improving information retrieval by combining user profile and document segmentation
Information Processing and Management: an International Journal
Text retrieval and filtering: analytic models of performance
Ordered similarity measures taking into account the rank of documents
Information Processing and Management: an International Journal
Performance measurement in a fuzzy retrieval environment
SIGIR '81 Proceedings of the 4th annual international ACM SIGIR conference on Information storage and retrieval: theoretical issues in information retrieval
Information Retrieval
Information Retrieval: Algorithms and Heuristics
Introduction to Modern Information Retrieval
Strong similarity measures for ordered sets of documents in information retrieval
Information Processing and Management: an International Journal
Editorial: expansion of the field of informetrics: Origins and consequences
Information Processing and Management: an International Journal - Special issue: Infometrics
Concept integration of document databases using different indexing languages
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
A triparametric family of cardinality-based fuzzy similarity measures
Fuzzy Sets and Systems
Personalized information retrieval based on context and ontological knowledge
The Knowledge Engineering Review
Measuring the incremental information value of documents
Information Sciences: an International Journal
Flexible method for a distance measure between communicative agents’ stored perceptions
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Neural network approach for learning of the world structure by cognitive agents
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Ordered sets of documents are increasingly common in information distribution systems such as information retrieval systems. Classical similarity measures for ordinary sets of documents therefore need to be extended to these ordered sets. This paper does so using fuzzy set techniques. First, a general similarity measure is developed that contains the classical strong similarity measures (such as Jaccard, Dice and Cosine) as well as the classical weak similarity measures (such as Recall and Precision).

These measures are then extended to compare fuzzy sets of documents. Measuring the similarity of ordered sets of documents is a special case, in which the higher the rank of a document, the lower its weight in the fuzzy set. Concrete forms of these similarity measures are presented. All of these measures are new, and those for the weak similarity measures are the first of their kind (other strong similarity measures were given in a previous paper by Egghe and Michel).

Some of these measures are then tested in the IR system Profil-Doc. The engine SPIRIT© extracts ranked document sets in three different contexts, each for 600 requests. The practical usability of the OS-measures is then discussed on the basis of these experiments.
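As a rough illustration of the idea of treating a ranked list as a fuzzy set, the sketch below assigns each document a membership weight that decreases with its rank and then compares two ranked lists. Everything specific here is an assumption for illustration and is not taken from the paper: the weight function `1 / rank`, the use of min/max sums as the fuzzy intersection and union, and the function names `rank_weights`, `fuzzy_jaccard`, `fuzzy_precision` and `fuzzy_recall`. The paper's concrete forms of the measures may differ.

```python
from typing import Dict, List


def rank_weights(ranked_docs: List[str]) -> Dict[str, float]:
    """Turn a ranked list into a fuzzy set of documents.

    Assumption (not from the paper): the membership weight is 1 / rank,
    so the top document (rank 1) has weight 1.0 and lower-ranked
    documents have smaller weights.
    """
    return {doc: 1.0 / rank for rank, doc in enumerate(ranked_docs, start=1)}


def _fuzzy_intersection_size(a: Dict[str, float], b: Dict[str, float]) -> float:
    # Sigma-count of the fuzzy intersection: sum of the pointwise minima.
    return sum(min(a.get(d, 0.0), b.get(d, 0.0)) for d in set(a) | set(b))


def fuzzy_jaccard(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Strong (symmetric) measure: |A ∩ B| / |A ∪ B| on fuzzy sets."""
    union = sum(max(a.get(d, 0.0), b.get(d, 0.0)) for d in set(a) | set(b))
    # Convention chosen here: two empty sets get similarity 0.0.
    return _fuzzy_intersection_size(a, b) / union if union else 0.0


def fuzzy_precision(retrieved: Dict[str, float], relevant: Dict[str, float]) -> float:
    """Weak (asymmetric) measure: |retrieved ∩ relevant| / |retrieved|."""
    size = sum(retrieved.values())
    return _fuzzy_intersection_size(retrieved, relevant) / size if size else 0.0


def fuzzy_recall(retrieved: Dict[str, float], relevant: Dict[str, float]) -> float:
    """Weak (asymmetric) measure: |retrieved ∩ relevant| / |relevant|."""
    size = sum(relevant.values())
    return _fuzzy_intersection_size(retrieved, relevant) / size if size else 0.0


if __name__ == "__main__":
    first = rank_weights(["d1", "d2"])
    swapped = rank_weights(["d2", "d1"])
    print(fuzzy_jaccard(first, first))
    print(fuzzy_jaccard(first, swapped))
```

Under these assumed weights, two lists holding the same documents in a different order are no longer judged identical: swapping the top two documents drops the fuzzy Jaccard value from 1.0 to 0.5, whereas the classical set-based Jaccard measure would still return 1.0.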