A new class of distances appropriate for measuring similarity relations between sequences, with one type of similarity per distance, is studied. We propose a new "normalized information distance," based on the noncomputable notion of Kolmogorov complexity, and show that it belongs to this class and minorizes every computable distance in the class (that is, it is universal in that it discovers all computable similarities). We demonstrate that it is a metric and call it the similarity metric. This theory forms the foundation for a new practical tool. To demonstrate generality and robustness, we give two distinctive applications in widely divergent areas, using standard compression programs such as gzip and GenCompress. First, we compare whole mitochondrial genomes and infer their evolutionary history. This results in a first completely automatically computed whole mitochondrial phylogeny tree. Second, we fully automatically compute the language tree of 52 different languages.
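The abstract does not spell out how a compressor stands in for Kolmogorov complexity, so the following is only a minimal sketch under stated assumptions. It uses the commonly cited compression-based approximation (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed length; this formula is an assumption here, not quoted from the text above. Python's `zlib` (DEFLATE, the same algorithm gzip uses) replaces the gzip program, GenCompress is not used, and the names `compressed_size` and `ncd` and the synthetic sequences are hypothetical.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Compressed length in bytes: a computable stand-in for Kolmogorov complexity."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Compression-based approximation of the normalized information distance.

    Assumed formula (not given in the abstract):
    (C(xy) - min(C(x), C(y))) / max(C(x), C(y)).
    """
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def random_dna(rng: random.Random, length: int) -> bytes:
    """Synthetic DNA-like sequence over the alphabet A, C, G, T (illustration only)."""
    return bytes(rng.choice(b"ACGT") for _ in range(length))


if __name__ == "__main__":
    rng = random.Random(0)
    original = random_dna(rng, 2000)

    # A close relative: the same sequence with 20 random point substitutions.
    mutated = bytearray(original)
    for pos in rng.sample(range(len(mutated)), 20):
        mutated[pos] = rng.choice(b"ACGT")
    relative = bytes(mutated)

    # An unrelated sequence drawn independently.
    unrelated = random_dna(rng, 2000)

    print("original vs relative :", round(ncd(original, relative), 3))
    print("original vs unrelated:", round(ncd(original, unrelated), 3))
```

With a real compressor the values are only approximate: they need not be exactly 0 for identical inputs or stay strictly within [0, 1], and DEFLATE's 32 KB window limits how much shared structure it can detect in long inputs. A pairwise matrix of such distances is the kind of input a tree-building method would consume in the phylogeny and language-tree applications.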