This correspondence studies the influence of noise on the normalized compression distance (NCD), a measure that uses compressors to estimate the degree of similarity between two files. The influence is approximated by a first-order differential equation, which gives rise to a complex effect that explains why the NCD may take values greater than 1, as observed by other authors. The model is tested experimentally and fits the data well. Finally, the influence of noise on the clustering of files of different types is explored, finding that the NCD performs well even in the presence of quite high noise levels.
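The abstract does not state the NCD formula, so the following is only a minimal sketch that assumes the standard definition, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C(.) is the compressed size in bytes. The choice of zlib as the compressor, the helper names (`compressed_size`, `ncd`, `add_noise`), and the noise model (each byte replaced by a uniformly random byte with a given probability) are illustrative assumptions, not the setup used in the correspondence.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of `data` after zlib compression (assumed compressor)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance under the standard definition."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def add_noise(data: bytes, rate: float, seed: int = 0) -> bytes:
    """Hypothetical noise model: with probability `rate`, replace a byte
    with a uniformly random byte value."""
    rng = random.Random(seed)
    out = bytearray(data)
    for i in range(len(out)):
        if rng.random() < rate:
            out[i] = rng.randrange(256)
    return bytes(out)


if __name__ == "__main__":
    text = b"the quick brown fox jumps over the lazy dog " * 50
    # Compare a file against increasingly noisy copies of itself.
    for rate in (0.0, 0.05, 0.2, 0.5, 1.0):
        print(f"noise rate {rate:.2f}: NCD = {ncd(text, add_noise(text, rate)):.3f}")
```

Because real compressors add headers and do not compress perfectly, C(xy) can exceed C(x) + C(y) for unrelated inputs, which is one way a computed NCD can end up slightly above 1.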