LOF: identifying density-based local outliers
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Algorithms for Mining Distance-Based Outliers in Large Datasets
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Mining distance-based outliers in near linear time with randomization and a simple pruning rule
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
IEEE Transactions on Knowledge and Data Engineering
A tutorial on spectral clustering
Statistics and Computing
Unsupervised face-name association via commute distance
Proceedings of the 20th ACM international conference on Multimedia
A time-dependent enhanced support vector machine for time series regression
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
On the embeddability of random walk distances
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
We present a method to find outliers using ‘commute distance' computed from a random walk on graph Unlike Euclidean distance, commute distance between two nodes captures both the distance between them and their local neighborhood densities Indeed commute distance is the Euclidean distance in the space spanned by eigenvectors of the graph Laplacian matrix We show by analysis and experiments that using this measure, we can capture both global and local outliers effectively with just a distance based method Moreover, the method can detect outlying clusters which other traditional methods often fail to capture and also shows a high resistance to noise than local outlier detection method Moreover, to avoid the O(n3) direct computation of commute distance, a graph component sampling and an eigenspace approximation combined with pruning technique reduce the time to O(nlogn) while preserving the outlier ranking.