Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Elements of information theory
Elements of information theory
On a relation between graph edit distance and maximum common subgraph
Pattern Recognition Letters
A New Algorithm for Error-Tolerant Subgraph Isomorphism Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence
A graph distance metric based on the maximal common subgraph
Pattern Recognition Letters
ACM Computing Surveys (CSUR)
Graph distances using graph union
Pattern Recognition Letters
A graph distance metric combining maximum common subgraph and minimum common supergraph
Pattern Recognition Letters
Machine Learning
Self-organizing map for clustering in the graph domain
Pattern Recognition Letters - In memory of Professor E.S. Gelsema
Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters
IEEE Transactions on Computers
Case-Based Reasoning for Invoice Analysis and Recognition
ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Visualizing graph dynamics and similarity for enterprise network security and management
Proceedings of the Seventh International Symposium on Visualization for Cyber Security
A novel approach for clustering sentiments in Chinese blogs based on graph similarity
Computers & Mathematics with Applications
A mixed graph model for community detection
International Journal of Intelligent Information and Database Systems
Hi-index | 0.00 |
In this paper we describe work relating to clustering of document collections. We compare the conventional vector-model approach using cosine similarity and Euclidean distance to a novel method we have developed for clustering graph-based data with the standard k- means algorithm. The proposed method is evaluated using five different graph distance measures under three clustering performance indices. The experiments are performed on two separate document collections. The results show the graph-based approach performs as well as vector-based methods or even better when using normalized graph distance measures.