Simple fast algorithms for the editing distance between trees and related problems
SIAM Journal on Computing
Clustering transactions using large items
Proceedings of the eighth international conference on Information and knowledge management
ACM Computing Surveys (CSUR)
Evaluation of hierarchical clustering algorithms for document datasets
Proceedings of the eleventh international conference on Information and knowledge management
Xyleme: A Dynamic Warehouse for XML Data of the Web
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
CLOPE: a fast and effective clustering algorithm for transactional data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Information Systems - Special issue on web data integration
Fast Detection of XML Structural Similarity
IEEE Transactions on Knowledge and Data Engineering
Knowledge and Information Systems
XMine: a methodology for mining XML structure
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
XML schema clustering with semantic and hierarchical similarity measures
Knowledge-Based Systems
Investigating Semantic Measures in XML Clustering
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
XEdge: clustering homogeneous and heterogeneous XML documents using edge summaries
Proceedings of the 2008 ACM symposium on Applied computing
A heuristic algorithm for clustering rooted ordered trees
Intelligent Data Analysis
Document Clustering Using Incremental and Pairwise Approaches
Focused Access to XML Documents
Process of applying data mining techniques to XML data
Proceedings of the 2006 conference on Advances in Intelligent IT: Active Media Technology 2006
Semantic clustering of XML documents
ACM Transactions on Information Systems (TOIS)
A weighted common structure based clustering technique for XML documents
Journal of Systems and Software
Collaborative clustering of XML documents
Journal of Computer and System Sciences
XML documents clustering by structures
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Exploring dictionary-based semantic relatedness in labeled tree data
Information Sciences: an International Journal
Hierarchical clustering of XML documents focused on structural components
Data & Knowledge Engineering
Structural and semantic similarity for XML comparison
Proceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems
Hi-index | 0.00 |
We present a novel clustering algorithm to group the XML documents by similar structures. We introduce a Level structure format to represent the XML documents for efficient processing. We develop a global criterion function that do not require the pair-wise similarity to be computed between two individual documents, rather measures the similarity at clustering level utilising structural information of the XML documents. The experimental analysis shows the method to be fast and accurate.