Research on measuring similarity between XML documents aims to support the effective management and retrieval of diverse XML documents. Previous work mostly proposes similarity measures that consider only the tag structure of XML documents. As a result, these measures can miscalculate the semantic similarity of XML content. In this paper, we propose a new similarity measure that considers not only the structural information of tags in XML documents but also the semantic information of the tags and the text content associated with them. Our experiments demonstrate that the proposed method improves the accuracy of similarity measurement compared to previous work.
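To make the idea of combining structural, tag-semantic, and text-content information concrete, the following is a minimal sketch and not the method proposed in the paper. It assumes a Jaccard overlap of root-to-element tag paths as the structural score, a cosine similarity of word counts as the content score, and a small hand-written synonym table as a stand-in for tag semantics. The names `TAG_SYNONYMS`, `xml_similarity`, and the weight `alpha` are hypothetical and chosen only for illustration.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical synonym table standing in for tag semantics (an assumption,
# not taken from the paper): each tag maps to a canonical form.
TAG_SYNONYMS = {"writer": "author"}


def tag_paths(root):
    """Collect the set of root-to-element tag paths, e.g. 'book/author'."""
    paths = set()

    def walk(node, prefix):
        tag = TAG_SYNONYMS.get(node.tag, node.tag)  # normalize synonymous tags
        path = f"{prefix}/{tag}" if prefix else tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def term_counts(root):
    """Count lowercase word tokens over all text content in the document."""
    text = " ".join(root.itertext())
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cosine(c1, c2):
    """Cosine similarity of two term-count vectors; 0.0 if either is empty."""
    dot = sum(c1[t] * c2[t] for t in c1.keys() & c2.keys())
    norm = math.sqrt(sum(v * v for v in c1.values())) * math.sqrt(
        sum(v * v for v in c2.values())
    )
    return dot / norm if norm else 0.0


def xml_similarity(xml_a, xml_b, alpha=0.5):
    """Weighted mix of structural and content similarity (alpha is assumed)."""
    root_a, root_b = ET.fromstring(xml_a), ET.fromstring(xml_b)
    structural = jaccard(tag_paths(root_a), tag_paths(root_b))
    content = cosine(term_counts(root_a), term_counts(root_b))
    return alpha * structural + (1 - alpha) * content
```

Under these assumptions, two documents that share a tag structure but differ entirely in text receive only the structural share of the score, which illustrates why a structure-only measure can overstate how similar their content is.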