An effective detection method for clustering similar XML DTDs using tag sequences

  • Authors:
  • Hyun-Joo Moon;Jae-Woo Yoo;Jongmyung Choi

  • Affiliations:
  • School of Computing, Soongsil University, Seoul, Korea;School of Computing, Soongsil University, Seoul, Korea;Dept. of Computer Engineering, Mokpo National University, Jeonnam, Korea

  • Venue:
  • ICCSA'07 Proceedings of the 2007 international conference on Computational science and Its applications - Volume Part II
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

The importance and usage of XML technologies increase with the explorative growth of Internet usage, heterogeneous computing platforms, and ubiquitous computing technologies. With the growth of XML usage, we need similarity detection method because it is a fundamental technology for efficient document management. In this paper, we introduce a similarity detection method that can check both semantic similarity and structural similarity between XML DTDs. For semantic checking, we adopt ontology technology, and we apply longest common string and longest nesting common string methods for structural checking. Our similarity detection method uses multi-tag sequences instead of traversing XML schema trees, so that it gets fast and reasonable results.