Analysis of tree edit distance on XML data

  • Authors:
  • Yu-Fang Wu; Shu-Fen Lin; Hsu-Chun Yen

  • Affiliations:
  • National Taiwan University, Taipei, Taiwan, R.O.C. (all three authors)

  • Venue:
  • CIIT '07: The Sixth IASTED International Conference on Communications, Internet, and Information Technology
  • Year:
  • 2007

Abstract

The problem of comparing tree structures arises in many areas of computer science and engineering, including XML data processing. Tree edit distance is a common measure that quantifies the difference between two tree structures. Efficient algorithms for embedding tree edit distance are therefore important for comparing large streaming XML document trees. In this paper, we propose a new algorithm for obtaining the edit distance between unlabeled ordered trees derived from streaming XML data. Compared with previous work, our contribution lies in simplifying the procedure for obtaining the tree edit distance without increasing the time or space complexity. We also analyze the upper and lower bounds on the distortion, as well as the error probability, of our algorithm.
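
For readers unfamiliar with the measure, the following is a minimal sketch of the exact edit distance between two unlabeled ordered trees. It is not the algorithm proposed in the paper, which the abstract describes as an embedding method with bounded distortion for streaming data; it is the classical rightmost-root forest recursion, shown only to make the quantity concrete. It assumes unit cost for inserting or deleting a node, and, since the trees are unlabeled, no relabeling cost. The nested-tuple representation and the names `size`, `forest_distance`, and `tree_edit_distance` are choices made for this illustration.

```python
from functools import lru_cache

# Assumed representation: a tree is a tuple of its child trees, so a leaf is ().
# A forest is a tuple of trees, which makes a node's children a forest too.


def size(forest):
    """Number of nodes in a forest."""
    return sum(1 + size(tree) for tree in forest)


@lru_cache(maxsize=None)
def forest_distance(f, g):
    """Exact edit distance between two ordered, unlabeled forests."""
    if not f:
        return size(g)  # insert every node of g
    if not g:
        return size(f)  # delete every node of f
    v, w = f[-1], g[-1]  # rightmost trees, i.e. the children of each rightmost root
    return min(
        # Delete the rightmost root of f; its children take its place.
        forest_distance(f[:-1] + v, g) + 1,
        # Insert the rightmost root of g; its children take its place.
        forest_distance(f, g[:-1] + w) + 1,
        # Match the two rightmost roots (free, because nodes are unlabeled).
        forest_distance(v, w) + forest_distance(f[:-1], g[:-1]),
    )


def tree_edit_distance(t1, t2):
    """Edit distance between two trees, each treated as a one-tree forest."""
    return forest_distance((t1,), (t2,))
```

For example, `tree_edit_distance(((), ()), (((),),))` compares a root with two leaf children against a three-node chain. Memoization keeps the sketch usable on small trees, but it holds entire trees in memory and is not suited to the large streaming documents the paper targets.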