Tree edit distance from information theory

  • Authors:
  • Andrea Torsello;Edwin R. Hancock

  • Affiliations:
  • Dept. of Computer Science, University of York, Heslington, York, UK;Dept. of Computer Science, University of York, Heslington, York, UK

  • Venue:
  • GbRPR'03 Proceedings of the 4th IAPR international conference on Graph based representations in pattern recognition
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a method for estimating the cost of tree edit operations. The approach poses the problem as that of estimating a generative model for a set of tree samples. The generative model uses the tree-union as the structural archetype for every tree in the distribution and assigns to each node in the archetype the probability that the node is present in a sample. A minimum descriptor length formulation is then used to estimate the structure and parameters of this tree model as well as the node-correspondences between trees in the sample-set and the tree model.