TreeRank: a similarity measure for nearest neighbor searching in phylogenetic databases

Authors:
Jason T. L. Wang;Huiyuan Shan;Dennis Shasha;William H. Piel
Affiliations:
New Jersey Institute of Technology;New Jersey Institute of Technology;New York University;University at Buffalo
Venue:
SSDBM '03 Proceedings of the 15th International Conference on Scientific and Statistical Database Management
Year:
2003

Citing 0
Cited 12

Unordered Tree Mining with Applications to Phylogeny
ICDE '04 Proceedings of the 20th International Conference on Data Engineering

Detecting similar Java classes using tree algorithms
Proceedings of the 2006 international workshop on Mining software repositories

Dynamic knowledge validation and verification for CBR teledermatology system
Artificial Intelligence in Medicine

Finding consensus trees by evolutionary, variable neighborhood search, and hybrid algorithms
Proceedings of the 10th annual conference on Genetic and evolutionary computation

Multi-granularity Parallel Computing in a Genome-Scale Molecular Evolution Application
PaCT '09 Proceedings of the 10th International Conference on Parallel Computing Technologies

Interactive knowledge validation and query refinement in CBR
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1

Mining tree-structured data on multicore systems
Proceedings of the VLDB Endowment

Validation of computational prediction of horizontal gene transfer events--XenoCluster
The Journal of Supercomputing

On the application of evolutionary algorithms to the consensus tree problem
EvoCOP'05 Proceedings of the 5th European conference on Evolutionary Computation in Combinatorial Optimization

Hierarchical clustering, languages and cancer
EuroGP'06 Proceedings of the 2006 international conference on Applications of Evolutionary Computing

XenoCluster: a grid computing approach to finding ancient evolutionary genetic anomalies
PaCT'05 Proceedings of the 8th international conference on Parallel Computing Technologies

A fast algorithmic technique for comparing large phylogenetic trees
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval

Abstract

Phylogenetic trees are unordered labeled trees in which each leaf node has a label and the order among siblings is unimportant. In this paper we propose a new similarity measure, called TreeRank, for phylogenetic trees and present an algorithm for computing TreeRank scores. Given a query or pattern tree P and a data tree D, the TreeRank score from P to D is a measure of the topological relationships in P that are found to be the same or similar in D. The proposed algorithm calculates the TreeRank score in O(M^2 + N) time where M is the number of nodes appearing in both P and D, and N is the number of nodes in D. We then develop a search engine that, given a query or pattern tree P and a database of trees D, finds and ranks the nearest neighbors of P in D where the "nearness" is measured by the proposed similarity function. This structure-based search engine is fully operational and is available on the World Wide Web.
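
The abstract does not define how a TreeRank score is computed, so the sketch below illustrates only the surrounding search workflow: scoring a pattern tree against every tree in a database and ranking the results. Everything in it is an assumption made for illustration. The nested-tuple tree encoding, the names `clusters`, `placeholder_similarity`, and `nearest_neighbors`, and above all the scoring function (a Jaccard overlap of leaf-label clusters) are hypothetical stand-ins, not the TreeRank measure. The stand-in does share one property with the trees described above: it ignores the order among siblings.

```python
from typing import Dict, FrozenSet, List, Set, Tuple, Union

# Assumed encoding: a tree is a leaf label (str) or a tuple of subtrees.
# Sibling order carries no meaning, matching unordered labeled trees.
Tree = Union[str, Tuple["Tree", ...]]


def clusters(tree: Tree) -> Set[FrozenSet[str]]:
    """Collect, for every internal node, the set of leaf labels below it."""
    found: Set[FrozenSet[str]] = set()

    def leaves(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(leaves(child) for child in node))
        found.add(below)
        return below

    leaves(tree)
    return found


def placeholder_similarity(pattern: Tree, data: Tree) -> float:
    """Jaccard overlap of clusters. A stand-in score, NOT TreeRank."""
    p, d = clusters(pattern), clusters(data)
    if not p and not d:
        return 1.0
    return len(p & d) / len(p | d)


def nearest_neighbors(
    pattern: Tree, database: Dict[str, Tree], k: int = 3
) -> List[Tuple[str, float]]:
    """Rank database trees by descending score, breaking ties by name."""
    scored = [
        (name, placeholder_similarity(pattern, tree))
        for name, tree in database.items()
    ]
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:k]


# Hypothetical toy data: a query tree and three database trees.
query: Tree = (("a", "b"), ("c", "d"))
database: Dict[str, Tree] = {
    "t1": (("b", "a"), ("d", "c")),    # same tree, siblings reordered
    "t2": (("a", "c"), ("b", "d")),    # different pairing of leaves
    "t3": ((("a", "b"), "c"), "d"),    # shares only the (a, b) grouping
}

ranking = nearest_neighbors(query, database)
```

With this toy data, `t1` ranks first because reordering siblings leaves the cluster sets unchanged. Substituting a real TreeRank implementation would mean replacing `placeholder_similarity` while keeping the ranking loop as is.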