An information-theoretic perspective of tf—idf measures
Information Processing and Management: an International Journal
Nodal Distance Algorithm: Calculating a Phylogenetic Tree Comparison Metric
BIBE '03 Proceedings of the 3rd IEEE Symposium on BioInformatics and BioEngineering
Bioinformatics
An approach for determining evolutionary distance in network-based phylogenetic analysis
ISBRA'08 Proceedings of the 4th international conference on Bioinformatics research and applications
Hi-index | 0.00 |
Network-based phylogenetic analysis typically involves representing metabolic networks as graphs and analyzing the characteristics of vertex sets using set theoretic measures. Such approaches, however, fail to take into account the structural characteristics of graphs. In this paper we propose a new pattern recognition technique, TopEVM, to help representing metabolic networks as weighted vectors. We assign weights according to co-occurrence patterns and topology patterns of enzymes, where the former are determined in a manner similar to the Tf-Idfapproach used in document clustering, and the latter are determined using the degree centrality of enzymes. By comparing the weighted vectors of organisms, we determine the evolutionary distances and construct the phylogenetic trees. The resulting TopEVMtrees are compared to the previous NCEtrees with the NCBI Taxonomy trees as reference. It shows that TopEVMcan construct trees much closer to the NCBI Taxonomy trees than the previous NCEmethods.