Classifying molecular sequences using a linkage graph with their pairwise similarities
Theoretical Computer Science - Special issue: Genome informatics
Hi-index | 0.00 |
In order to find the function of genes from gene-expression profiles, the hierarchical clustering has been used generally. But this method has problems, for example a dendrogram tends to change by data dependence, therefore it is easy to be influenced of the error of anexperimental noise.To cope with the problems, we propose another type of clustering. We formulate the problem of the clustering as a graph-covering problem by connected subgraphs wherevertices and edges of the graph denote genes and similarities between genes, respectively. The method is based on the p-quasi complete linkage algorithm for describing clusters. We present the outline of an algorithm for clustering a set of genes into subsets corresponding top-quasi complete linkage graphs.