Clustering genome data based on approximate matching
International Journal of Data Analysis Techniques and Strategies
Hi-index | 0.00 |
DNA sequences of several bacteria were classified using artificial neural network model. The “dinucleotides compositions” method was used to characterize the DNA sequences which transform every DNA sequence to a 16-dimension vector. Back-propagation artificial neural network was developed and trained using “leave-one-out” method. Results showed that the accuracy of classification was 84.3%, which proved that the model was satisfactory in summary. However, the author stated that the applicability of the characterization strategy needs to be improved.