C4.5: programs for machine learning
C4.5: programs for machine learning
Machine Learning
ECML '95 Proceedings of the 8th European Conference on Machine Learning
Generating Accurate Rule Sets Without Global Optimization
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Knowledge Discovery in Bioinformatics: Techniques, Methods, and Applications (Wiley Series in Bioinformatics)
Hi-index | 0.00 |
To explore the mechanism of protein folding is one of the important topics in protein research. The accurate prediction of protein folding rate change is helpful and useful in protein design. In earlier study, we have firstly analyzed the prediction of folding rate change upon single point mutation and constructed a non-redundant dataset of F467. F467 consists of 467 mutants with various features and widely distributed on secondary structure, solvent accessibility, conservation score and long-range contacts. In this work, we therefore focused on effectively developing the knowledge in F467 dataset. We have systematically analyzed the dataset and presented several representative data mining techniques, including decision tree, decision table and association rule algorithms. Furthermore, we have interpreted, evaluated, and compared the knowledge obtained from different techniques. The experimental results showed that the present approach can effectively develop the knowledge in the dataset and the outcomes can increase the understanding of predicting protein folding rate change upon single mutation. We have also created a website with related information about this work and it is freely available at http://bioinformatics.myweb.hinet.net/kdfreedom.htm .