C4.5: programs for machine learning
Rule Induction with CN2: Some Recent Improvements
EWSL '91 Proceedings of the European Working Session on Machine Learning
Adapting classification rule induction to subgroup discovery
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Intelligent data analysis
There are three domains in living nature: archaea, bacteria and eukarya. It has been shown, through a number of multivariate tools, that codon usage, a 64-dimensional vector that establishes how often a given organism makes use of each codon, is related to domain. Another method is proposed here, based on rule and tree induction from the codon usage of several organisms. It is shown that domain can be identified through codon usage and a simple set of rules. Two methods were applied, CN2 and C4.5. The obtained rules describe the data better than other methods, in the sense that they are topologically interpretable and have phenomenological meaning.
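To make the notion of a codon usage vector concrete, the following is a minimal sketch of how such a 64-dimensional vector could be computed from a single in-frame coding sequence. The function name `codon_usage`, the alphabetical codon ordering, and the toy sequence are assumptions made for illustration; the abstract does not say how the vectors were built, and a per-organism vector would normally aggregate counts over many genes.

```python
from itertools import product

BASES = "ACGT"
# All 64 codons in a fixed (alphabetical) order; the ordering is an assumption.
CODONS = ["".join(p) for p in product(BASES, repeat=3)]


def codon_usage(sequence):
    """Return the relative frequency of each of the 64 codons.

    Hypothetical helper: assumes `sequence` is a coding sequence that
    starts in frame. Triplets containing ambiguous bases are skipped.
    """
    seq = sequence.upper().replace("U", "T")  # accept RNA as well as DNA
    counts = dict.fromkeys(CODONS, 0)
    # Step through the sequence three bases at a time.
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in counts:
            counts[codon] += 1
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(CODONS)
    return [counts[c] / total for c in CODONS]


# Toy sequence (not real data): ATG GCT GCT AAA TAA
usage = codon_usage("ATGGCTGCTAAATAA")
```

Each organism would then be one row of 64 such frequencies labelled with its domain, which is the kind of attribute-value table that rule and tree learners such as CN2 and C4.5 take as input.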