A note on genetic algorithms for large-scale feature selection
Pattern Recognition Letters
Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Further Research on Feature Selection and Classification Using Genetic Algorithms
Proceedings of the 5th International Conference on Genetic Algorithms
A Branch and Bound Algorithm for Feature Subset Selection
IEEE Transactions on Computers
Genetic Programming and Evolvable Machines
Learning Classifier Systems, From Foundations to Applications
Feature Extraction for the k-Nearest Neighbour Classifier with Genetic Programming
EuroGP '01 Proceedings of the 4th European Conference on Genetic Programming
Handbook of data mining and knowledge discovery
Genetic Programming with a Genetic Algorithm for Feature Construction and Selection
Genetic Programming and Evolvable Machines
Improving the human readability of features constructed by genetic programming
Proceedings of the 9th annual conference on Genetic and evolutionary computation
Comparison of Strategies Based on Evolutionary Computation for the Design of Similarity Functions
Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Feature construction and selection using genetic programming and a genetic algorithm
EuroGP'03 Proceedings of the 6th European conference on Genetic programming
A survey on the application of genetic programming to classification
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Expert Systems with Applications: An International Journal
Embedding monte carlo search of features in tree-based ensemble methods
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Hi-index | 0.00 |
We have previously shown how a genetic algorithm (GA) can be used to perform "data mining," the discovery of particular/important data within large datasets, by finding optimal data classifications using known examples. However, these approaches, while successful, limited data relationships to those that were "fixed" before the GA run. We report here on an extension of our previous work, substituting a genetic program (GP) for a GA. The GP could optimize data classification, as did the GA, but could also determine the functional relationships among the features. This gave improved performance and new information on important relationships among features. We discuss the overall approach, and compare the effectiveness of the GA vs. GP on a biochemistry problem, the determination of the involvement of bound water molecules in protein interactions.