Parallel distributed processing: explorations in the microstructure, vol. 2: psychological and biological models
Feature subset selection by Bayesian network-based optimization
Artificial Intelligence
Efficient and Accurate Parallel Genetic Algorithms
Efficient and Accurate Parallel Genetic Algorithms
Feature Selection for Knowledge Discovery and Data Mining
Feature Selection for Knowledge Discovery and Data Mining
Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation
Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation
A Survey of Optimization by Building and Using Probabilistic Models
Computational Optimization and Applications
From Recombination of Genes to the Estimation of Distributions I. Binary Parameters
PPSN IV Proceedings of the 4th International Conference on Parallel Problem Solving from Nature
Feature Subset Selection By Estimation Of Distribution Algorithms
GECCO '02 Proceedings of the Genetic and Evolutionary Computation Conference
ICPPW '05 Proceedings of the 2005 International Conference on Parallel Processing Workshops
On the sample complexity of learning Bayesian networks
UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
Parallel Implementation of EDAs Based on Probabilistic Graphical Models
IEEE Transactions on Evolutionary Computation
Order or not: does parallelization of model building in hBOA affect its scalability?
Proceedings of the 9th annual conference on Genetic and evolutionary computation
Estimation of distribution algorithms: from available implementations to potential developments
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Hi-index | 0.00 |
This paper describes the application of a collection of data mining methods to solve a calibration problem in a quantitative chemistry environment. Experimental data obtained from reactions which involve known concentrations of two or more components are used to calibrate a model that, later, will be used to predict the (unknown) concentrations of those components in a new reaction. This problem can be seen as a selection + prediction one, where the goal is to obtain good values for the variables to predict while minimizing the number of the input variables needed, taking a small subset of really significant ones. Initial approaches to the problem were principal components analysis and filtering combined with two prediction techniques: artificial neural networks and partial least squares regression. Finally, a parallel estimation of distribution algorithm was used to reduce the number of variables to be used for prediction, yielding the best models for all the considered problems.