Multidimensional similarity structure analysis
Multidimensional similarity structure analysis
Introduction to statistical pattern recognition (2nd ed.)
Introduction to statistical pattern recognition (2nd ed.)
Advanced algorithms for neural networks: a C++ sourcebook
Advanced algorithms for neural networks: a C++ sourcebook
From data mining to knowledge discovery: an overview
Advances in knowledge discovery and data mining
Handbook of Evolutionary Computation
Handbook of Evolutionary Computation
A Nonlinear Mapping for Data Structure Analysis
IEEE Transactions on Computers
RSFDGrC'03 Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing
A fast and elitist multiobjective genetic algorithm: NSGA-II
IEEE Transactions on Evolutionary Computation
Data mining of gene expression changes in Alzheimer brain
Artificial Intelligence in Medicine
Artificial neural networks for feature extraction and multivariate data projection
IEEE Transactions on Neural Networks
ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
Hi-index | 0.00 |
This paper introduces a multi-objective optimization approach to the problem of computing virtual reality spaces for the visual representation of relational structures (e.g. databases), symbolic knowledge and others, in the context of visual data mining and knowledge discovery. Procedures based on evolutionary computation are discussed. In particular, the NSGA-II algorithm is used as a framework for an instance of this methodology; simultaneously minimizing Sammon's error for dissimilarity measures, and mean cross-validation error on a k-nn pattern classifier. The proposed approach is illustrated with an example from cancer genomics data (e.g. lung cancer) by constructing virtual reality spaces resulting from multi-objective optimization. Selected solutions along the Pareto front approximation are used as nonlinearly transformed features for new spaces that compromise similarity structure preservation (from an unsupervised perspective) and class separability (from a supervised pattern recognition perspective), simultaneously. The possibility of spanning a range of solutions between these two important goals, is a benefit for the knowledge discovery and data understanding process. The quality of the set of discovered solutions is superior to the ones obtained separately, from the point of view of visual data mining.