A practical approach to feature selection
ML92 Proceedings of the ninth international workshop on Machine learning
Selection of relevant features and examples in machine learning
Artificial Intelligence - Special issue on relevance
Assessing data mining results via swap randomization
ACM Transactions on Knowledge Discovery from Data (TKDD)
A review of feature selection techniques in bioinformatics
Bioinformatics
Fast variable selection for memetracker phrases time series prediction
Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments
Hi-index | 0.00 |
The dimensionality of biological data is often very high. Feature selection can be used to tackle the problem of high dimensionality. However, majority of the work in feature selection consists of supervised feature selection methods which require class labels. The problem further escalates when the data is time-series gene expression measurements that measure the effect of external stimuli on biological system. In this paper we propose an unsupervised method for gene selection from time-series gene expression data founded on statistical significance testing and swap randomization. We perform experiments with a publicly available mouse gene expression dataset and also a human gene expression dataset describing the exposure to asbestos. The results in both datasets show a considerable decrease in number of genes.