Feature Subset Selection and Feature Ranking for Multivariate Time Series

Authors:
Hyunjin Yoon;Kiyoung Yang;Cyrus Shahabi
Affiliations:
-;-;IEEE
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
2005

Citing 15
Cited 25

Neural networks for pattern recognition

Neural networks for pattern recognition
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
Data mining: concepts and techniques

Data mining: concepts and techniques
Unsupervised Feature Selection Using Feature Similarity

IEEE Transactions on Pattern Analysis and Machine Intelligence
Motion-Based Recognition

Motion-Based Recognition
Gene Selection for Cancer Classification using Support Vector Machines

Machine Learning
An introduction to variable and feature selection

The Journal of Machine Learning Research
Use of the zero norm with linear models and kernel methods

The Journal of Machine Learning Research
Pattern Recognition Algorithms for Data Mining: Scalability, Knowledge Discovery, and Soft Granular Computing

Pattern Recognition Algorithms for Data Mining: Scalability, Knowledge Discovery, and Soft Granular Computing
A PCA-based similarity measure for multivariate time series

Proceedings of the 2nd ACM international workshop on Multimedia databases
Temporal classification: extending the classification paradigm to multivariate time series

Temporal classification: extending the classification paradigm to multivariate time series
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Active feature selection using classes

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Performance analysis of time-distance gait parameters under different speeds

AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Variable grouping in multivariate time series via correlation

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Continuous archival and analysis of user data in virtual and immersive game environments

CARPE '05 Proceedings of the 2nd ACM workshop on Continuous archival and retrieval of personal experiences
On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
An efficient k nearest neighbor search for multivariate time series

Information and Computation
Support feature machine for classification of abnormal brain activity

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Novel Algorithm for Coexpression Detection in Time-Varying Microarray Data Sets

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Advances in clustering and visualization of time series using GTM through time

Neural Networks
Mutual Information Based Input Variable Selection Algorithm and Wavelet Neural Network for Time Series Prediction

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Using acceleration measurements for activity recognition: An effective learning algorithm for constructing neural classifiers

Pattern Recognition Letters
Bagging Constraint Score for feature selection with pairwise constraints

Pattern Recognition
Relationship preserving feature selection for unlabelled clinical trials time-series

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Discrete wavelet transform-based time series analysis and mining

ACM Computing Surveys (CSUR)
A review on time series data mining

Engineering Applications of Artificial Intelligence
Identifying user preferences with Wrapper-based Decision Trees

Expert Systems with Applications: An International Journal
Two-Step Cross-Entropy Feature Selection for Microarrays—Power Through Complementarity

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
User-perceived quality assessment of streaming media using reduced feature sets

ACM Transactions on Internet Technology (TOIT)
Review: Situation identification techniques in pervasive computing: A review

Pervasive and Mobile Computing
Discovering key sequences in time series data for pattern classification

ICDM'06 Proceedings of the 6th Industrial Conference on Data Mining conference on Advances in Data Mining: applications in Medicine, Web Mining, Marketing, Image and Signal Mining
Time series relevance determination through a topology-constrained hidden markov model

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Ensemble based positive unlabeled learning for time series classification

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Positive unlabeled learning for time series classification

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Time-series data mining

ACM Computing Surveys (CSUR)
Feature selection techniques with class separability for multivariate time series

Neurocomputing
Early prediction on imbalanced multivariate time series

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
An approach to dimensionality reduction in time series

Information Sciences: an International Journal
Unsupervised categorization of human motion sequences

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

Feature subset selection (FSS) is a known technique to preprocess the data before performing any data mining tasks, e.g., classification and clustering. FSS provides both cost-effective predictors and a better understanding of the underlying process that generated the data. We propose a family of novel unsupervised methods for feature subset selection from Multivariate Time Series (MTS) based on Common Principal Component Analysis, termed {\schmi CL}e{\schmi V}er. Traditional FSS techniques, such as Recursive Feature Elimination (RFE) and Fisher Criterion (FC), have been applied to MTS data sets, e.g., Brain Computer Interface (BCI) data sets. However, these techniques may lose the correlation information among features, while our proposed techniques utilize the properties of the principal component analysis to retain that information. In order to evaluate the effectiveness of our selected subset of features, we employ classification as the target data mining task. Our exhaustive experiments show that {\schmi CL}e{\schmi V}er outperforms RFE, FC, and random selection by up to a factor of two in terms of the classification accuracy, while taking up to 2 orders of magnitude less processing time than RFE and FC.