Unsupervised Feature Selection Using Feature Similarity

Authors:
Pabitra Mitra; C. A. Murthy; Sankar K. Pal
Affiliations:
Indian Statistical Institute, Calcutta, India; Indian Statistical Institute, Calcutta, India; Indian Statistical Institute, Calcutta, India
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2002


Abstract

In this article, we describe an unsupervised feature selection algorithm suitable for data sets that are large in both dimension and size. The method measures the similarity between features and removes the redundancy among them. It requires no search and is therefore fast. A new feature similarity measure, called the maximum information compression index, is introduced. The algorithm is generic in nature and is capable of multiscale representation of data sets. Its superiority in terms of speed and performance is demonstrated extensively on real-life data sets of different sizes and dimensions. It is also shown how redundancy and information loss in feature selection can be quantified with an entropy measure.
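
The abstract names the similarity measure but does not define it. As a rough illustration only, the sketch below assumes the commonly cited definition of the index: the smaller eigenvalue of the 2x2 covariance matrix of two features, which is zero when the features are linearly dependent. The function names `mici` and `select_features` are hypothetical, and the greedy loop is a simplified stand-in for similarity-based redundancy removal without search, not the authors' exact procedure.

```python
import math


def mici(x, y):
    """Assumed definition: smaller eigenvalue of the 2x2 covariance matrix of x and y."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mx = sum(x) / n
    my = sum(y) / n
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    # Eigenvalues of [[vx, cxy], [cxy, vy]] are (trace +/- sqrt(trace^2 - 4*det)) / 2.
    trace = vx + vy
    det = vx * vy - cxy ** 2
    disc = max(trace * trace - 4.0 * det, 0.0)  # guard against rounding below zero
    return max((trace - math.sqrt(disc)) / 2.0, 0.0)


def select_features(columns, k):
    """Simplified greedy sketch (an assumption, not the published algorithm).

    Repeatedly keep the feature whose k-th most similar neighbour is closest,
    discard those k neighbours as redundant, and continue with what is left.
    Returns the sorted indices of the retained features.
    """
    m = len(columns)
    # Pairwise dissimilarities are computed once; no subset search is involved.
    dist = [[mici(columns[i], columns[j]) if i != j else 0.0 for j in range(m)]
            for i in range(m)]
    remaining = set(range(m))
    kept = []
    while remaining:
        kk = min(k, len(remaining) - 1)
        if kk <= 0:
            # Too few features left to form a neighbourhood: keep them all.
            kept.extend(remaining)
            break
        best = None
        for i in sorted(remaining):
            neighbours = sorted((dist[i][j], j) for j in remaining if j != i)[:kk]
            radius = neighbours[-1][0]  # distance to the kk-th nearest feature
            if best is None or radius < best[0]:
                best = (radius, i, [j for _, j in neighbours])
        _, chosen, redundant = best
        kept.append(chosen)
        remaining.discard(chosen)
        remaining.difference_update(redundant)
    return sorted(kept)


if __name__ == "__main__":
    a = [1, 2, 3, 4, 5]
    b = [2, 4, 6, 8, 10]   # exact multiple of a, hence redundant
    c = [1, -1, 2, -2, 0]  # not a linear function of a
    print(mici(a, b), mici(a, c))
    print(select_features([a, b, c], k=1))
```

In this sketch, a larger `k` discards more features per step and yields a coarser representation, which is one way to read the abstract's remark about multiscale representation; that reading is an interpretation rather than something the abstract spells out.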