Dimensionality Reduction of Unsupervised Data

Authors:
Affiliations:
Venue:
ICTAI '97 Proceedings of the 9th International Conference on Tools with Artificial Intelligence
Year:
1997

Citing 0
Cited 35

Dimensionality Reduction in Unsupervised Learning of Conditional Gaussian Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Rough Set-Aided System for Sorting WWW Bookmarks

WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
A Dynamic Approach to Reducing Dialog in On-Line Decision Guides

EWCBR '00 Proceedings of the 5th European Workshop on Advances in Case-Based Reasoning
Data reduction: feature selection

Handbook of data mining and knowledge discovery
Forecasting Association Rules Using Existing Data Sets

IEEE Transactions on Knowledge and Data Engineering
Subspace clustering for high dimensional data: a review

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
A selective sampling approach to active feature selection

Artificial Intelligence
Toward Integrating Feature Selection Algorithms for Classification and Clustering

IEEE Transactions on Knowledge and Data Engineering
Semi-supervised verb class discovery using noisy features

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Experiments on the Automatic Induction of German Semantic Verb Classes

Computational Linguistics
A fast and effective method to find correlations among attributes in databases

Data Mining and Knowledge Discovery
Dependency-based feature selection for clustering symbolic data

Intelligent Data Analysis
A correlation-based model for unsupervised feature selection

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Data dimensionality reduction with application to improving classification performance and explaining concepts of data sets

International Journal of Business Intelligence and Data Mining
Constructing Classification Rules Based on SVR and Its Derivative Characteristics

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
A New Approach to Division of Attribute Space for SVR Based Classification Rule Extraction

ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
An entropy clustering analysis based on genetic algorithm

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Fuzzy theory and technology with applications
Computational accounting in determining Chart of Accounts using nominal data analysis and concept of entropy

Expert Systems with Applications: An International Journal
An efficient approach for building customer profiles from business data

Expert Systems with Applications: An International Journal
Identifying fall-related injuries: Text mining the electronic medical record

Information Technology and Management
Feature selection for genomic data sets through feature clustering

International Journal of Data Mining and Bioinformatics
Tree view self-organisation of web content

Neurocomputing
A new approach to symbolic classification rule extraction based on SVM

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
A new clustering algorithm for transaction data via caucus

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Collaborative optimization of clustering by fuzzy c-means and weight determination by ReliefF

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
An efficient feature selection approach for clustering: using a Gaussian mixture model of data dissimilarity

ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part I
Using Data Mining Techniques to Discover Bias Patterns in Missing Data

Journal of Data and Information Quality (JDIQ)
Optimizing reservoir features in oil exploration management based on fusion of soft computing

Applied Soft Computing
Proactive control of manufacturing processes using historical data

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part II
Investigating a novel GA-based feature selection method using improved KNN classifiers

International Journal of Information and Communication Technology
Multiobjective optimization of indexes obtained by clustering for feature selection methods evaluation in genes expression microarrays

IDEAL'11 Proceedings of the 12th international conference on Intelligent data engineering and automated learning
Assessment of an unsupervised feature selection method for generative topographic mapping

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
An unsupervised feature selection framework based on clustering

PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
Feature selection using structural similarity

Information Sciences: an International Journal
RPCA: a novel preprocessing method for PCA

Advances in Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Dimensionality reduction is an important problem for efficient handling of large databases. Many feature selection methods exist for supervised data having class information. Little work has been done for dimensionality reduction of unsupervised data in which class information is not available. Principal Component Analysis (PCA) is often used. However, PCA creates new features. It is difficult to obtain intuitive understanding of the data using the new features only. In this paper we are concerned with the problem of determining and choosing the important original features for unsupervised data. Our method is based on the observation that removing an irrelevant feature from the feature set may not change the underlying concept of the data, but not so otherwise. We propose an entropy measure for ranking features, and conduct extensive experiments to show that our method is able to find the important features. Also it compares well with a similar feature ranking method (Relief) that requires class information unlike our method.