Feature selection using principal feature analysis

Authors:
Yijuan Lu;Ira Cohen;Xiang Sean Zhou;Qi Tian
Affiliations:
University of Texas at San Antonio, San Antonio, TX;Hewlett-Packard Labs, Palo Alto, CA;Siemens Medical Solutions USA Inc., Malvern, PA;University of Texas at San Antonio, San Antonio, TX
Venue:
Proceedings of the 15th international conference on Multimedia
Year:
2007

Citing 1
Cited 8

Sensitivity Methods for Variable Selection Using the MLP

NICROSP '96 Proceedings of the 1996 International Workshop on Neural Networks for Identification, Control, Robotics, and Signal/Image Processing (NICROSP '96)

CLUEBOX: a performance log analyzer for automated troubleshooting

WASL'08 Proceedings of the First USENIX conference on Analysis of system logs
On efficient use of multi-view data for activity recognition

Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras
Dominant Feature Identification for Industrial Fault Detection and Isolation Applications

Expert Systems with Applications: An International Journal
Comparison of feature selection methods in ECG signal classification

Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
sonLP: social network link prediction by principal component regression

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Improving protein complex classification accuracy using amino acid composition profile

Computers in Biology and Medicine
Execution time prediction for grid infrastructures based on runtime provenance data

WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Clustering approach to characterize haptic expressions of emotions

ACM Transactions on Applied Perception (TAP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Dimensionality reduction of a feature set is a common preprocessing step used for pattern recognition and classification applications. Principal Component Analysis (PCA) is one of the popular methods used, and can be shown to be optimal using different optimality criteria. However, it has the disadvantage that measurements from all the original features are used in the projection to the lower dimensional space. This paper proposes a novel method for dimensionality reduction of a feature set by choosing a subset of the original features that contains most of the essential information, using the same criteria as PCA. We call this method Principal Feature Analysis (PFA). The proposed method is successfully applied for choosing the principal features in face tracking and content-based image retrieval (CBIR) problems. Automated annotation of digital pictures has been a highly challenging problem for computer scientists since the invention of computers. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications including Web image search, online picture-sharing communities, and scientific experiments. In our work, by advancing statistical modeling and optimization techniques, we can train computers about hundreds of semantic concepts using example pictures from each concept. The ALIPR (Automatic Linguistic Indexing of Pictures - Real Time) system of fully automatic and high speed annotation for online pictures has been constructed. Thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process, have been tested. The experimental results show that a single computer processor can suggest annotation terms in real-time and with good accuracy.