A training algorithm for optimal margin classifiers
COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
The nature of statistical learning theory
The nature of statistical learning theory
Machine Learning
Selection of relevant features and examples in machine learning
Artificial Intelligence - Special issue on relevance
Large margin classification using the perceptron algorithm
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
Theoretical Computer Science
Feature Selection via Concave Minimization and Support Vector Machines
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Feature Selection Via Mathematical Programming
INFORMS Journal on Computing
Inference for the Generalization Error
Machine Learning
Training Support Vector Machines: an Application to Face Detection
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Sparse bayesian learning and the relevance vector machine
The Journal of Machine Learning Research
Estimation of Dependences Based on Empirical Data: Springer Series in Statistics (Springer Series in Statistics)
Empirical risk minimization for support vector classifiers
IEEE Transactions on Neural Networks
An introduction to variable and feature selection
The Journal of Machine Learning Research
Second Order Cone Programming Formulations for Feature Selection
The Journal of Machine Learning Research
Sparse Multinomial Logistic Regression: Fast Algorithms and Generalization Bounds
IEEE Transactions on Pattern Analysis and Machine Intelligence
Variable selection and ranking for analyzing automobile traffic accident data
Proceedings of the 2005 ACM symposium on Applied computing
Using recursive classification to discover predictive features
Proceedings of the 2005 ACM symposium on Applied computing
Feature Subset Selection and Feature Ranking for Multivariate Time Series
IEEE Transactions on Knowledge and Data Engineering
Online feature selection for pixel classification
ICML '05 Proceedings of the 22nd international conference on Machine learning
Combined SVM-Based Feature Selection and Classification
Machine Learning
Active learning via transductive experimental design
ICML '06 Proceedings of the 23rd international conference on Machine learning
Multi-class feature selection for texture classification
Pattern Recognition Letters
Variable selection in kernel Fisher discriminant analysis by means of recursive feature elimination
Computational Statistics & Data Analysis
Non-parametric classifier-independent feature selection
Pattern Recognition
Analysis of SVM regression bounds for variable ranking
Neurocomputing
Feature selection for the SVM: An application to hypertension diagnosis
Expert Systems with Applications: An International Journal
A Stochastic Algorithm for Feature Selection in Pattern Recognition
The Journal of Machine Learning Research
Feature selection in a kernel space
Proceedings of the 24th international conference on Machine learning
Direct convex relaxations of sparse SVM
Proceedings of the 24th international conference on Machine learning
Minimum reference set based feature selection for small sample classifications
Proceedings of the 24th international conference on Machine learning
Supervised feature selection via dependence estimation
Proceedings of the 24th international conference on Machine learning
Sparse eigen methods by D.C. programming
Proceedings of the 24th international conference on Machine learning
A hybrid genetic algorithm for feature selection wrapper based on mutual information
Pattern Recognition Letters
Feature selection and blind source separation in an EEG-based brain-computer interface
EURASIP Journal on Applied Signal Processing
A bilinear formulation for vector sparsity optimization
Signal Processing
Development of Two-Stage SVM-RFE Gene Selection Strategy for Microarray Expression Data Analysis
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Gene expression modeling through positive boolean functions
International Journal of Approximate Reasoning
Applying genetic algorithms and support vector machines to the gene selection problem
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - VIII Brazilian Symposium on Neural Networks
An efficient ant colony optimization approach to attribute reduction in rough set theory
Pattern Recognition Letters
Kernel discriminant analysis based feature selection
Neurocomputing
Fusion of feature selection methods for pairwise scoring SVM
Neurocomputing
Fast Optimization Methods for L1 Regularization: A Comparative Study and Two New Approaches
ECML '07 Proceedings of the 18th European conference on Machine Learning
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Expert Systems with Applications: An International Journal
Arbitrary norm support vector machines
Neural Computation
A Regularized Framework for Feature Selection in Face Detection and Authentication
International Journal of Computer Vision
A wrapper method for feature selection using Support Vector Machines
Information Sciences: an International Journal
Partially supervised feature selection with regularized linear models
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Non-monotonic feature selection
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Efficient linearization of tree kernel functions
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Feature Selection by Transfer Learning with Linear Regularized Models
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
A mathematical programming formulation for sparse collaborative computer aided diagnosis
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Kernel Methods in Computer Vision
Foundations and Trends® in Computer Graphics and Vision
A sparsity-enforcing method for learning face features
IEEE Transactions on Image Processing
Discriminative semi-supervised feature selection via manifold regularization
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Zero Norm Least Squares Proximal SVR
PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
Reverse engineering of tree kernel feature spaces
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Improved variable and value ranking techniques for mining categorical traffic accident data
Expert Systems with Applications: An International Journal
Recovering sparse signals with a certain family of nonconvex penalties and DC programming
IEEE Transactions on Signal Processing
Model Selection: Beyond the Bayesian/Frequentist Divide
The Journal of Machine Learning Research
The Journal of Machine Learning Research
A genetic embedded approach for gene selection and classification of microarray data
EvoBIO'07 Proceedings of the 5th European conference on Evolutionary computation, machine learning and data mining in bioinformatics
L0-constrained regression for data mining
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Softening the margin in discrete SVM
ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
A regularized approach to feature selection for face detection
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
Feature selection by nonparametric Bayes error minimization
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Linear Separability of Gene Expression Data Sets
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Feature selection for SVM via optimization of kernel polarization with Gaussian ARD kernels
Expert Systems with Applications: An International Journal
Concave programming for minimizing the zero-norm over polyhedral sets
Computational Optimization and Applications
IEEE Transactions on Neural Networks
A greedy algorithm for gene selection based on SVM and correlation
International Journal of Bioinformatics Research and Applications
Medical coding classification by leveraging inter-code relationships
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Grafting-light: fast, incremental feature selection and structure learning of Markov random fields
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Sparse learning for support vector classification
Pattern Recognition Letters
Variable selection using random forests
Pattern Recognition Letters
Discriminative semi-supervised feature selection via manifold regularization
IEEE Transactions on Neural Networks
Orientation distance-based discriminative feature extraction for multi-class classification
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Simultaneous feature selection and classification using kernel-penalized support vector machines
Information Sciences: an International Journal
A hierarchical classifier applied to multi-way sentiment detection
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Part-based feature synthesis for human detection
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
The support feature machine for classifying with the least number of features
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Improving the Computational Efficiency of Recursive Cluster Elimination for Gene Selection
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
SVM based feature selection: why are we using the dual?
IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
A sparse nearest mean classifier for high dimensional multi-class problems
Pattern Recognition Letters
Improving accuracy of microarray classification by a simple multi-task feature selection filter
International Journal of Data Mining and Bioinformatics
On the problem of finding the least number of features by L1-norm minimisation
ICANN'11 Proceedings of the 21th international conference on Artificial neural networks - Volume Part I
Expert Systems with Applications: An International Journal
An experimental comparison of gene selection by Lasso and Dantzig selector for cancer classification
Computers in Biology and Medicine
TAKES: a fast method to select features in the kernel space
Proceedings of the 20th ACM international conference on Information and knowledge management
Feature selection based on kernel discriminant analysis
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
Evaluating feature selection for SVMs in high dimensions
ECML'06 Proceedings of the 17th European conference on Machine Learning
A trace compression algorithm targeting power estimation of long benchmarks
Proceedings of the International Conference on Computer-Aided Design
Active learning of combinatorial features for interactive optimization
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Embedded feature selection for support vector machines: state-of-the-art and future challenges
CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Biological specifications for a synthetic gene expression data generation model
WILF'05 Proceedings of the 6th international conference on Fuzzy Logic and Applications
Sparse weighted voting classifier selection and its linear programming relaxations
Information Processing Letters
Motion recognition using local auto-correlation of space-time gradients
Pattern Recognition Letters
Feature selection via dependence maximization
The Journal of Machine Learning Research
EP-GIG priors and applications in bayesian sparse learning
The Journal of Machine Learning Research
Self-taught dimensionality reduction on the high-dimensional small-sized data
Pattern Recognition
Review: Supervised classification and mathematical optimization
Computers and Operations Research
Feature selection for link prediction
Proceedings of the 5th Ph.D. workshop on Information and knowledge
Massively parallel feature selection: an approach based on variance preservation
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Sparse signal recovery by difference of convex functions algorithms
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part II
Sparse activity and sparse connectivity in supervised learning
The Journal of Machine Learning Research
White box radial basis function classifiers with component selection for clinical prediction models
Artificial Intelligence in Medicine
Intelligent Data Analysis
Hi-index | 0.01 |
We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.