Feature selection using linear classifier weights: interaction with classification models

Authors:
Dunja Mladenić;Janez Brank;Marko Grobelnik;Natasa Milic-Frayling
Affiliations:
Jožef Stefan Institute, Ljubljana, Slovenia;Jožef Stefan Institute, Ljubljana, Slovenia;Jožef Stefan Institute, Ljubljana, Slovenia;Microsoft Research Ltd, Cambridge, United Kingdom
Venue:
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2004



Abstract

This paper explores feature scoring and selection based on weights from linear classification models, and investigates how these methods interact with different learning models. Our comparative analysis covers three learning algorithms (Naïve Bayes, Perceptron, and Support Vector Machines (SVM)) in combination with three feature weighting methods: Odds Ratio, Information Gain, and weights from linear models, namely the linear SVM and the Perceptron. Experiments show that feature selection using weights from linear SVMs yields better classification performance than the other feature weighting methods in combination with the three learning algorithms explored. The results support the conjecture that it is the sophistication of the feature weighting method, rather than its apparent compatibility with the learning algorithm, that improves classification performance.
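
The idea of scoring features by linear-model weights can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no procedural details, so the code assumes features are ranked by the absolute value of their weight, uses a plain Perceptron (one of the two linear models named above) rather than a linear SVM to stay dependency-free, and runs on a made-up toy term-count matrix. The names train_perceptron, top_k_features, and project are hypothetical helpers introduced only for this example.

```python
from typing import List, Sequence


def train_perceptron(X: Sequence[Sequence[float]], y: Sequence[int],
                     epochs: int = 20) -> List[float]:
    """Train a plain perceptron (no bias term, for brevity); labels are +1/-1."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for x, label in zip(X, y):
            score = sum(wi * xi for wi, xi in zip(w, x))
            if label * score <= 0:  # misclassified or on the boundary: update
                w = [wi + label * xi for wi, xi in zip(w, x)]
    return w


def top_k_features(w: Sequence[float], k: int) -> List[int]:
    """Assumed scoring rule: rank features by |weight|, keep the k largest."""
    ranked = sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)
    return sorted(ranked[:k])


def project(X: Sequence[Sequence[float]], keep: Sequence[int]) -> List[List[float]]:
    """Restrict every document vector to the selected feature indices."""
    return [[x[j] for j in keep] for x in X]


# Toy data (invented for illustration): rows are documents, columns are terms.
vocab = ["goal", "match", "the", "stock", "market"]
X = [
    [2, 1, 3, 0, 0],  # sports
    [1, 2, 2, 0, 0],  # sports
    [0, 0, 3, 2, 1],  # finance
    [0, 0, 2, 1, 2],  # finance
]
y = [1, 1, -1, -1]

w = train_perceptron(X, y)          # step 1: fit the linear model
keep = top_k_features(w, k=2)       # step 2: score and select features
X_reduced = project(X, keep)        # step 3: reduced representation
selected_terms = [vocab[j] for j in keep]
print(selected_terms)
```

The reduced matrix X_reduced would then be passed to whichever learning algorithm is being evaluated (Naïve Bayes, Perceptron, or SVM in the paper's comparison). The model that supplies the weights and the model trained on the selected features need not be the same, which is the interaction the paper studies.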