Combination of rough sets and genetic algorithms for text classification

Authors:
Rujiang Bai; Xiaoyue Wang; Junhua Liao
Affiliations:
Shandong University of Technology, China; Shandong University of Technology, China; Shandong University of Technology, China
Venue:
AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Year:
2007



Abstract

Automatic categorization of documents into pre-defined taxonomies is a crucial step in data mining and knowledge discovery. Standard machine learning techniques such as support vector machines (SVM) and related large-margin methods have been applied successfully to this task. However, the high dimensionality of the input feature vectors slows classification, while the kernel parameter settings chosen during training and the choice of features both affect classification accuracy. The objective of this work is to reduce the dimension of the feature vectors and to optimize the parameters so as to improve both the accuracy and the speed of SVM classification. To improve classification speed, we use rough set theory to reduce the feature vector space. To improve classification accuracy, we present a genetic algorithm approach for feature selection and parameter optimization. Experimental results indicate that our method is more effective than traditional SVM methods and other traditional methods.
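
As an illustration of how rough set theory can shrink a feature space, the sketch below computes the rough-set dependency degree (the fraction of documents in the positive region) and greedily builds a reduced attribute set. The abstract does not say which reduction algorithm the authors used, so the greedy QuickReduct-style search, the function names, and the toy term-presence table are all assumptions made for this example.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())


def dependency(rows, labels, attrs):
    """Dependency degree: share of rows whose equivalence class has one label."""
    if not rows:
        return 0.0
    positive = 0
    for block in partition(rows, attrs):
        if len({labels[i] for i in block}) == 1:
            positive += len(block)
    return positive / len(rows)


def quick_reduct(rows, labels):
    """Greedy heuristic (assumed here, not taken from the paper): keep adding
    the attribute that raises the dependency degree most, until the subset is
    as discriminating as the full attribute set."""
    all_attrs = list(range(len(rows[0])))
    target = dependency(rows, labels, all_attrs)
    chosen = []
    while dependency(rows, labels, chosen) < target:
        best = max(
            (a for a in all_attrs if a not in chosen),
            key=lambda a: dependency(rows, labels, chosen + [a]),
        )
        chosen.append(best)
    return sorted(chosen)


# Hypothetical toy data: each row marks the presence (1) or absence (0)
# of four terms in a document; labels are made-up categories.
rows = [
    (1, 1, 0, 0),
    (1, 1, 1, 0),
    (1, 0, 0, 0),
    (0, 1, 1, 0),
    (0, 0, 1, 0),
    (1, 0, 1, 0),
]
labels = ["A", "A", "B", "B", "B", "B"]

reduct = quick_reduct(rows, labels)
print(reduct)  # [0, 1]
```

In this toy table, terms 0 and 1 together separate the two categories as well as all four terms do, so only those two features would be passed on to the classifier. A greedy search of this kind is not guaranteed to find a minimal reduct in general.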