Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
Towards language independent automated learning of text categorization models
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
Learning to extract symbolic knowledge from the World Wide Web
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Advances in kernel methods: support vector learning
Advances in kernel methods: support vector learning
Making large-scale support vector machine learning practical
Advances in kernel methods
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Neural Networks: A Comprehensive Foundation
Neural Networks: A Comprehensive Foundation
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Effective Methods for Improving Naive Bayes Text Classifiers
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
A Multilingual Text Mining Approach Based on Self-Organizing Maps
Applied Intelligence
On Machine Learning Methods for Chinese Document Categorization
Applied Intelligence
Authorship Attribution with Support Vector Machines
Applied Intelligence
Fast and accurate text classification via multiple linear discriminant projections
The VLDB Journal — The International Journal on Very Large Data Bases
Spam filters: bayes vs. chi-squared; letters vs. words
ISICT '03 Proceedings of the 1st international symposium on Information and communication technologies
A class of edit kernels for SVMs to predict translation initiation sites in eukaryotic mRNAs
RECOMB '04 Proceedings of the eighth annual international conference on Resaerch in computational molecular biology
Kernel Methods for Pattern Analysis
Kernel Methods for Pattern Analysis
Introduction to Machine Learning (Adaptive Computation and Machine Learning)
Introduction to Machine Learning (Adaptive Computation and Machine Learning)
A Hierarchical Neural Network Document Classifier with Linguistic Feature Selection
Applied Intelligence
An Optimization Method for Selecting Parameters in Support Vector Machines
ICMLA '07 Proceedings of the Sixth International Conference on Machine Learning and Applications
Text categorization via generalized discriminant analysis
Information Processing and Management: an International Journal
Semi-supervised Classification from Discriminative Random Walks
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
IEEE Transactions on Knowledge and Data Engineering
Estimation of individual prediction reliability using the local sensitivity analysis
Applied Intelligence
Text classification from unlabeled documents with bootstrapping and feature projection techniques
Information Processing and Management: an International Journal
Expert Systems with Applications: An International Journal
Feature selection for text classification with Naïve Bayes
Expert Systems with Applications: An International Journal
Distributional Features for Text Categorization
IEEE Transactions on Knowledge and Data Engineering
Using the self organizing map for clustering of text documents
Expert Systems with Applications: An International Journal
A new maximal-margin spherical-structured multi-class support vector machine
Applied Intelligence
A dynamic holding strategy in public transit systems with real-time information
Applied Intelligence
Discovering domain-specific composite kernels
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Analysis of the distance between two classes for tuning SVM hyperparameters
IEEE Transactions on Neural Networks
An SVM-based machine learning method for accurate internet traffic classification
Information Systems Frontiers
Parameters optimization of support vector machine based on simulated annealing and genetic algorithm
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Automatically computed document dependent weighting factor facility for Naïve Bayes classification
Expert Systems with Applications: An International Journal
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Random projections for linear SVM ensembles
Applied Intelligence
Building a qualitative recruitment system via SVM with MCDM approach
Applied Intelligence
Recognition of Arabic (Indian) bank check digits using log-gabor filters
Applied Intelligence
A comparison of methods for multiclass support vector machines
IEEE Transactions on Neural Networks
Expert Systems with Applications: An International Journal
A multi-threshold segmentation approach based on Artificial Bee Colony optimization
Applied Intelligence
Accelerated max-margin multiple kernel learning
Applied Intelligence
Expert Systems with Applications: An International Journal
The decomposed k-nearest neighbor algorithm for imbalanced text classification
FGIT'12 Proceedings of the 4th international conference on Future Generation Information Technology
A semantic social network-based expert recommender system
Applied Intelligence
A distance sum-based hybrid method for intrusion detection
Applied Intelligence
A belief classification rule for imprecise data
Applied Intelligence
Least squares twin parametric-margin support vector machine for classification
Applied Intelligence
An SVM-AdaBoost facial expression recognition system
Applied Intelligence
Hi-index | 0.00 |
This paper presents the implementation of a new text document classification framework that uses the Support Vector Machine (SVM) approach in the training phase and the Euclidean distance function in the classification phase, coined as Euclidean-SVM. The SVM constructs a classifier by generating a decision surface, namely the optimal separating hyper-plane, to partition different categories of data points in the vector space. The concept of the optimal separating hyper-plane can be generalized for the non-linearly separable cases by introducing kernel functions to map the data points from the input space into a high dimensional feature space so that they could be separated by a linear hyper-plane. This characteristic causes the implementation of different kernel functions to have a high impact on the classification accuracy of the SVM. Other than the kernel functions, the value of soft margin parameter, C is another critical component in determining the performance of the SVM classifier. Hence, one of the critical problems of the conventional SVM classification framework is the necessity of determining the appropriate kernel function and the appropriate value of parameter C for different datasets of varying characteristics, in order to guarantee high accuracy of the classifier. In this paper, we introduce a distance measurement technique, using the Euclidean distance function to replace the optimal separating hyper-plane as the classification decision making function in the SVM. In our approach, the support vectors for each category are identified from the training data points during training phase using the SVM. In the classification phase, when a new data point is mapped into the original vector space, the average distances between the new data point and the support vectors from different categories are measured using the Euclidean distance function. The classification decision is made based on the category of support vectors which has the lowest average distance with the new data point, and this makes the classification decision irrespective of the efficacy of hyper-plane formed by applying the particular kernel function and soft margin parameter. We tested our proposed framework using several text datasets. The experimental results show that this approach makes the accuracy of the Euclidean-SVM text classifier to have a low impact on the implementation of kernel functions and soft margin parameter C.