Application of distributed SVM architectures in classifying forest data cover types

Authors:
Mira Trebar;Nigel Steele
Affiliations:
University of Ljubljana, Faculty of Computer and Information Science, Trzaska 25, 1000 Ljubljana, Slovenia;Department of Mathematical Sciences, Coventry University, Priory Street, Coventry CV1 5FB, UK
Venue:
Computers and Electronics in Agriculture
Year:
2008

Citing 3
Cited 9

The nature of statistical learning theory

The nature of statistical learning theory
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms

Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
Boosting prediction accuracy on imbalanced datasets with SVM ensembles

PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining

Influence of Hyperparameters on Random Forest Accuracy

MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
A Rotary-wing Unmanned Air Vehicle for Aquatic Weed Surveillance and Management

Journal of Intelligent and Robotic Systems
Eggshell crack detection using a wavelet-based support vector machine

Computers and Electronics in Agriculture
Review: Development of soft computing and applications in agricultural and biological engineering

Computers and Electronics in Agriculture
Original paper: Classification of foreign fibers in cotton lint using machine vision and multi-class support vector machine

Computers and Electronics in Agriculture
A comparative study of the multi-objective optimization algorithms for coal-fired boilers

Expert Systems with Applications: An International Journal
Classification of high dimensional and imbalanced hyperspectral imagery data

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Exploring synergetic effects of dimensionality reduction and resampling tools on hyperspectral imagery data classification

MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Inconsistency-based active learning for support vector machines

Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many 'real-world' applications, a classification of large data sets, which are often also imbalanced, is difficult due to the small, but usually more interesting classes. In this study, a large data set, forest cover type classes, which is actually multi-class classification defined with seven imbalanced classes and used as a resource inventory information was analyzed and evaluated. The data set was transformed into seven new data sets and a support vector machine (SVM) was employed to solve a binary classification problem of balanced and imbalanced data sets with various sizes. In the two approaches considered, the use of distributed SVM architectures, which basically reduces the complexity of the quadratic optimization problem of very large data sets, and the use of two sampling approaches for classification of imbalanced data sets were combined and results presented. The experimental results of distributed SVM architectures show the improvement of the accuracy for larger data sets in comparison to a single SVM classifier and their ability to improve the correct classification of the minority class.