Crop classification by support vector machine with intelligently selected training data for an operational application

Authors:
Ajay Mathur;Giles M. Foody
Affiliations:
Punjab Remote Sensing Centre, PAU Campus, Ludhiana-141 004, India;School of Geography, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
Venue:
International Journal of Remote Sensing
Year:
2008

Citing 7
Cited 4

Decision Combination in Multiple Classifier Systems

IEEE Transactions on Pattern Analysis and Machine Intelligence
The nature of statistical learning theory

The nature of statistical learning theory
Assessment of the effectiveness of support vector machines for hyperspectral data

Future Generation Computer Systems - Special issue: Geocomputation
Combined SVM-Based Feature Selection and Classification

Machine Learning
Computer Processing of Remotely-Sensed Images: An Introduction

Computer Processing of Remotely-Sensed Images: An Introduction
Harshness in image classification accuracy assessment

International Journal of Remote Sensing
A comparison of methods for multiclass support vector machines

IEEE Transactions on Neural Networks

RVM-based multi-class classification of remotely sensed data

International Journal of Remote Sensing
Increasing the accuracy of neural network classification using refined training data

Environmental Modelling & Software
Original paper: Evaluating high resolution SPOT 5 satellite imagery for crop identification

Computers and Electronics in Agriculture
Support vector machines and object-based classification for obtaining land-use/cover cartography from Hyperion hyperspectral imagery

Computers & Geosciences

Quantified Score

Hi-index	0.00

Visualization

Abstract

The accuracy of a supervised classification is dependent to a large extent on the training data used. The aim in training is often to capture a large training set to fully describe the classes spectrally, commonly with the requirements of a conventional statistical classifier in mind. However, it is not always necessary to provide a complete description of the classes, especially if using a support vector machine (SVM) as the classifier. An SVM seeks to fit an optimal hyperplane between the classes and uses only some of the training samples that lie at the edge of the class distributions in feature space (support vectors). This should allow the definition of the most informative training samples prior to the analysis. An approach to identify informative training samples was demonstrated for the classification of agricultural classes in south-western part of Punjab state, India. A small, intelligently selected, training dataset was acquired in the field with the aid of ancillary information. This dataset contained the data from training sites that were predicted before the classification to be amongst the most informative for an SVM classification. The intelligent training collection scheme yielded a classification of comparable accuracy, ∼91%, to one derived using a larger training set acquired by a conventional approach. Moreover, from inspection of the training sets it was apparent that the intelligently defined training set contained a greater proportion of support vectors (0.70), useful training sites, than that acquired by the conventional approach (0.41). By focusing on the most informative training samples, the intelligent scheme required less investment in training than the conventional approach and its adoption would have reduced the total financial outlay in classification production and evaluation by ∼26%. Additionally, the analysis highlighted the possibility to further reduce the training set size without any significant negative impact on classification accuracy.