Information retrieval and classification with wavelets and support vector machines

  • Authors:
  • S. Dormido-Canto;J. Vega;J. Sánchez;G. Farias

  • Affiliations:
  • Dpto. de Informática y Automática – E.T.S.I. Informática, U.N.E.D., Madrid, Spain;Asociación EURATOM/CIEMAT, Madrid, Spain;Dpto. de Informática y Automática – E.T.S.I. Informática, U.N.E.D., Madrid, Spain;Dpto. de Informática y Automática – E.T.S.I. Informática, U.N.E.D., Madrid, Spain

  • Venue:
  • IWINAC'05 Proceedings of the First international work-conference on the Interplay Between Natural and Artificial Computation conference on Artificial Intelligence and Knowledge Engineering Applications: a bioinspired approach - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Since fusion plasma experiment generates hundreds of signals. In analyzing these signals it is important to have automatic mechanisms for searching similarities and retrieving of specific data in the waveform database. Wavelet transform (WT) is a transformation that allows to map signals to spaces of lower dimensionality, that is, a smoothed and compressed version of the original signal. Support vector machine (SVM) is a very effective method for general purpose pattern recognition. Given a set of input vectors which belong to two different classes, the SVM maps the inputs into a high-dimensional feature space through some non-linear mapping, where an optimal separating hyperplane is constructed. This hyperplane minimizes the risk of misclassification and it is determined by a subset of points of the two classes, named support vectors (SV). In this work, the combined use of WT and SVM is proposed for searching and retrieving similar waveforms in the TJ-II database. In a first stage, plasma signals will be preprocessed by WT in order to reduce the dimensionality of the problem and to extract their main features. In the next stage, and using the new smoothed signals produced by the WT, SVM will be applied to show up the efficency of the proposed method to deal with the problem of sorting out thousands of fusion plasma signals.