Principles of multi-kernel data mining

  • Authors:
  • Vadim Mottl;Olga Krasotkina;Oleg Seredin;Ilya Muchnik

  • Affiliations:
  • Computing Center of the Russian Academy of Sciences, Moscow, Russia;Computing Center of the Russian Academy of Sciences, Moscow, Russia;Computing Center of the Russian Academy of Sciences, Moscow, Russia;DIMACS, Rutgers University, Piscataway, NJ

  • Venue:
  • MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The scientific community has accumulated an immense experience in processing data represented in finite-dimensional linear spaces of numerical features of entities, whereas the kit of mathematical instruments for dissimilarity-based processing of data in metric spaces representing distances between entities, for which sufficiently informative features cannot be found, is much poorer. In this work, the problem of embedding the given set of entities into a linear space with inner product by choosing an appropriate kernel function is considered as the major challenge in the featureless approach to estimating dependences in data sets of arbitrary kind. As a rule, several kernels may be heuristically suggested within the bounds of the same data analysis problem. We treat several kernels on a set of entities as Cartesian product of the respective number of linear spaces, each supplied with a specific kernel function as a specific inner product. The main requirement here is to avoid discrete selection in eliminating redundant kernels with the purpose of achieving acceptable computational complexity of the fusion algorithm.