Learning figures with the hausdorff metric by fractals

  • Authors:
  • Mahito Sugiyama;Eiju Hirowatari;Hideki Tsuiki;Akihiro Yamamoto

  • Affiliations:
  • Graduate School of Informatics, Kyoto University, Kyoto, Japan and Japan Society for the Promotion of Science;Center for Fundamental Education, The University of Kitakyushu;Graduate School of Human and Environmental Studies, Kyoto University;Graduate School of Informatics, Kyoto University, Kyoto, Japan

  • Venue:
  • ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Discretization is a fundamental process for machine learning from analog data such as continuous signals. For example, the discrete Fourier analysis is one of the most essential signal processing methods for learning or recognition from continuous signals. However, only the direction of the time axis is discretized in the method, meaning that each datum is not purely discretized. To give a completely computational theoretical basis for machine learning from analog data, we construct a learning framework based on the Gold-style learning model. Using a modern mathematical computability theory in the field of Computable Analysis, we show that scalable sampling of analog data can be formulated as effective Gold-style learning. On the other hand, recursive algorithms are a key expression for models or rules explaining analog data. For example, FFT (Fast Fourier Transformation) is a fundamental recursive algorithm for discrete Fourier analysis. In this paper we adopt fractals, since they are general geometric concepts of recursive algorithms, and set learning objects as nonempty compact sets in the Euclidean space, called figures, in order to introduce fractals into Gold-style learning model, where the Hausdorff metric can be used to measure generalization errors. We analyze learnable classes of figures from informants (positive and negative examples) and from texts (positive examples), and reveal the hierarchy of learnabilities under various learning criteria. Furthermore, we measure the number of positive examples, one of complexities of learning, by using the Hausdorff dimension, which is the central concept of Fractal Geometry, and the VC dimension, which is used to measure the complexity of classes of hypotheses in the Valiant-style learning model. This work provides theoretical support for machine learning from analog data.