Global and local choice of the number of nearest neighbors in locally linear embedding

  • Authors:
  • Andrés Álvarez-Meza;Juliana Valencia-Aguirre;Genaro Daza-Santacoloma;Germán Castellanos-Domínguez

  • Affiliations:
  • Signal Processing and Recognition Group, Universidad Nacional de Colombia, Sede Manizales, Km. 9 vía Al Aeropuerto, Campus La Nubia, Manizales, Colombia;Signal Processing and Recognition Group, Universidad Nacional de Colombia, Sede Manizales, Km. 9 vía Al Aeropuerto, Campus La Nubia, Manizales, Colombia;Signal Processing and Recognition Group, Universidad Nacional de Colombia, Sede Manizales, Km. 9 vía Al Aeropuerto, Campus La Nubia, Manizales, Colombia;Signal Processing and Recognition Group, Universidad Nacional de Colombia, Sede Manizales, Km. 9 vía Al Aeropuerto, Campus La Nubia, Manizales, Colombia

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Quantified Score

Hi-index 0.10

Visualization

Abstract

The crux in the locally linear embedding algorithm is the selection of the number of nearest neighbors k. Some previous techniques have been developed for finding this parameter based on embedding quality measures. Nevertheless, they do not achieve suitable results when they are tested on several kind of manifolds. In this work is presented a new method for automatically computing the number of neighbors by means of analyzing global and local properties of the embedding results. Besides, it is also proposed a second strategy for choosing the parameter k, on manifolds where the density and the intrinsic dimensionality of the neighborhoods are changeful. The first proposed technique, called preservation neighborhood error, calculates a unique value of k for the whole manifold. Moreover, the second method, named local neighborhood selection, computes a suitable number of neighbors for each sample point in the manifold. The methodologies were tested on artificial and real-world datasets which allow us to visually confirm the quality of the embedding. According to the results our methods aim to find suitable values of k and appropriated embeddings.