A search space reduction methodology for large databases: a case study

  • Authors:
  • Angel Kuri-Morales;Fátima Rodríguez-Erazo

  • Affiliations:
  • Departamento de Computación, Instituto Tecnológico Autónomo de México;Posgrado en Ciencias e Ingeniería de la Computacion, Universidad Nacional Autónoma de México, Mexico

  • Venue:
  • ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given the present need for Customer Relationship and the increased growth of the size of databases, many new approaches to large database clustering and processing have been attempted. In this work we propose a methodology based on the idea that statistically proven search space reduction is possible in practice. Two clustering models are generated: one corresponding to the full data set and another pertaining to the sampled data set. The resulting empirical distributions were mathematically tested to verify a tight non-linear significant approximation.