A search space reduction methodology for data mining in large databases

  • Authors:
  • Angel Kuri-Morales;Fátima Rodríguez-Erazo

  • Affiliations:
  • Department of Computer Science, Instituto Tecnologico Autonomo de Mexico, Rio Hondo No. 1, Col. Tizapan San Angel, C.P. 01000 México D.F., Mexico;Posgrado en Ciencia e Ingeniería de la Computación, Universidad Nacional Autónoma de México, Ciudad Universitaria, Del. Coyoacán, C.P. 04510 México D.F., Mexico

  • Venue:
  • Engineering Applications of Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given the present need for Customer Relationship and the increased growth of the size of databases, many new approaches to large database clustering and processing have been attempted. In this work, we propose a methodology based on the idea that statistically proven search space reduction is possible in practice. Two clustering models are generated: one corresponding to the full data set and another pertaining to the sampled data set. The resulting empirical distributions were mathematically tested to verify a tight non-linear significant approximation.