A metropolis sampling method for drawing representative samples from large databases

  • Authors:
  • Hong Guo;Wen-Chi Hou;Feng Yan;Qiang Zhu

  • Affiliations:
  • Department of Computer Science, Southern Illinois University, Carbondale, IL;Department of Computer Science, Southern Illinois University, Carbondale, IL;Department of Computer Science, Southern Illinois University, Carbondale, IL;Dept. of Computer & Info. Science, Michigan University-Dearborn, Dearborn, MI

  • Venue:
  • DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a sampling method based on the Metropolis algorithm is proposed. It is able to draw samples that have the same distribution as the underlying probability distribution. It is a simple, efficient, and powerful method suitable for all distributions. We have performed experiments to examine the qualities of the samples by comparing their statistical properties with the underlying population. The experimental results show that the samples selected by our method are bona fide representative.