An efficient algorithm for sequential random sampling

  • Authors: Jeffrey Scott Vitter
  • Affiliations: Brown Univ., Providence, RI
  • Venue: ACM Transactions on Mathematical Software (TOMS)
  • Year: 1987

Abstract

We examine several methods for drawing a sequential random sample of n records from a file containing N records. Method D is recommended for general use. The algorithm is on-line (so that CPU time can be overlapped with I/O), has a small constant memory requirement, and is easy to program. An improved implementation is detailed in the Appendix.
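
To make the problem statement concrete, here is a minimal Python sketch of sequential sampling. It is not Method D, which the abstract does not spell out. It is the simpler selection-sampling technique (Algorithm S in Knuth's The Art of Computer Programming), which examines every record and picks each one with probability (records still needed) / (records still unread). The function name `sequential_sample` and its parameters are hypothetical, chosen for illustration only.

```python
import random


def sequential_sample(records, n, N, rng=random):
    """Yield n of the N records in `records`, preserving file order.

    Baseline selection sampling, NOT the paper's Method D: every record
    is read once and one uniform random number is drawn per record.
    """
    if not 0 <= n <= N:
        raise ValueError("need 0 <= n <= N")
    still_needed = n   # records left to select
    still_unread = N   # records left in the file, including the current one
    for record in records:
        if still_needed == 0:
            break
        # Select the current record with probability still_needed / still_unread.
        if rng.random() * still_unread < still_needed:
            yield record
            still_needed -= 1
        still_unread -= 1


# Usage: draw 5 of 20 record numbers, in increasing order.
sample = list(sequential_sample(range(20), 5, 20))
```

Like the algorithm the abstract recommends, this sketch is on-line and uses constant memory, but it reads all N records and draws a random number for each one it reads. Reducing that per-record work is the kind of cost a more efficient method has to address.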