Random sampling with a reservoir
ACM Transactions on Mathematical Software (TOMS)
The art of computer programming, volume 2 (3rd ed.): seminumerical algorithms
The art of computer programming, volume 2 (3rd ed.): seminumerical algorithms
Hi-index | 0.01 |
This paper describes a simple extension to the reservoir sampling algorithm to allow its use with ranked records. Here the fixed-sized sample must include records in order of rank, but a fair selection must occur to choose the lowest-ranked records included. The result is produced after a single pass through the records. Copyright © 2007 John Wiley & Sons, Ltd.