Improving compressed counting

  • Authors:
  • Ping Li

  • Affiliations:
  • Cornell University, Ithaca, NY

  • Venue:
  • UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Compressed Counting (CC) [22] was recently proposed for estimating the αth frequency moments of data streams, where 0 This paper presents a new algorithm for improving CC. The improvement is most substantial when α → 1--. For example, when α = 0.99, the new algorithm reduces the estimation variance roughly by 100-fold. This new algorithm would make CC considerably more practical for estimating Shannon entropy. Furthermore, the new algorithm is statistically optimal when α = 0.5.