Filtered statistics

  • Authors:
  • Pawel Terlecki;Hardik Bati;Cesar Galindo-Legaria;Peter Zabback

  • Affiliations:
  • Microsoft Corp., Redmond, WA, USA;Microsoft Corp., Redmond, WA, USA;Microsoft Corp., Redmond, WA, USA;Microsoft Corp., Redmond, WA, USA

  • Venue:
  • Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Column statistics are an important element of cardinality estimation frameworks. More accurate estimates allow the optimizer of a RDBMS to generate better plans and improve the overall system's efficiency. This paper introduces filtered statistics, which model value distribution over a set of rows restricted by a predicate. This feature, available in Microsoft SQL Server, can be used to handle column correlation, as well as focus on interesting data ranges. In particular, it fits well for scenarios with logical subtables, like flexible schema or multi-tenant applications. Integration with the existing cardinality estimation infrastructure is presented.