Quick multivariate kernel density estimation for massive data sets: Research Articles

  • Authors:
  • K. F. Cheng;C. K. Chu;Dennis K. J. Lin

  • Affiliations:
  • Institute of Statistics, National Central University, Chungli, Taiwan;Department of Applied Mathematics, National Donghwa University, Hualien, Taiwan;Department of Supply Chain and Information Systems, The Pennsylvania State University, University Park, PA 16802, U.S.A.

  • Venue:
  • Applied Stochastic Models in Business and Industry - Business, Industry and Government (BIG) Statistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Massive data sets are becoming popular in this information era. Due to the limitation of computer memory space and the computing time, the kernel density estimation for massive data sets, although strongly demanding, is rather challenging. In this paper, we propose a quick algorithm for multivariate density estimation which is suitable for massive data sets. The term quick is referred to indicate the computing ease. Theoretical properties of the proposed algorithm are developed. Its empirical performance is demonstrated through a credit card example and numerous simulation studies. It is shown that in addition to its computational ease, the proposed algorithm is as good as the traditional methods (for the situations where these traditional methods are feasible). Copyright © 2006 John Wiley & Sons, Ltd.