Quick multivariate kernel density estimation for massive data sets: Research Articles

Authors:
K. F. Cheng;C. K. Chu;Dennis K. J. Lin
Affiliations:
Institute of Statistics, National Central University, Chungli, Taiwan;Department of Applied Mathematics, National Donghwa University, Hualien, Taiwan;Department of Supply Chain and Information Systems, The Pennsylvania State University, University Park, PA 16802, U.S.A.
Venue:
Applied Stochastic Models in Business and Industry - Business, Industry and Government (BIG) Statistics
Year:
2006

Citing 0
Cited 1

Estimation of 2D jump location curve and 3D jump location surface in nonparametric regression

Statistics and Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Massive data sets are becoming popular in this information era. Due to the limitation of computer memory space and the computing time, the kernel density estimation for massive data sets, although strongly demanding, is rather challenging. In this paper, we propose a quick algorithm for multivariate density estimation which is suitable for massive data sets. The term quick is referred to indicate the computing ease. Theoretical properties of the proposed algorithm are developed. Its empirical performance is demonstrated through a credit card example and numerous simulation studies. It is shown that in addition to its computational ease, the proposed algorithm is as good as the traditional methods (for the situations where these traditional methods are feasible). Copyright © 2006 John Wiley & Sons, Ltd.