PRIM analysis

  • Authors:
  • Wolfgang Polonik;Zailong Wang

  • Affiliations:
  • Department of Statistics, University of California, One Shields Ave., Davis, CA 95616-8705, United States;Novartis Pharmaceuticals Corporation, One Heath Plaza, East Hanover, NJ 07936-1080, United States

  • Venue:
  • Journal of Multivariate Analysis
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper analyzes a data mining/bump hunting technique known as PRIM [1]. PRIM finds regions in high-dimensional input space with large values of a real output variable. This paper provides the first thorough study of statistical properties of PRIM. Amongst others, we characterize the output regions PRIM produces, and derive rates of convergence for these regions. Since the dimension of the input variables is allowed to grow with the sample size, the presented results provide some insight about the qualitative behavior of PRIM in very high dimensions. Our investigations also reveal some shortcomings of PRIM, resulting in some proposals for modifications.