Robust estimation in very small samples

Authors:
Peter J. Rousseeuw;Sabine Verboven
Affiliations:
Department of Mathematics and Computer Science, Universitaire Instelling Antwerpen, Universiteitsplein 1, Antwerpen, Belgium;Department of Mathematics and Computer Science, Universitaire Instelling Antwerpen, Universiteitsplein 1, Antwerpen, Belgium
Venue:
Computational Statistics & Data Analysis
Year:
2002

Citing 1
Cited 4

Robust regression and outlier detection

Robust regression and outlier detection

Analysis of minute features in speckled imagery with maximum likelihood estimation

EURASIP Journal on Applied Signal Processing
Evaluation of robust estimators applied to fluorescence assays

EURASIP Journal on Advances in Signal Processing
Noisy time series prediction using M-estimator based robust radial basis function neural networks with growing and pruning techniques

Expert Systems with Applications: An International Journal
A smoothing principle for the Huber and other location M-estimators

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

In experimental science measurements are typically repeated only a few times, yielding a sample size n of the order of 3 to 8. One then wants to summarize the measurements by a central value and measure their variability, i.e. estimate location and scale. These estimates should preferably be robust against outliers, as reflected by their small-sample breakdown value. The estimator's stylized empirical influence function should be smooth, monotone increasing for location, and decreasing-increasing for scale. It turns out that location can be estimated robustly for n ≥ 3, whereas for scale n ≥ 4 is needed. Several well-known robust estimators are studied for small n, yielding some surprising results. For instance, the Hodges-Lehmann estimator equals the average when n=4. Also location M-estimators with auxiliary scale are studied, addressing issues like the difference between one-step and fully iterated M-estimators. Simultaneous M-estimators of location and scale ('Huber's Proposal 2') are considered as well, and it turns out that their lack of robustness is already noticeable for such small samples. Recommendations are given as to which estimators to use.