Model Diagnostics for Remote Access Regression Servers

  • Authors:
  • Jerome P. Reiter

  • Affiliations:
  • Institute of Statistics and Decision Sciences, Box 90251, Duke University, Durham, NC 27708, USA

  • Venue:
  • Statistics and Computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

To protect public-use microdata, one approach is not to allow users access to the microdata. Instead, users submit analyses to a remote computer that reports back basic output from the fitted model, such as coefficients and standard errors. To be most useful, this remote server also should provide some way for users to check the fit of their models, without disclosing actual data values. This paper discusses regression diagnostics for remote servers. The proposal is to release synthetic diagnostics—i.e. simulated values of residuals and dependent and independent variables–constructed to mimic the relationships among the real-data residuals and independent variables. Using simulations, it is shown that the proposed synthetic diagnostics can reveal model inadequacies without substantial increase in the risk of disclosures. This approach also can be used to develop remote server diagnostics for generalized linear models.