The cost of using decomposable Gaussian graphical models for computational convenience

  • Authors:
  • A. Marie Fitch;Beatrix Jones

  • Affiliations:
  • -;-

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2012

Quantified Score

Hi-index 0.03

Visualization

Abstract

Graphical models are a powerful tool for describing patterns of conditional independence, and can also be used to regularize the covariance matrix. Vertices in the graph represent variables, and in the Gaussian setting, edges between vertices are equivalent to non-zero elements in the inverse covariance matrix. Models that can be represented as a decomposable (triangulated) graph are more computationally tractable; in fact, in the high-dimensional Bayesian setting it is common to restrict model selection procedures to decomposable models. We consider estimation of the covariance and inverse covariance matrix where the true model forms a cycle, but estimation is performed supposing that the pattern of zeros is a decomposable graphical model, where the elements restricted to zero are a subset of those in the true matrix. The variance of the maximum likelihood estimator based on the decomposable model is demonstrably larger than for the true non-decomposable model, and which decomposable model is selected affects the variance of particular elements of the matrix. When estimating the inverse covariance matrix the cost in terms of accuracy for using the decomposable model is fairly small, even when the difference in sparsity is large and the sample size is fairly small (e.g., the true model is a cycle of size 50, and the sample size is 51). However, when estimating the covariance matrix, the estimators for most elements had a dramatic increase in variance (200-fold in some cases) when a decomposable model was substituted. These increases become more pronounced as the difference in sparsity between models increases.