How good are the bayesian information criterion and the minimum description length principle for model selection? a bayesian network analysis

  • Authors:
  • Nicandro Cruz-Ramírez;Héctor-Gabriel Acosta-Mesa;Rocío-Erandi Barrientos-Martínez;Luis-Alonso Nava-Fernández

  • Affiliations:
  • Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Xalapa, Veracruz, México;Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Xalapa, Veracruz, México;Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Xalapa, Veracruz, México;Instituto de Investigaciones en Educación, Universidad Veracruzana, Xalapa, Veracruz, México

  • Venue:
  • MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Bayesian Information Criterion (BIC) and the Minimum Description Length Principle (MDL) have been widely proposed as good metrics for model selection. Such scores basically include two terms: one for accuracy and the other for complexity. Their philosophy is to find a model that rightly balances these terms. However, it is surprising that both metrics do often not work very well in practice for they overfit the data. In this paper, we present an analysis of the BIC and MDL scores using the framework of Bayesian networks that supports such a claim. To this end, we carry out different tests that include the recovery of gold-standard network structures as well as the construction and evaluation of Bayesian network classifiers. Finally, based on these results, we discuss the disadvantages of both metrics and propose some future work to examine these limitations more deeply.