Mining Bayesian Network Structure for Large Sets of Variables

  • Authors:
  • Mieczyslaw A. Klopotek

  • Affiliations:
  • -

  • Venue:
  • ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

A well-known problem with Bayesian networks (BN) is the practical limitation for the number of variables for which a Bayesian network can be learned in reasonable time. Even the complexity of simplest tree-like BN learning algorithms is prohibitive for large sets of variables. The paper presents a novel algorithm overcoming this limitation for the tree-like class of Bayesian networks. The new algorithm space consumption grows linearly with the number of variables n while the execution time is proportional to n ln(n), out performing any known algorithm. This opens new perspectives in construction of Bayesian networks from data containing tens of thousands and more variables, e.g. in automatic text categorization.