Multiple intervals versus smoothing of boundaries in the discretization of performance indicators used for diagnosis in cellular networks

  • Authors:
  • Raquel Barco;Pedro Lázaro;Luis Díez;Volker Wille

  • Affiliations:
  • Departamento de Ingeniería de Comunicaciones, Universidad de Málaga, Málaga, Spain;Departamento de Ingeniería de Comunicaciones, Universidad de Málaga, Málaga, Spain;Departamento de Ingeniería de Comunicaciones, Universidad de Málaga, Málaga, Spain;Nokia Networks, Performance Services, Cambridge, UK

  • Venue:
  • ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part IV
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most real-world applications of diagnosis involve continuous-valued attributes, which are normally discretized before the existing classification algorithms are applied. The discretization may be based on data or on human expertise. In cellular networks the number of classified examples is very limited. Thus, the diagnosis experts should specify the boundaries of the intervals for each discretized symptom. The large number of values makes it difficult to specify precise parameters. Even if boundaries are obtained from classified examples, due to the limited number of cases, the obtained values are not very accurate. In this paper two techniques to improve the performance of diagnosis systems based on Bayesian Networks are compared. Some empirical results are presented for diagnosis in a GSM network. The first method, Smooth Bayesian Networks, is shown to be more robust to imprecise setting of boundaries. The second method, Multiple Uniform Intervals, is superior if accurately defined boundaries are available.