Rule Learning with Probabilistic Smoothing

  • Authors:
  • Gianni Costa;Massimo Guarascio;Giuseppe Manco;Riccardo Ortale;Ettore Ritacco

  • Affiliations:
  • ICAR-CNR, Rende, Italy 87036;ICAR-CNR, Rende, Italy 87036;ICAR-CNR, Rende, Italy 87036;ICAR-CNR, Rende, Italy 87036;ICAR-CNR, Rende, Italy 87036

  • Venue:
  • DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2009
  • Data preparation techniques for improving rare class prediction

    MAMECTIS/NOLASC/CONTROL/WAMUS'11 Proceedings of the 13th WSEAS international conference on mathematical methods, computational techniques and intelligent systems, and 10th WSEAS international conference on non-linear analysis, non-linear systems and chaos, and 7th WSEAS international conference on dynamical systems and control, and 11th WSEAS international conference on Wavelet analysis and multirate systems: recent researches in computational techniques, non-linear systems and control

Quantified Score

Hi-index 0.00

Visualization

Abstract

A hierarchical classification framework is proposed for discriminating rare classes in imprecise domains, characterized by rarity (of both classes and cases), noise and low class separability. The devised framework couples the rules of a rule-based classifier with as many local probabilistic generative models. These are trained over the coverage of the corresponding rules to better catch those globally rare cases/classes that become less rare in the coverage. Two novel schemes for tightly integrating rule-based and probabilistic classification are introduced, that classify unlabeled cases by considering multiple classifier rules as well as their local probabilistic counterparts. An intensive evaluation shows that the proposed framework is competitive and often superior in accuracy w.r.t. established competitors, while overcoming them in dealing with rare classes.