uRule: A Rule-Based Classification System for Uncertain Data

Authors:
Biao Qin;Yuni Xia;Rakesh Sathyesh;Sunil Prabhakar;Yicheng Tu
Venue:
ICDMW '10 Proceedings of the 2010 IEEE International Conference on Data Mining Workshops
Year:
2010

Abstract

Data uncertainty is common in real-world applications. It arises from imprecise measurements, network latency, outdated sources, and sampling errors, among other causes. Such uncertainty must be handled carefully, or data mining results may be unreliable or wrong. In this demo, we present uRule, a new rule-based classification and prediction system for uncertain data. The system uses new measures for generating, pruning, and optimizing classification rules. These measures are computed from the intervals and probability distribution functions of the uncertain data. Based on them, the optimal splitting attributes and splitting values can be identified and used in classification rules. uRule handles uncertainty in both numerical and categorical data, and it achieves satisfactory classification performance even when the data is highly uncertain.
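
To make the idea of choosing a splitting value over uncertain intervals concrete, here is a minimal sketch. It is not the uRule implementation: the abstract does not define its measures, so the sketch assumes that each uncertain numerical value is an interval with a uniform probability distribution, and it substitutes an information gain computed from fractional (probabilistic) class counts. The function names mass_below, entropy, probabilistic_info_gain, and best_split are hypothetical.

```python
import math
from collections import defaultdict


def mass_below(lo, hi, split):
    """Probability that a value uniform on [lo, hi] is <= split (assumed pdf)."""
    if hi == lo:  # a certain (point) value
        return 1.0 if lo <= split else 0.0
    return min(1.0, max(0.0, (split - lo) / (hi - lo)))


def entropy(class_mass):
    """Shannon entropy (bits) of a dict mapping class label -> fractional count."""
    total = sum(class_mass.values())
    if total == 0:
        return 0.0
    return -sum((m / total) * math.log2(m / total)
                for m in class_mass.values() if m > 0)


def probabilistic_info_gain(instances, split):
    """Information gain of splitting at `split`.

    Each instance is (lo, hi, label). An instance contributes the fraction of
    its probability mass lying on each side of the split, not a whole count.
    """
    parent, left, right = defaultdict(float), defaultdict(float), defaultdict(float)
    for lo, hi, label in instances:
        p = mass_below(lo, hi, split)
        parent[label] += 1.0
        left[label] += p
        right[label] += 1.0 - p
    n = sum(parent.values())
    n_left, n_right = sum(left.values()), sum(right.values())
    return (entropy(parent)
            - (n_left / n) * entropy(left)
            - (n_right / n) * entropy(right))


def best_split(instances):
    """Try every interval endpoint as a candidate and keep the highest gain."""
    candidates = sorted({e for lo, hi, _ in instances for e in (lo, hi)})
    return max(candidates, key=lambda s: probabilistic_info_gain(instances, s))


if __name__ == "__main__":
    data = [(0, 2, "a"), (1, 3, "a"), (4, 6, "b"), (5, 7, "b")]
    s = best_split(data)
    print(s, probabilistic_info_gain(data, s))
```

Restricting the candidates to interval endpoints keeps the search small. A different probability distribution would only require replacing mass_below with the corresponding cumulative distribution function.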