DTU: A Decision Tree for Uncertain Data

  • Authors:
  • Biao Qin;Yuni Xia;Fang Li

  • Affiliations:
  • Department of Computer and Information Science, Indiana University - Purdue University Indianapolis, USA;Department of Computer and Information Science, Indiana University - Purdue University Indianapolis, USA;Department of Mathematics, Indiana University - Purdue University Indianapolis, USA

  • Venue:
  • PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Decision Tree is a widely used data classification technique. This paper proposes a decision tree based classification method on uncertain data. Data uncertainty is common in emerging applications, such as sensor networks, moving object databases, medical and biological bases. Data uncertainty can be caused by various factors including measurements precision limitation, outdated sources, sensor errors, network latency and transmission problems. In this paper, we enhance the traditional decision tree algorithms and extend measures, including entropy and information gain, considering the uncertain data interval and probability distribution function. Our algorithm can handle both certain and uncertain datasets. The experiments demonstrate the utility and robustness of the proposed algorithm as well as its satisfactory prediction accuracy.