Handling missing attribute values in preterm birth data sets

  • Authors:
  • Jerzy W. Grzymala-Busse;Linda K. Goodwin;Witold J. Grzymala-Busse;Xinqun Zheng

  • Affiliations:
  • Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS;Nursing Informatics Program, Duke University, Durham, NC;Filterlogix, Lawrence, KS;PC Sprint, Overland Park, KS

  • Venue:
  • RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
  • Year:
  • 2005

Abstract

The objective of our research was to find the best approach to handling missing attribute values in data sets describing preterm birth, provided by Duke University. Five strategies were used for filling in missing attribute values, based on most common values and closest fit for symbolic attributes, averages for numerical attributes, and a special approach to induce only certain rules from specified information using the MLEM2 approach. We conclude that the best strategy was to use the global most common method for symbolic attributes and the global average method for numerical attributes.
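
The strategy the abstract reports as best can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that "global" means the statistic is computed over the whole data set rather than within one outcome class, that a missing value is marked with "?", and that records are plain dictionaries. The function name fill_global and the attribute names smoker and weight are hypothetical and chosen only for illustration.

```python
from collections import Counter
from statistics import mean

MISSING = "?"  # assumed marker for a missing attribute value


def fill_global(rows, numeric_attrs):
    """Return a copy of rows with missing values filled in.

    Symbolic attributes receive the most common known value over the whole
    data set; numerical attributes receive the average of the known values.
    """
    if not rows:
        return []
    fills = {}
    for attr in rows[0]:
        known = [r[attr] for r in rows if r[attr] != MISSING]
        if not known:
            continue  # no known value to learn from; leave the attribute as-is
        if attr in numeric_attrs:
            fills[attr] = mean(float(v) for v in known)
        else:
            # On a tie, Counter returns the value that was seen first.
            fills[attr] = Counter(known).most_common(1)[0][0]
    return [
        {a: (fills.get(a, v) if v == MISSING else v) for a, v in r.items()}
        for r in rows
    ]


# Hypothetical toy records, not taken from the preterm birth data sets.
rows = [
    {"smoker": "no", "weight": 60.0},
    {"smoker": "?", "weight": 70.0},
    {"smoker": "no", "weight": "?"},
    {"smoker": "yes", "weight": 80.0},
]
filled = fill_global(rows, numeric_attrs={"weight"})
```

In this toy example the missing smoker value becomes "no", the most frequent known value, and the missing weight becomes the average of the three known weights. The input rows are left unchanged, so the original and filled data sets can be compared side by side.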