Aprimorando processos de imputação multivariada de dados com workflows

  • Authors:
  • Rafael Castaneda;Claudia Ferlin;Ronaldo Goldschmidt;Jorge de Abreu Soares;Luis Alfredo V. de Carvalho;Ricardo Choren

  • Affiliations:
  • Instituto Militar de Engenharia (IME), Pça General Tiburcio, Rio de Janeiro -- RJ -- Brazil;Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro -- RJ -- Brazil;Instituto Militar de Engenharia (IME), Pça General Tiburcio, Rio de Janeiro -- RJ -- Brazil;CEFET-RJ -- Centro Federal de Educação Tecnológica Celso Suckow da Fonseca, Maracanã -- Rio de Janeiro -- RJ -- Brazil;Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro -- RJ -- Brazil;Instituto Militar de Engenharia (IME), Pça General Tiburcio, Rio de Janeiro -- RJ -- Brazil

  • Venue:
  • SBBD '08 Proceedings of the 23rd Brazilian symposium on Databases
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Knowledge discovery in databases usually face the problem of missing values. Thus there are several preprocessing mechanisms that aim to make data imputation. However, these mechanisms normally deal with univariate cases, i.e. cases that present missing values in only one column. Iterative imputation mechanisms are capable of dealing with cases that present missing values in several columns, imputing values for one column at a time, but offer several implementation possibilities, from which the data analists find it difficult to choose. This paper presents a workflow-based platform to allow the easy setup, experimentation, and analisys of several iterative imputation techniques. It shows the usage of the platform and a sample experiment.