GRAFT, a complete system for data fusion

  • Authors:
  • Tomís Aluja-Banet;Josep Daunis-i-Estadella;David Pellicer

  • Affiliations:
  • Universitat Politècnica de Catalunya, Jordi Girona Salgado 1-3, CN C5204, E-08034 Barcelona, Spain;Universitat de Girona, Campus de Montilivi, Edifici P4, E-17071 Girona, Spain;TNS Audiencia de Medios, Camí de Can Calders núm. 4, E-08173 St. Cugat del Vallès, Spain

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.03

Visualization

Abstract

Data fusion concerns the problem of merging information coming from independent sources. Also known as statistical matching, file grafting or microdata merging, it is a challenging problem for statisticians. The increasing growth of collected data makes combining different sources of information an attractive alternative to single source data. The interest in data fusion derives, in certain cases, from the impossibility of attaining specific information from one source of data and the reduction of the cost entailed by this operation and, in all cases, from taking greater advantage of the available collected information. The GRAFT system is presented. It is a multipurpose data fusion system based on the k-nearest neighbor (k-nn) hot deck imputation method. The system aim is to cope with many data fusion problems and domains. The k-nn is a very demanding algorithm. The solutions envisaged and their cost, which allow this methodology to be used in a wide range of real problems, are presented.