Discovering Classification from Data of Multiple Sources

  • Authors:
  • Charles X. Ling;Qiang Yang

  • Affiliations:
  • Department of Computer Science, University of Western Ontario, London, Canada N6A 5B7;Department of Computer Science, Hong Kong UST, Kowloon, Hong Kong

  • Venue:
  • Data Mining and Knowledge Discovery
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In many large e-commerce organizations, multiple data sources are often used to describe the same customers, thus it is important to consolidate data of multiple sources for intelligent business decision making. In this paper, we propose a novel method that predicts the classification of data from multiple sources without class labels in each source. We test our method on artificial and real-world datasets, and show that it can classify the data accurately. From the machine learning perspective, our method removes the fundamental assumption of providing class labels in supervised learning, and bridges the gap between supervised and unsupervised learning.