Generalization versus classification

  • Authors:
  • Rolf Wiehagen;Carl H. Smith

  • Affiliations:
  • Department of Informatics, Institute for Theoretical Informatics, Humboldt University, 1086 Berlin;Department of Computer Science, Institute for Advanced Computer Studies, University of Maryland, College Park, MD

  • Venue:
  • COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

Generalization is a learning problem that has received considerable attention. The generalization problem is to take a finite sample of some concept and produce an algorithm that can produce all other (perhaps infinitely many) samples of the same concept. Inductive inference is the study of this problem in a most general framework [1]. The classification problem is to take a finite sample of some concept and decide which type of concept the sample is from. The choice of type is usually finite. If the mechanism performing the classification is limiting, e.g., it makes more and more conjectures as to a classification as time goes on, then the process can also be considered as a type of learning. Roughly, we will say that some suitable mechanism has learned an appropriate classification if its sequence of conjectures stabilizes at some point. In this paper we formalize, at a suitable level of abstraction, the classification problem and rigorously compare it to the generalization problem. Despite some obvious similarities, the two notions are shown to be distinct. The new formalism of classification is investigated further.