Multiple instance learning of real valued data

  • Authors:
  • Daniel R. Dooly;Qi Zhang;Sally A. Goldman;Robert A. Amar

  • Affiliations:
  • Department of Computer Science, Southern Illinois University Edwardsville, Edwardsville, IL;Department of Computer Science and Engineering, Washington University, St. Louis, MO;Department of Computer Science and Engineering, Washington University, St. Louis, MO;Georgia Institute of Technology, College of Computing, 801 Atlantic Drive, Atlanta, GA

  • Venue:
  • The Journal of Machine Learning Research
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The multiple-instance learning model has received much attention recently with a primary application area being that of drug activity prediction. Most prior work on multiple-instance learning has been for concept learning, yet for drug activity prediction, the label is a real-valued affinity measurement giving the binding strength. We present extensions of k-nearest neighbors (k-NN), Citation-kNN, and the diverse density algorithm for the real-valued setting and study their performance on Boolean and real-valued data. We also provide a method for generating chemically realistic artificial data.