Active model selection

  • Authors:
  • Omid Madani (Yahoo! Research Labs, Pasadena, CA)
  • Daniel J. Lizotte (University of Alberta, Edmonton)
  • Russell Greiner (University of Alberta, Edmonton)

  • Venue:
  • UAI '04: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence
  • Year:
  • 2004

Abstract

Classical learning assumes the learner is given a labeled data sample, from which it learns a model. The field of Active Learning deals with the situation where the learner begins not with a training sample, but instead with resources that it can use to obtain information to help identify the optimal model. To better understand this task, this paper presents and analyzes the simplified "(budgeted) active model selection" version, which captures the pure exploration aspect of many active learning problems in a clean and simple problem formulation. Here the learner can use a fixed budget of "model probes" (where each probe evaluates the specified model on a random indistinguishable instance) to identify which of a given set of possible models has the highest expected accuracy. Our goal is a policy that sequentially determines which model to probe next, based on the information observed so far. We present a formal description of this task, and show that it is NP-hard in general. We then investigate a number of algorithms for this task, including several existing ones (e.g., "Round-Robin", "Interval Estimation", "Gittins") as well as some novel ones (e.g., "Biased-Robin"), describing first their approximation properties and then their empirical performance on various problem instances. We observe empirically that the simple Biased-Robin algorithm significantly outperforms the other algorithms in the case of identical costs and priors.
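
For concreteness, the Python sketch below simulates a Biased-Robin-style probing policy on the budgeted problem the abstract describes. The setup is an assumption for illustration, not the paper's exact protocol: each probe of model i is modeled as an independent Bernoulli draw with that model's hidden true accuracy, and once the budget is spent the policy reports the model with the highest posterior mean accuracy under a uniform Beta(1, 1) prior. The function and parameter names are hypothetical.

    import random

    def biased_robin(true_accuracies, budget, seed=0):
        # Sketch of a Biased-Robin probing rule: keep probing the current
        # model while probes succeed; advance to the next model (cyclically)
        # after a failure.
        rng = random.Random(seed)
        k = len(true_accuracies)
        successes = [0] * k  # correct probes observed per model
        trials = [0] * k     # total probes spent per model
        current = 0
        for _ in range(budget):
            # Assumption for illustration: a probe is a Bernoulli draw with
            # the model's hidden true accuracy.
            outcome = rng.random() < true_accuracies[current]
            trials[current] += 1
            successes[current] += outcome
            if not outcome:
                current = (current + 1) % k
        # Report the model with the highest posterior mean accuracy under a
        # uniform Beta(1, 1) prior: (s + 1) / (n + 2).
        posterior_means = [(s + 1) / (n + 2) for s, n in zip(successes, trials)]
        return max(range(k), key=posterior_means.__getitem__)

    if __name__ == "__main__":
        # Three candidate models with hidden accuracies 0.6, 0.7, 0.8 and a
        # budget of 100 probes; the policy should usually select index 2.
        print("selected model:", biased_robin([0.6, 0.7, 0.8], budget=100))

The key design choice, and what distinguishes Biased-Robin from plain Round-Robin in this sketch, is that probes are not cycled uniformly: a model that keeps succeeding keeps receiving probes, so the budget concentrates on currently promising candidates.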