Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems

Authors:
Christina Fang; Daniel Levinthal
Affiliations:
Department of Management and Organization, Stern School of Business, New York University, New York, New York 10012; Department of Management, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania 19104
Venue:
Organization Science
Year:
2009

Abstract

The classic trade-off between exploration and exploitation reflects the tension between gaining new information about alternatives to improve future returns and using the information currently available to improve present returns. By considering these issues in the context of a multistage, as opposed to a repeated, problem environment, we show that exploratory behavior has value quite apart from its role in revising beliefs. We show that even if current beliefs provide an unbiased characterization of the problem environment, maximizing with respect to these beliefs may lead to an inferior expected payoff relative to other mechanisms that make less aggressive use of the organization's beliefs. Search can lead to more robust actions in multistage decision problems than maximization, a benefit quite apart from its role in the updating of beliefs.
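As a rough illustration of the contrast drawn above between maximizing with respect to current beliefs and a mechanism that uses those beliefs less aggressively, the sketch below compares a greedy rule with a softmax rule. The abstract does not name a specific choice rule, so softmax selection, the function names, and the `temperature` parameter are assumptions made only for illustration.

```python
import math
import random


def greedy_choice(beliefs):
    """Pure maximization: return the index of the highest believed value."""
    return max(range(len(beliefs)), key=lambda i: beliefs[i])


def softmax_probs(beliefs, temperature=1.0):
    """Map believed values to choice probabilities.

    Assumption for illustration: a higher temperature flattens the
    distribution, so beliefs are used less aggressively.
    """
    peak = max(beliefs)  # subtract the maximum for numerical stability
    weights = [math.exp((b - peak) / temperature) for b in beliefs]
    total = sum(weights)
    return [w / total for w in weights]


def softmax_choice(beliefs, temperature=1.0, rng=random):
    """Sample an action index in proportion to its softmax probability."""
    probs = softmax_probs(beliefs, temperature)
    return rng.choices(range(len(beliefs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    beliefs = [0.2, 0.9, 0.5]  # hypothetical believed payoffs for three actions
    print("greedy pick:", greedy_choice(beliefs))
    print("softmax probabilities:", softmax_probs(beliefs, temperature=0.5))
    print("softmax pick:", softmax_choice(beliefs, temperature=0.5))
```

With a very low temperature the softmax rule behaves almost like the greedy rule, while a high temperature approaches uniform random choice. The sketch shows only the two selection rules and makes no claim about which performs better in a given multistage problem.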