No regret learning in oligopolies: cournot vs. bertrand

Authors:
Uri Nadav;Georgios Piliouras
Affiliations:
Department of Computer Science, Stanford University, Stanford, CA;Department of Computer Science, Cornell University, Ithaca, NY
Venue:
SAGT'10 Proceedings of the Third international conference on Algorithmic game theory
Year:
2010

Citing 7
Cited 3

A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
The Nonstochastic Multiarmed Bandit Problem

SIAM Journal on Computing
Online convex optimization in the bandit setting: gradient descent without a gradient

SODA '05 Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms
Prediction, Learning, and Games

Prediction, Learning, and Games
Logarithmic regret algorithms for online convex optimization

Machine Learning
Correlated Equilibrium of Bertrand Competition

WINE '08 Proceedings of the 4th International Workshop on Internet and Network Economics
On the convergence of regret minimization dynamics in concave games

Proceedings of the forty-first annual ACM symposium on Theory of computing

Coalition formation and price of anarchy in cournot oligopolies

WINE'10 Proceedings of the 6th international conference on Internet and network economics
Beating the best Nash without regret

ACM SIGecom Exchanges
LP-Based covering games with low price of anarchy

WINE'12 Proceedings of the 8th international conference on Internet and Network Economics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cournot and Bertrand oligopolies constitute the two most prevalent models of firm competition. The analysis of Nash equilibria in each model reveals a unique prediction about the stable state of the system. Quite alarmingly, despite the similarities of the two models, their projections expose a stark dichotomy. Under the Cournot model, where firms compete by strategically managing their output quantity, firms enjoy positive profits as the resulting market prices exceed that of the marginal costs. On the contrary, the Bertrand model, in which firms compete on price, predicts that a duopoly is enough to push prices down to the marginal cost level. This suggestion that duopoly will result in perfect competition, is commonly referred to in the economics literature as the "Bertrand paradox". In this paper, we move away from the safe haven of Nash equilibria as we analyze these models in disequilibrium under minimal behavioral hypotheses. Specifically, we assume that firms adapt their strategies over time, so that in hindsight their average payoffs are not exceeded by any single deviating strategy. Given this no-regret guarantee, we show that in the case of Cournot oligopolies, the unique Nash equilibrium fully captures the emergent behavior. Notably, we prove that under natural assumptions the daily market characteristics converge to the unique Nash. In contrast, in the case of Bertrand oligopolies, a wide range of positive average payoff profiles can be sustained. Hence, under the assumption that firms have no-regret the Bertrand paradox is resolved and both models arrive to the same conclusion that increased competition is necessary in order to achieve perfect pricing.