Implicit Negotiation in Repeated Games

Authors:
Michael L. Littman;Peter Stone
Affiliations:
-;-
Venue:
ATAL '01 Revised Papers from the 8th International Workshop on Intelligent Agents VIII
Year:
2001

Citing 10
Cited 26

Technical Note: \cal Q-Learning

Machine Learning
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork

Artificial Intelligence - Special issue on Robocop: the first step
Randomized strategic demand reduction: getting more by asking for less

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Friend-or-Foe Q-learning in General-Sum Games

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
On No-Regret Learning, Fictitious Play, and Nash Equilibrium

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
FAucS: An FCC Spectrum Auction Simulator for Autonomous Bidding Agents

WELCOM '01 Proceedings of the Second International Workshop on Electronic Commerce
Evaluating Concurrent Reinforcement Learners

ICMAS '00 Proceedings of the Fourth International Conference on MultiAgent Systems (ICMAS-2000)
Mechanisms for automated negotiation in state oriented domains

Journal of Artificial Intelligence Research

FAucS: An FCC Spectrum Auction Simulator for Autonomous Bidding Agents

WELCOM '01 Proceedings of the Second International Workshop on Electronic Commerce
Self-Enforcing Strategic Demand Reduction

AAMAS '02 Revised Papers from the Workshop on Agent Mediated Electronic Commerce on Agent-Mediated Electronic Commerce IV, Designing Mechanisms and Systems
A polynomial-time nash equilibrium algorithm for repeated games

Proceedings of the 4th ACM conference on Electronic commerce
Towards a pareto-optimal solution in general-sum games

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A polynomial-time Nash equilibrium algorithm for repeated games

Decision Support Systems - Special issue: The fourth ACM conference on electronic commerce
Theory of moves learners: towards non-myopic equilibria

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Multi-agent learning model with bargaining

Proceedings of the 38th conference on Winter simulation
A general criterion and an algorithmic framework for learning in multi-agent systems

Machine Learning
Multiagent learning is not the answer. It is the question

Artificial Intelligence
Reaching pareto-optimality in prisoner's dilemma using conditional joint action learning

Autonomous Agents and Multi-Agent Systems
A two-layered multi-agent reinforcement learning model and algorithm

Journal of Network and Computer Applications
Social reward shaping in the prisoner's dilemma

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Online Multiagent Learning against Memory Bounded Adversaries

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
Learning against opponents with bounded memory

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A polynomial-time Nash equilibrium algorithm for repeated games

Decision Support Systems - Special issue: The fourth ACM conference on electronic commerce
On Evaluating Information Revelation Policies in Procurement Auctions: A Markov Decision Process Approach

Information Systems Research
Induction over Strategic Agents

Information Systems Research
Planning against fictitious players in repeated normal form games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Sequential targeted optimality as a new criterion for teaching and following in repeated games

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
The success and failure of tag-mediated evolution of cooperation

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Developing a Repeated Multi-agent Constant-Sum Game Algorithm Using Human Computation

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Decentralized anti-coordination through multi-agent learning

Journal of Artificial Intelligence Research
Multiagent learning in the presence of memory-bounded agents

Autonomous Agents and Multi-Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In business-related interactions such as the on-going high-stakes FCC spectrum auctions, explicit communication among participants is regarded as collusion, and is therefore illegal. In this paper, we consider the possibility of autonomous agents engaging in implicit negotiation via their tacit interactions. In repeated general-sum games, our testbed for studying this type of interaction, an agent using a "best response" strategy maximizes its own payoff assuming its behavior has no effect on its opponent. This notion of best response requires some degree of learning to determine the fixed opponent behavior. Against an unchanging opponent, the best-response agent performs optimally, and can be thought of as a "follower," since it adapts to its opponent. However, pairing two best-response agents in a repeated game can result in suboptimal behavior. We demonstrate this suboptimality in several different games using variants of Q-learning as an example of a best-response strategy.We then examine two "leader" strategies that induce better performance from opponent followers via stubbornness and threats. These tactics are forms of implicit negotiation in that they aim to achieve a mutually beneficial outcome without using explicit communication outside of the game.