Technical Note: \cal Q-Learning
Machine Learning
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Artificial Intelligence - Special issue on Robocop: the first step
Randomized strategic demand reduction: getting more by asking for less
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Friend-or-Foe Q-learning in General-Sum Games
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
On No-Regret Learning, Fictitious Play, and Nash Equilibrium
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
FAucS: An FCC Spectrum Auction Simulator for Autonomous Bidding Agents
WELCOM '01 Proceedings of the Second International Workshop on Electronic Commerce
Evaluating Concurrent Reinforcement Learners
ICMAS '00 Proceedings of the Fourth International Conference on MultiAgent Systems (ICMAS-2000)
Mechanisms for automated negotiation in state oriented domains
Journal of Artificial Intelligence Research
FAucS: An FCC Spectrum Auction Simulator for Autonomous Bidding Agents
WELCOM '01 Proceedings of the Second International Workshop on Electronic Commerce
Self-Enforcing Strategic Demand Reduction
AAMAS '02 Revised Papers from the Workshop on Agent Mediated Electronic Commerce on Agent-Mediated Electronic Commerce IV, Designing Mechanisms and Systems
A polynomial-time nash equilibrium algorithm for repeated games
Proceedings of the 4th ACM conference on Electronic commerce
Towards a pareto-optimal solution in general-sum games
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A polynomial-time Nash equilibrium algorithm for repeated games
Decision Support Systems - Special issue: The fourth ACM conference on electronic commerce
Theory of moves learners: towards non-myopic equilibria
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Multi-agent learning model with bargaining
Proceedings of the 38th conference on Winter simulation
Multiagent learning is not the answer. It is the question
Artificial Intelligence
Reaching pareto-optimality in prisoner's dilemma using conditional joint action learning
Autonomous Agents and Multi-Agent Systems
A two-layered multi-agent reinforcement learning model and algorithm
Journal of Network and Computer Applications
Social reward shaping in the prisoner's dilemma
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Online Multiagent Learning against Memory Bounded Adversaries
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games
Web Intelligence and Agent Systems
Learning against opponents with bounded memory
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A polynomial-time Nash equilibrium algorithm for repeated games
Decision Support Systems - Special issue: The fourth ACM conference on electronic commerce
Information Systems Research
Induction over Strategic Agents
Information Systems Research
Planning against fictitious players in repeated normal form games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Sequential targeted optimality as a new criterion for teaching and following in repeated games
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An overview of cooperative and competitive multiagent learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
The success and failure of tag-mediated evolution of cooperation
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Developing a Repeated Multi-agent Constant-Sum Game Algorithm Using Human Computation
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Decentralized anti-coordination through multi-agent learning
Journal of Artificial Intelligence Research
Multiagent learning in the presence of memory-bounded agents
Autonomous Agents and Multi-Agent Systems
Hi-index | 0.00 |
In business-related interactions such as the on-going high-stakes FCC spectrum auctions, explicit communication among participants is regarded as collusion, and is therefore illegal. In this paper, we consider the possibility of autonomous agents engaging in implicit negotiation via their tacit interactions. In repeated general-sum games, our testbed for studying this type of interaction, an agent using a "best response" strategy maximizes its own payoff assuming its behavior has no effect on its opponent. This notion of best response requires some degree of learning to determine the fixed opponent behavior. Against an unchanging opponent, the best-response agent performs optimally, and can be thought of as a "follower," since it adapts to its opponent. However, pairing two best-response agents in a repeated game can result in suboptimal behavior. We demonstrate this suboptimality in several different games using variants of Q-learning as an example of a best-response strategy.We then examine two "leader" strategies that induce better performance from opponent followers via stubbornness and threats. These tactics are forms of implicit negotiation in that they aim to achieve a mutually beneficial outcome without using explicit communication outside of the game.