Backward vs. Forward-Oriented Decision Making in the Iterated Prisoner's Dilemma: A Comparison Between Two Connectionist Models

Authors:
Emilian Lalev; Maurice Grinberg
Affiliations:
Central and East European Center for Cognitive Science, New Bulgarian University, 21 Montevideo Street, 1618 Sofia, Bulgaria (both authors)
Venue:
Anticipatory Behavior in Adaptive Learning Systems
Year:
2007

Abstract

We compare the performance of two connectionist models developed to account for specific aspects of the decision-making process in the Iterated Prisoner's Dilemma Game. Both models are based on a common recurrent network architecture. The first uses a backward-oriented reinforcement learning algorithm to learn to play the game, while the second bases its move decisions on generated predictions about future games, moves, and payoffs. Both models predict the opponent's move and the expected payoff, and both have a built-in autoassociator in their architecture aimed at a more efficient representation of the payoff matrix. The simulation results show that the model with explicit anticipation of game outcomes could reproduce the experimentally observed dependency of the cooperation rate on the so-called cooperation index, which indicates the importance of anticipation in modeling the actual decision-making process of human participants. The role of the models' building blocks and mechanisms is investigated and discussed. Comparisons with experiments with human participants are presented.
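
The abstract does not define the cooperation index. As a minimal sketch, the snippet below assumes the commonly used definition attributed to Rapoport and Chammah, CI = (R - P) / (T - S), where T, R, P, and S are the temptation, reward, punishment, and sucker's payoffs of a Prisoner's Dilemma matrix. The function name and the example payoff values are hypothetical and are not taken from the paper.

```python
def cooperation_index(T: float, R: float, P: float, S: float) -> float:
    """Cooperation index of a Prisoner's Dilemma payoff matrix.

    Assumes the common definition CI = (R - P) / (T - S); the abstract
    itself does not state the formula.
    """
    # A Prisoner's Dilemma requires the payoff ordering T > R > P > S.
    if not (T > R > P > S):
        raise ValueError("payoffs must satisfy T > R > P > S")
    return (R - P) / (T - S)


# Hypothetical payoff matrices (T, R, P, S), chosen only for illustration.
example_games = {
    "lower CI": (10.0, 6.0, 4.0, 0.0),
    "higher CI": (10.0, 9.0, 1.0, 0.0),
}

for name, payoffs in example_games.items():
    print(name, cooperation_index(*payoffs))
```

Under this definition the index lies strictly between 0 and 1, and larger values correspond to games in which mutual cooperation is relatively more attractive compared with mutual defection.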