Including cognitive biases and distance-based rewards in a connectionist model of complex problem solving

  • Authors:
  • Frédéric Dandurand;Thomas R. Shultz;Arnaud Rey

  • Affiliations:
  • Department of Psychology, Université de Montréal, 90 Vincent-d'Indy Avenue, Montreal, Quebec, H2V 2S9, Canada;Department of Psychology and School of Computer Science, McGill University, 1205 Dr. Penfield Avenue, Montreal, Quebec, H3A 1B1, Canada;Laboratoire de Psychologie Cognitive, CNRS-Aix-Marseille University, 3, place Victor Hugo, 13331 Marseille, France

  • Venue:
  • Neural Networks
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a cognitive, connectionist-based model of complex problem solving that integrates cognitive biases and distance-based and environmental rewards under a temporal-difference learning mechanism. The model is tested against experimental data obtained in a well-defined and planning-intensive problem. We show that incorporating cognitive biases (symmetry and simplicity) in a temporal-difference learning rule (SARSA) increases model adequacy-the solution space explored by biased models better fits observed human solutions. While learning from explicit rewards alone is intrinsically slow, adding distance-based rewards, a measure of closeness to goal, to the learning rule significantly accelerates learning. Finally, the model correctly predicts that explicit rewards have little impact on problem solvers' ability to discover optimal solutions.