Learning epistemic actions in model-free memory-free reinforcement learning: experiments with a neuro-robotic model

  • Authors:
  • Dimitri Ognibene;Nicola Catenacci Volpi;Giovanni Pezzulo;Gianluca Baldassarre

  • Affiliations:
  • Personal Robotics Laboratory, Imperial College London, UK;IMT Institute for Advanced Studies, Lucca, Italy;Istituto di Scienze e Tecnologie della Cognizione, CNR, Italy and Istituto di Linguistica Computazionale "Antonio Zampolli", CNR, Italy;Istituto di Scienze e Tecnologie della Cognizione, CNR, Italy

  • Venue:
  • Living Machines'13: Proceedings of the Second International Conference on Biomimetic and Biohybrid Systems
  • Year:
  • 2013

Abstract

Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propose a definition of epistemic actions for POMDPs that derives from their characterizations in the cognitive science and classical planning literature. We give theoretical insights into how partial observability and epistemic actions can affect the learning process and performance in the extreme conditions of model-free and memory-free reinforcement learning, where hidden information cannot be represented. Finally, we investigate these concepts using an integrated eye-arm neural architecture for robot control, which can use its effectors to execute epistemic actions and can exploit the actively gathered information to efficiently accomplish a seek-and-reach task.