SIAM Journal on Control and Optimization
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Embodied evolution and learning: the neglected timing of maturation
ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Darwinian embodied evolution of the learning ability for survival
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Intrinsic Motivation Systems for Autonomous Mental Development
IEEE Transactions on Evolutionary Computation
An empirical comparison of two common multiobjective reinforcement learning algorithms
AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Hi-index | 0.00 |
Understanding the design principle of reward functions is a substantial challenge both in artificial intelligence and neuroscience. Successful acquisition of a task usually requires not only rewards for goals, but also for intermediate states to promote effective exploration. This paper proposes a method for designing 'intrinsic' rewards of autonomous agents by combining constrained policy gradient reinforcement learning and embodied evolution. To validate the method, we use Cyber Rodent robots, in which collision avoidance, recharging from battery packs, and 'mating' by software reproduction are three major 'extrinsic' rewards. We show in hardware experiments that the robots can find appropriate 'intrinsic' rewards for the vision of battery packs and other robots to promote approach behaviors.