A reinforcement learning model of selective visual attention
Proceedings of the fifth international conference on Autonomous agents
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Expert Systems with Applications: An International Journal
Applying brain emotional learning algorithm for multivariable control of HVAC systems
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Emotion on FPGA: Model driven approach
Expert Systems with Applications: An International Journal
Pareto-Based Multiobjective Machine Learning: An Overview and Case Studies
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Dynamic multiobjective optimization problems: test cases, approximations, and applications
IEEE Transactions on Evolutionary Computation
Hi-index | 12.05 |
Dealing with uncertainties and lack of knowledge about problems and situations, there is a perpetual difficulty to evaluate the situations and action values in all time steps. On the other hand, the design of critics which delicately guide the agent even with reinforcement rewards and punishments in these complicated or blurred environments is laborious and cumbersome. In this study, we propose a framework for concurrent learning of control of attention to the sensory space, attention to the various critics to evaluate the selected motor actions, and the motor actions themselves. Previous works include the implementation of the control of attention for selecting the most important parts of data and/or reducing the dimensionality of the input space. However, decision making can depend not only on objective sensory data, but also upon mental states or subjective inputs as well. Specifically, we examine attention for the evaluations of selected action by various local critics, as well as sensory inputs. Each local critic evaluates the agent's actions regarding to its standpoints on task domain. Our agent tries to learn the degree of importance of each local critic's assessment, using sparse super-critic punishing revisions. So the agent learns the way of combining or even disregarding some of local critics while it learns to focus on the appropriate subset of features, and learns physical actions concurrently. By discovering the effective combination of local critics, the agent does not need any prior accurately designed critic. The mathematical formulation of proposed learning method is developed. Also, in order to evaluate the proposed method, two benchmarks are discussed. The effect of using attention control on robustness is analyzed via Monte Carlo analysis. The experimental results show the efficiency of proposed formulation in presence of uncertainties.