Improving Recurrent CSVM Performance for Robot Navigation on Discrete Labyrinths

Authors:
Nancy Arana-Daniel; Carlos López-Franco; Eduardo Bayro-Corrochano
Affiliations:
Electronics and Computer Science Division, Exact Sciences and Engineering Campus, CUCEI, Universidad de Guadalajara, Guadalajara, México C.P. 44430; Electronics and Computer Science Division, Exact Sciences and Engineering Campus, CUCEI, Universidad de Guadalajara, Guadalajara, México C.P. 44430; Department of Electrical Engineering and Computer Science, Cinvestav del IPN, Zapopan, México
Venue:
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Year:
2009

Citing 5

A Reinforcement Connectionist Approach to Robot Path Finding in Non-Maze-Like Environments. Machine Learning

New algebraic tools for classical geometry. Geometric computing with Clifford algebras

Temporal credit assignment in reinforcement learning

Geometric preprocessing, geometric feedforward neural networks and Clifford support vector machines for visual learning. Neurocomputing

Active guidance for a finless rocket using neuroevolution. GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: Part II

Abstract

This paper presents an improvement of a recurrent learning system called LSTM-CSVM (introduced in [1]) for robot navigation applications. The approach addresses some of the main issues in this research area: navigation in large domains, partial observability, a limited number of learning experiences, and slow learning of optimal policies. The advantages of this new version of the LSTM-CSVM system are that it can find optimal paths through mazes and that it reduces the number of generations needed to evolve the system toward the optimal navigation policy, so the training time is also reduced. This is done by adding a heuristic methodology for finding the optimal path from the start state to the goal state. The heuristic can contain information about the whole environment or just partial information about it.