Controlling the learning process of real-time heuristic search

Authors:
Masashi Shimbo;Toru Ishida
Affiliations:
Graduate School of Information Science, Nara Institute of Science and Technology, Nara 630-0192, Japan;Department of Social Informatics, Kyoto University, Kyoto 606-8501, Japan
Venue:
Artificial Intelligence
Year:
2003

Citing 22
Cited 23

Heuristics: intelligent search strategies for computer problem solving

Heuristics: intelligent search strategies for computer problem solving
Depth-first iterative-deepening: an optimal admissible tree search

Artificial Intelligence
Parallel and distributed computation: numerical methods

Parallel and distributed computation: numerical methods
Real-time heuristic search

Artificial Intelligence
The (n2-1)-puzzle and related relocation problems

Journal of Symbolic Computation
Do the right thing: studies in limited rationality

Do the right thing: studies in limited rationality
Linear-space best-first search

Artificial Intelligence
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Agent searching in a tree and the optimality of iterative deepening

Artificial Intelligence
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
The sciences of the artificial (3rd ed.)

The sciences of the artificial (3rd ed.)
Real-time search for learning autonomous agents

Real-time search for learning autonomous agents
Stochastic node caching for memory-bounded search

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Enhanced A algorithms for multiple alignments: optimal alignments for several sequences and k-opt approximate alignments for large cases

Theoretical Computer Science - Special issue: Genome informatics
Planning as heuristic search

Artificial Intelligence - Special issue on heuristic search in artificial intelligence
Minimax real-time heuristic search

Artificial Intelligence - Special issue on heuristic search in artificial intelligence
Modern Control Systems

Modern Control Systems
Moving-Target Search: A Real-Time Search for Changing Goals

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Sorting and Decision Trees with POMDPs

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
A* with Partial Expansion for Large Branching Factor Problems

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Speeding up the Convergence of Real-Time Search

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Improving the learning efficiencies of realtime search

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1

The no inference engine theory - Performing conflict resolution during development

Decision Support Systems
RTTES: Real-time search in dynamic environments

Applied Intelligence
Statistically assisted routing algorithms (SARA) for hop count based forwarding in wireless sensor networks

Wireless Networks
Performance simulations of moving target search algorithms

International Journal of Computer Games Technology - Artificial Intelligence for Computer Games
Novel moving target search algorithms for computer gaming

Computers in Entertainment (CIE) - SPECIAL ISSUE: Media Arts and Games (Part II)
Pessimistic Heuristics Beat Optimistic Ones in Real-Time Search

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
LRTA* Works Much Better with Pessimistic Heuristics

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Speeding up learning in real-time search via automatic state abstraction

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Learning in real-time search: a unifying framework

Journal of Artificial Intelligence Research
Anytime heuristic search

Journal of Artificial Intelligence Research
Graph abstraction in real-time heuristic search

Journal of Artificial Intelligence Research
Dynamic control in real-time heuristic search

Journal of Artificial Intelligence Research
Real-time heuristic search with a priority queue

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
LRTA*(k)

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Hardness measures for gridworld benchmarks and performance analysis of real-time heuristic search algorithms

Journal of Heuristics
Multi-agent real-time pursuit

Autonomous Agents and Multi-Agent Systems
On learning in agent-centered search

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Case-based subgoaling in real-time heuristic search for video game pathfinding

Journal of Artificial Intelligence Research
Propagating updates in real-time search: HLRTA (k)

CAEPIA'05 Proceedings of the 11th Spanish association conference on Current Topics in Artificial Intelligence
Properties of the DGS-Auction Algorithm

Computational Economics
Learning where you are going and from whence you came: h- and g-cost learning in real-time heuristic search

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Avoiding and escaping depressions in real-time heuristic search

Journal of Artificial Intelligence Research
Weighted real-time heuristic search

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Real-time search provides an attractive framework for intelligent autonomous agents, as it allows us to model an agent's ability to improve its performance through experience. However, the behavior of real-time search agents is far from rational during the learning (convergence) process, in that they fail to balance the efforts to achieve a short-term goal (i.e., to safely arrive at a goal state in the present problem solving trial) and a long-term goal (to find better solutions through repeated trials). As a remedy, we introduce two techniques for controlling the amount of exploration, both overall and per trial. The weighted real-time search reduces the overall amount of exploration and accelerates convergence. It sacrifices admissibility but provides a nontrivial bound on the converged solution cost. The real-time search with upper bounds insures solution quality in each trial when the state space is undirected, These techniques result in a convergence process more stable compared with that of the Learning Real-Time A* algorithm.