Reinforcement learning: a survey

Authors:
Leslie Pack Kaelbling;Michael L. Littman;Andrew W. Moore
Affiliations:
Computer Science Department, Brown University, Providence, RI;Computer Science Department, Brown University, Providence, RI;Carnegie Mellon University, Pittsburgh, PA
Venue:
Journal of Artificial Intelligence Research
Year:
1996

Citing 62
Cited 696

A theory of the learnable

Communications of the ACM
Empirical model-building and response surface

Empirical model-building and response surface
Theory of linear and integer programming

Theory of linear and integer programming
Stochastic optimal control: theory and application

Stochastic optimal control: theory and application
Dynamic programming: deterministic and stochastic models

Dynamic programming: deterministic and stochastic models
Stochastic systems: estimation, identification and adaptive control

Stochastic systems: estimation, identification and adaptive control
Parallel and distributed computation: numerical methods

Parallel and distributed computation: numerical methods
Learning automata: an introduction

Learning automata: an introduction
Integrated architecture for learning, planning, and reacting based on approximating dynamic programming

Proceedings of the seventh international conference (1990) on Machine learning
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
Generalization and scaling in reinforcement learning

Advances in neural information processing systems 2
A survey of algorithmic methods for partially observed Markov decision processes

Annals of Operations Research
Reinforcement learning in Markovian and non-Markovian environments

NIPS-3 Proceedings of the 1990 conference on Advances in neural information processing systems 3
Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Machine Learning
Practical Issues in Temporal Difference Learning

Machine Learning
Technical Note: \cal Q-Learning

Machine Learning
Transfer of Learning by Composing Solutions of Elemental Sequential Tasks

Machine Learning
The Convergence of TD(λ) for General λ

Machine Learning
Reinforcement learning and its application to control

Reinforcement learning and its application to control
Learning in embedded systems

Learning in embedded systems
The complexity of stochastic games

Information and Computation
Reinforcement learning for robots using neural networks

Reinforcement learning for robots using neural networks
Efficient learning and planning within the Dyna framework

Adaptive Behavior
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Efficient reinforcement learning

COLT '94 Proceedings of the seventh annual conference on Computational learning theory
TD-Gammon, a self-teaching backgammon program, achieves master-level play

Neural Computation
Associative Reinforcement Learning: Functions in k-DNF

Machine Learning
Associative Reinforcement Learning: A Generate and Test Algorithm

Machine Learning
TD(λ) Converges with Probability 1

Machine Learning
Memoryless policies: theoretical limitations and practical results

SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
A comparison of Q-learning and classifier systems

SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
Learning to solve Markovian decision processes

Learning to solve Markovian decision processes
Asynchronous Stochastic Approximation and Q-Learning

Machine Learning
Acting optimally in partially observable stochastic domains

AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Robot shaping: developing autonomous agents through learning

Artificial Intelligence
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Temporal difference learning and TD-Gammon

Communications of the ACM
Adding temporary memory to ZCS

Adaptive Behavior
ALECSYS and the AutonoMouse: Learning to Control a Real Robot by Distributed Classifier Systems

Machine Learning
Continual learning in reinforcement environments

Continual learning in reinforcement environments
Feature-based methods for large scale dynamic programming

Machine Learning - Special issue on reinforcement learning
Reinforcement learning with replacing eligibility traces

Machine Learning - Special issue on reinforcement learning
Average reward reinforcement learning: foundations, algorithms, and empirical results

Machine Learning - Special issue on reinforcement learning
Predicting real-time planner performance by domain characterization

Predicting real-time planner performance by domain characterization
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Neural Network Perception for Mobile Robot Guidance

Neural Network Perception for Mobile Robot Guidance
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Brains, Behavior and Robotics

Brains, Behavior and Robotics
Finite State Markovian Decision Processes

Finite State Markovian Decision Processes
Learning to Predict by the Methods of Temporal Differences

Machine Learning
Feudal Reinforcement Learning

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Dynamic Programming

Dynamic Programming
Memory Approaches to Reinforcement Learning in Non-Markovian Domains

Memory Approaches to Reinforcement Learning in Non-Markovian Domains
Temporal credit assignment in reinforcement learning

Temporal credit assignment in reinforcement learning
Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning)

Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning)
Reinforcement learning with selective perception and hidden state

Reinforcement learning with selective perception and hidden state
Classifier fitness based on accuracy

Evolutionary Computation
On the convergence of stochastic iterative dynamic programming algorithms

Neural Computation
A reinforcement learning approach to job-shop scheduling

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
On the complexity of solving Markov decision problems

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Rapid, safe, and incremental learning of navigation strategies

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Explanation-Based Learning and Reinforcement Learning: A Unified View

Machine Learning
PAC adaptive control of linear systems

COLT '97 Proceedings of the tenth annual conference on Computational learning theory
Learning agents for uncertain environments (extended abstract)

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Learning situation-dependent costs: improving planning from probabilistic robot execution

AGENTS '98 Proceedings of the second international conference on Autonomous agents
A history-based approach for adaptive robot behavior in dynamic environments

AGENTS '98 Proceedings of the second international conference on Autonomous agents
Iterated phantom induction: a little knowledge can go a long way

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Bayesian Q-learning

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Tree based discretization for continuous state space reinforcement learning

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot

Machine Learning - Special issue on learning in autonomous robots
Hierarchical Learning of Navigational Behaviors in anAutonomous Robot using a Predictive Sparse DistributedMemory

Machine Learning - Special issue on learning in autonomous robots
Learning from History for Behavior-Based Mobile Robots in Non-Stationary Conditions

Machine Learning - Special issue on learning in autonomous robots
PETEEI: a PET with evolving emotional intelligence

Proceedings of the third annual conference on Autonomous Agents
Team-partitioned, opaque-transition reinforcement learning

Proceedings of the third annual conference on Autonomous Agents
Adaptivity and learning in intelligent real-time systems

Proceedings of the third annual conference on Autonomous Agents
Proteus*—adaptive polling system for proactive management of ATM networks using collaborative intelligent agents

Proceedings of the third annual conference on Autonomous Agents
Nomadic radio: scaleable and contextual notification for wearable audio messaging

Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Conjectural Equilibrium in Multiagent Learning

Machine Learning
Learning to Take Actions

Machine Learning
Efficient exploration for optimizing immediate reward

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Exploration of Multi-State Environments: Local Measures and Back-Propagation of Uncertainty

Machine Learning
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
Video textures

Proceedings of the 27th annual conference on Computer graphics and interactive techniques
Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web

Machine Learning - Special issue on information retrieval
A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions

Machine Learning
Nomadic radio: speech and audio interaction for contextual messaging in nomadic environments

ACM Transactions on Computer-Human Interaction (TOCHI) - Special issue on human-computer interaction with mobile systems
Using background knowledge to speed reinforcement learning in physical agents

Proceedings of the fifth international conference on Autonomous agents
Reinforcement learning for fuzzy agents: application to a pighouse environment control

New learning paradigms in soft computing
Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network

Neural Processing Letters
Multiagent learning using a variable learning rate

Artificial Intelligence
Designing agent collectives for systems with markovian dynamics

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Controlled animation of video sprites

Proceedings of the 2002 ACM SIGGRAPH/Eurographics symposium on Computer animation
Machine learning and inductive logic programming for multi-agent systems

Mutli-agents systems and applications
Relational reinforcement learning

Mutli-agents systems and applications
Iterated Phantom Induction: A Knowledge-Based Approach to Learning Control

Machine Learning
Dynamic non-Bayesian decision making in multi-agent systems

Annals of Mathematics and Artificial Intelligence
Efficient and inefficient ant coverage methods

Annals of Mathematics and Artificial Intelligence
Ant colony optimization and stochastic gradient descent

Artificial Life
A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making

Applied Intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot

Autonomous Robots
Hierarchical Learning of Navigational Behaviors in anAutonomous Robot using a Predictive Sparse Distributed Memory

Autonomous Robots
Learning from History for Behavior-Based Mobile Robots in Non-Stationary Conditions

Autonomous Robots
Dynamics of a Classical Conditioning Model

Autonomous Robots
Robot Awareness in Cooperative Mobile Robot Learning

Autonomous Robots
Multiagent Systems: A Survey from a Machine Learning Perspective

Autonomous Robots
Acquiring Mobile Robot Behaviors by Learning Trajectory Velocities

Autonomous Robots
Predictive Software

Automated Software Engineering
Automating the Construction of Internet Portals with Machine Learning

Information Retrieval
The Effect of Evolution in Artificial Life Learning Behavior

Journal of Intelligent and Robotic Systems
Relational Reinforcement Learning

Machine Learning
Robot learning driven by emotions

Adaptive Behavior
Maximum entropy-based optimal threshold selection using deterministic reinforcement learning with controlled randomization

Signal Processing
Learning intelligent behavior in a non-stationary and partially observable environment

Artificial Intelligence Review
Actor-critic models of the basal ganglia: new anatomical and computational perspectives

Neural Networks - Computational models of neuromodulation
Neuromodulation of decision and response selection

Neural Networks - Computational models of neuromodulation
Training Reinforcement Neurocontrollers Using the Polytope Algorithm

Neural Processing Letters
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies

Autonomous Agents and Multi-Agent Systems
FLAME—Fuzzy Logic Adaptive Model of Emotions

Autonomous Agents and Multi-Agent Systems
A Topic-Specific Web Robot Model Based on Restless Bandits

IEEE Internet Computing
Sequence Learning: From Recognition and Prediction to Sequential Decision Making

IEEE Intelligent Systems
Optimal control using the transport equation: the Liouville machine

Adaptive Behavior
Evolving neural networks through augmenting topologies

Evolutionary Computation
An Information-Theoretic Approach for the Quantification of Relevance

ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Distributing a Mind on the Internet: The World-Wide-Mind

ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Problem Decomposition for Behavioural Cloning

ECML '00 Proceedings of the 11th European Conference on Machine Learning
Layered Learning

ECML '00 Proceedings of the 11th European Conference on Machine Learning
Towards a Universal Theory of Artificial Intelligence Based on Algorithmic Probability and Sequential Decisions

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Learning While Exploring: Bridging the Gaps in the Eligibility Traces

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Speeding Up Relational Reinforcement Learning through the Use of an Incremental First Order Decision Tree Learner

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Analysis and Design of Robot's Behavior: Towards a Methodology

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Learning a Navigation Task in Changing Environments by Multi-task Reinforcement Learning

EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Extraction of Local Structural Features in Images by Using a Multi-scale Relevance Function

MLDM '99 Proceedings of the First International Workshop on Machine Learning and Data Mining in Pattern Recognition
L-VIBRA: Learning the VIBRA Architecture

IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Least-Squares Methods in Reinforcement Learning for Control

SETN '02 Proceedings of the Second Hellenic Conference on AI: Methods and Applications of Artificial Intelligence
Learning to Balance Upright Posture: What can be Learnt Using Adaptive NN Models?

WIRN VIETRI 2002 Proceedings of the 13th Italian Workshop on Neural Nets-Revised Papers
Relational Reinforcement Learning

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Machine Learning and Inductive Logic Programming for Multi-agent Systems

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Reinforcement Learning for Cooperating and Communicating Reactive Agents in Electrical Power Grids

Balancing Reactivity and Social Deliberation in Multi-Agent Systems, From RoboCup to Real-World Applications (selected papers from the ECAI 2000 Workshop and additional contributions)
Parameterized Logic Programs where Computing Meets Learning

FLOPS '01 Proceedings of the 5th International Symposium on Functional and Logic Programming
Sequential Instance-Based Learning for Planning in the Context of an Imperfect Information Game

ICCBR '01 Proceedings of the 4th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Intraday FX Trading: An Evolutionary Reinforcement Learning Approach

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
An Introduction to Learning Fuzzy Classifier Systems

Learning Classifier Systems, From Foundations to Applications
A Roadmap to the Last Decade of Learning Classifier System Research

Learning Classifier Systems, From Foundations to Applications
Two Dimensional Evaluation Reinforcement Learning

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Comparing the Learning Processes of Cognitive Distance Learning and Search Based Agent

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Market Performance of Adaptive Trading Agents in Synchronous Double Auctions

PRIMA 2001 Proceedings of the 4th Pacific Rim International Workshop on Multi-Agents, Intelligent Agents: Specification, Modeling, and Applications
Game Theory and Artificial Intelligence

Selected papers from the UKMAS Workshop on Foundations and Applications of Multi-Agent Systems
Andhill-98: A RoboCup Team which Reinforces Positioning with Observation

RoboCup-98: Robot Soccer World Cup II
Team-Partitioned, Opaque-Transition Reinforced Learning

RoboCup-98: Robot Soccer World Cup II
From a Concurrent Architecture to a Concurrent Autonomous Agents Architecture

RoboCup-99: Robot Soccer World Cup III
Gemini in RoboCup-2000

RoboCup 2000: Robot Soccer World Cup IV
On the Relationship between Learning Capability and the Boltzmann-Formula

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Preliminary Results

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Introduction to Sequence Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making

Sequence Learning - Paradigms, Algorithms, and Applications
Automatic Segmentation of Sequences through Hierarchical Reinforcement Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Simulating Competing Alife Organisms by Constructive Compound Neural Networks

AI '00 Proceedings of the 13th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Abstraction Methods for Game Theoretic Poker

CG '00 Revised Papers from the Second International Conference on Computers and Games
Logic, Knowledge Representation, and Bayesian Decision Theory

CL '00 Proceedings of the First International Conference on Computational Logic
Faster Near-Optimal Reinforcement Learning: Adding Adaptiveness to the E3 Algorithm

ALT '99 Proceedings of the 10th International Conference on Algorithmic Learning Theory
Feedforward Neural Networks in Reinforcement Learning Applied to High-Dimensional Motor Control

ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
Learning Hierarchical Skills from Observation

DS '02 Proceedings of the 5th International Conference on Discovery Science
Mining Documents for Complex Semantic Relations by the Use of Context Classification

DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
To Collect or Not to Collect? Machine Learning for Memory Management

Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
Bounds on Sample Size for Policy Evaluation in Markov Environments

COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Self-Optimizing and Pareto-Optimal Policies in General Environments Based on Bayes-Mixtures

COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
Continual Robot Learning with Constructive Neural Networks

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning

IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Sensory-motor primitives as a basis for imitation: linking perception to action and biology to robotics

Imitation in animals and artifacts
Constructing complex minds through multiple authors

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
A context-based architecture for general problem solving

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Sequential cost-sensitive decision making with reinforcement learning

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of the self-organising map to reinforcement learning

Neural Networks - New developments in self-organizing maps
State abstraction for programmable reinforcement learning agents

Eighteenth national conference on Artificial intelligence
Reinforcement learning of coordination in cooperative multi-agent systems

Eighteenth national conference on Artificial intelligence
The design of collectives of agents to control non-Markovian systems

Eighteenth national conference on Artificial intelligence
Dispersion games: general definitions and some specific learning results

Eighteenth national conference on Artificial intelligence
Anticipations control behavior: animal behavior in an anticipatory learning classifier system

Adaptive Behavior
Soccer strategies that live in the B2B world of negotiation and decision-making

Decision Support Systems
Designing neural control architectures for an autonomous robot using vision to solve complex learning tasks

Biologically inspired robot behavior engineering
A general learning approach to visually guided 3D-positioning and pose control of robot arms

Biologically inspired robot behavior engineering
Generalising Video Textures

TPCG '03 Proceedings of the Theory and Practice of Computer Graphics 2003
Concurrent layered learning

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
The empirical Bayes envelope and regret minimization in competitive Markov decision processes

Mathematics of Operations Research
Using Reinforcement Learning for Similarity Assessment in Case-Based Systems

IEEE Intelligent Systems
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
A reinforcement learning adaptive fuzzy controller for robots

Fuzzy Sets and Systems - Theme: Modeling and control
Autonomous mental development in high dimensional context and action spaces

Neural Networks - 2003 Special issue: Advances in neural networks research — IJCNN'03
R-max - a general polynomial time algorithm for near-optimal reinforcement learning

The Journal of Machine Learning Research
Machine Learning for Computer Graphics: A Manifesto and Tutorial

PG '03 Proceedings of the 11th Pacific Conference on Computer Graphics and Applications
Mining Plans for Customer-Class Transformation

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Reinforcing reachable routes

Computer Networks: The International Journal of Computer and Telecommunications Networking
Speedup learning for repair-based search by identifying redundant steps

The Journal of Machine Learning Research
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Modeling the adaptive visual system: a survey of principled approaches

Neural Networks - Special issue: Neuroinformatics
A Tabu-Search Hyperheuristic for Timetabling and Rostering

Journal of Heuristics
Fault prognostics using dynamic wavelet neural networks

Artificial Intelligence for Engineering Design, Analysis and Manufacturing
Optimal Ordered Problem Solver

Machine Learning
Learning obstacle avoidance with an operant behavior model

Artificial Life
Stable repeated strategies for information exchange between two autonomous agents

Artificial Intelligence
A Reinforcement Learning Framework for Parameter Control in Computer Vision Applications

CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
A Geometric Approach to Multi-Criterion Reinforcement Learning

The Journal of Machine Learning Research
Self-organized load balancing in proxy servers: algorithms and performance

Journal of Intelligent Information Systems - Special issue on web intelligence
Everywhere messaging

IBM Systems Journal
Transfer of Experience Between Reinforcement Learning Environments with Progressive Difficulty

Artificial Intelligence Review
Utile distinction hidden Markov models

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate

Web Intelligence and Agent Systems
Cross channel optimized marketing by reinforcement learning

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Integrating Guidance into Relational Reinforcement Learning

Machine Learning
Incremental heuristic search in AI

AI Magazine
Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Precomputing avatar behavior from human motion data

SCA '04 Proceedings of the 2004 ACM SIGGRAPH/Eurographics symposium on Computer animation
General methodology 1: optimising discrete event simulation models using a reinforcement learning agent

Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Supply chain planning: a reinforcement learning approach to production planning in the fabrication/fulfillment manufacturing process

Proceedings of the 35th conference on Winter simulation: driving innovation
Exploitation vs. exploration: choosing a supplier in an environment of incomplete information

Decision Support Systems
Efficient learning equilibrium

Artificial Intelligence
Asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Guiding queries to information sources with InfoBeacons

Proceedings of the 5th ACM/IFIP/USENIX international conference on Middleware
Learning and Exploiting Relative Weaknesses of Opponent Agents

Autonomous Agents and Multi-Agent Systems
Applying Active Space Principles to Active Classrooms

PERCOMW '05 Proceedings of the Third IEEE International Conference on Pervasive Computing and Communications Workshops
Graphical user interface of an interactive system for schemes design, used in distance learning

CompSysTech '04 Proceedings of the 5th international conference on Computer systems and technologies
Coordinating Multiple Agents via Reinforcement Learning

Autonomous Agents and Multi-Agent Systems
Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection

Genetic Programming and Evolvable Machines
Teaching robots to plan through Q-learning

Robotica
Using Optimal Foraging Models to Evaluate Learned Robotic Foraging Behavior

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Fast multi-level adaptation for interactive autonomous characters

ACM Transactions on Graphics (TOG)
System for foreign exchange trading using genetic algorithms and reinforcement learning

International Journal of Systems Science
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

Proceedings of the 2005 ACM symposium on Applied computing
Evolving Soccer Keepaway Players Through Task Decomposition

Machine Learning
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Optimal Control Using the Transport Equation: The Liouville Machine

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Learning Users' Interests by Quality Classification in Market-Based Recommender Systems

IEEE Transactions on Knowledge and Data Engineering
Teaching virtual characters how to use body language

Lecture Notes in Computer Science
Automatic pan-tilt-zoom calibration in the presence of hybrid sensor networks

Proceedings of the third ACM international workshop on Video surveillance & sensor networks
Intelligent exploration method for XCS

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Learning strategies for story comprehension: a reinforcement learning approach

ICML '05 Proceedings of the 22nd international conference on Machine learning
Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Online Evolution for a Self-Adapting Robotic Navigation System Using Evolvable Hardware

Artificial Life
A Reinforcement Learning Algorithm in Cooperative Multi-Robot Domains

Journal of Intelligent and Robotic Systems
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games

Autonomous Agents and Multi-Agent Systems
An Ensemble of Cooperative Extended Kohonen Maps for Complex Robot Motion Tasks

Neural Computation
Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms

Neural Computation
A Computational Model of the Functional Role of the Ventral-Striatal D2 Receptor in the Expression of Previously Acquired Behaviors

Neural Computation
Experiments in socially guided machine learning: understanding how humans teach

Proceedings of the 1st ACM SIGCHI/SIGART conference on Human-robot interaction
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms

Neural Computation
A Reinforcement Learning Approach to Online Clustering

Neural Computation
Reinforcement Learning in Continuous Time and Space

Neural Computation
DIAGAL: An Agent Communication Language Based on Dialogue Games and Sustained by Social Commitments

Autonomous Agents and Multi-Agent Systems
Precomputing avatar behavior from human motion data

Graphical Models - Special issue on SCA 2004
Relational temporal difference learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Dealing with non-stationary environments using context detection

ICML '06 Proceedings of the 23rd international conference on Machine learning
Learning hierarchical task networks by observation

ICML '06 Proceedings of the 23rd international conference on Machine learning
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Graph kernels and Gaussian processes for relational reinforcement learning

Machine Learning
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
Improving reinforcement learning with context detection

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies

The Knowledge Engineering Review
A reinforcement learning approach to active camera foveation

Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
Quantum robot: structure, algorithms and applications

Robotica
I-SEE: an intelligent search agent for electronic commerce

International Journal of Electronic Commerce
Approximate Reasoning in MAS: Rough Set Approach

IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Plans as Products of Learning

IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
A reinforcement learning algorithm to minimize the mean tardiness of a single machine with controlled capacity

Proceedings of the 38th conference on Winter simulation
Neural-based downlink scheduling algorithm for broadband wireless networks

Computer Communications
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
RLDDE: A novel reinforcement learning-based dimension and delay estimator for neural networks in time series prediction

Neurocomputing
A reinforcement learning approach to dynamic resource allocation

Engineering Applications of Artificial Intelligence
Dimensions of complexity of intelligent agents

PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
First-Order Logical Neural Networks

International Journal of Hybrid Intelligent Systems - Recent developments in Hybrid Intelligent Systems
Performance analysis of the AntNet algorithm

Computer Networks: The International Journal of Computer and Telecommunications Networking
The theory and experiments of designing cooperative intelligent systems

Decision Support Systems
Adaptive load balancing of parallel applications with multi-agent reinforcement learning on heterogeneous systems

Scientific Programming - Distributed Computing and Applications
Allocating time and location information to activity-travel patterns through reinforcement learning

Knowledge-Based Systems
DEA: An Architecture for Goal Planning and Classification

Neural Computation
Adaptive Behavior in Autonomous Agents

Presence: Teleoperators and Virtual Environments
If multi-agent learning is the answer, what is the question?

Artificial Intelligence
Approximate Reasoning in MAS: Rough Set Approach

WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Modeling embodied visual behaviors

ACM Transactions on Applied Perception (TAP)
On developmental mental architectures

Neurocomputing
Editorial: New trends in Cognitive Science: Integrative approaches to learning and development

Neurocomputing
Reinforcement learning by reward-weighted regression for operational space control

Proceedings of the 24th international conference on Machine learning
Learning and Cooperation in Sequential Games

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An Action-Selection Calculus

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Responsive characters from motion fragments

ACM SIGGRAPH 2007 papers
Near-optimal character animation with continuous control

ACM SIGGRAPH 2007 papers
Part III: dynamic texture synthesis

ACM SIGGRAPH 2007 courses
Learning to trade with insider information

Proceedings of the ninth international conference on Electronic commerce
Data acquisition and cost-effective predictive modeling: targeting offers for electronic commerce

Proceedings of the ninth international conference on Electronic commerce
Guiding exploration by pre-existing knowledge without modifying reward

Neural Networks
Reinforcement learning for a biped robot based on a CPG-actor-critic method

Neural Networks
Shaping multi-agent systems with gradient reinforcement learning

Autonomous Agents and Multi-Agent Systems
Metric embedding of view-graphs

Autonomous Robots
Application of reinforcement learning in robot soccer

Engineering Applications of Artificial Intelligence
A reinforcement agent for threshold fusion

Applied Soft Computing
A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence
Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Neural Computation
Affect, Anticipation, and Adaptation: Affect-Controlled Selection of Anticipatory Simulation in Artificial Adaptive Agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of reinforcement learning to the game of Othello

Computers and Operations Research
Analysis and optimization of service availability in a HA cluster with load-dependent machine availability

IEEE Transactions on Parallel and Distributed Systems
Learning reinforcement strategies for a changing workforce

WBED'07 Proceedings of the sixth conference on IASTED International Conference Web-Based Education - Volume 2
Dynamically learning sources of trust information: experience vs. reputation

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Survey paper: Optimal experimental design and some related control problems

Automatica (Journal of IFAC)
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence
Learning to Control in Operational Space

International Journal of Robotics Research
On the nature of neural information: A critique of the received view 50 years later

Neurocomputing
A Policy-based Approach for Reconfiguration Management and Enforcement in Autonomic Communication Systems

Wireless Personal Communications: An International Journal
Optimizing time warp simulation with reinforcement learning techniques

Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come
Error-driven active learning in growing radial basis function networks for early robot learning

Neurocomputing
Accelerating autonomous learning by using heuristic selection of actions

Journal of Heuristics
Dynamic learning of action patterns for object acquisition

International Journal of Intelligent Systems Technologies and Applications
A cooperative learning approach to Mixed Performance Controller design: a behavioural viewpoint

International Journal of Intelligent Systems Technologies and Applications
Artificial Intelligence techniques: An introduction to their use for modelling environmental systems

Mathematics and Computers in Simulation
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

International Journal of Knowledge-based and Intelligent Engineering Systems
Investigation of Q-learning in the context of a virtual learning environment

Informatics in education
A node discovery service for partially mobile sensor networks

Proceedings of the 2nd international workshop on Middleware for sensor networks
Water reservoir control under economic, social and environmental constraints

Automatica (Journal of IFAC)
Service oriented architecture for financial customer relationship management

Proceedings of the second international conference on Distributed event-based systems
Accelerating neuroevolutionary methods using a Kalman filter

Proceedings of the 10th annual conference on Genetic and evolutionary computation
Automating spoken dialogue management design using machine learning: An industry perspective

Speech Communication
Learning Agents in an Artificial Power Exchange: Tacit Collusion, Market Power and Efficiency of Two Double-auction Mechanisms

Computational Economics
Tuning continual exploration in reinforcement learning: An optimality property of the Boltzmann strategy

Neurocomputing
Cooperative content distribution and traffic engineering

Proceedings of the 3rd international workshop on Economics of networked systems
Pessimistic cost-sensitive active learning of decision trees for profit maximizing targeting campaigns

Data Mining and Knowledge Discovery
Reinforcement learning for problems with symmetrical restricted states

Robotics and Autonomous Systems
Ensemble clustering with voting active clusters

Pattern Recognition Letters
2008 Special Issue: Two forms of immediate reward reinforcement learning for exploratory data analysis

Neural Networks
Efficient Exploration in Reinforcement Learning Based on Utile Suffix Memory

Informatica
Adaptive hybrid control for noise rejection

NN'08 Proceedings of the 9th WSEAS International Conference on Neural Networks
Optimization of Handover Parameters for Traffic Sharing in GERAN

Wireless Personal Communications: An International Journal
Incremental Learning of Planning Operators in Stochastic Domains

SOFSEM '07 Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science
Ambulance Decision Support Using Evolutionary Reinforcement Learning in Robocup Rescue Simulation League

RoboCup 2006: Robot Soccer World Cup X
Fuzzy Q-Map Algorithm for Reinforcement Learning

Computational Intelligence and Security
Reinforcement Learning Reward Functions for Unsupervised Learning

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
A Kernel-Based Reinforcement Learning Approach to Dynamic Behavior Modeling of Intrusion Detection

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
A Novel Method of Constructing ANN

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Towards the Automatic Learning of Reflex Modulation for Mobile Robot Navigation

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Multi-agent Learning Dynamics: A Survey

CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
Flexible Control Mechanism for Multi-DOF Robotic Arm Based on Biological Fluctuation

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Efficient Node Discovery in Mobile Wireless Sensor Networks

DCOSS '08 Proceedings of the 4th IEEE international conference on Distributed Computing in Sensor Systems
Epoch-Incremental Queue-Dyna Algorithm

ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Rational Bidding Using Reinforcement Learning

GECON '08 Proceedings of the 5th international workshop on Grid Economics and Business Models
State-Dependent Exploration for Policy Gradient Methods

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Some Progress of Supervised Learning

ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
EANT+KALMAN: An Efficient Reinforcement Learning Method for Continuous State Partially Observable Domains

KI '08 Proceedings of the 31st annual German conference on Advances in Artificial Intelligence
A Logical Framework to Reinforcement Learning Using Hybrid Probabilistic Logic Programs

SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Passive dynamic walker controller design employing an RLS-based natural actor-critic learning algorithm

Engineering Applications of Artificial Intelligence
Towards adaptive programming: integrating reinforcement learning into a programming language

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
A probabilistic model of integration

Decision Support Systems
DESIGN OF AN AEROSPACE LAUNCH VEHICLE AUTOPILOT BASED ON OPTIMIZED EMOTIONAL LEARNING ALGORITHM

Cybernetics and Systems
REINFORCEMENT LEARNING FOR POMDP USING STATE CLASSIFICATION

Applied Artificial Intelligence
Meta-case-based reasoning: self-improvement through self-understanding

Journal of Experimental & Theoretical Artificial Intelligence
Maintaining dynamic channel profiles on the web

Proceedings of the VLDB Endowment
Design and analysis of GA based neural/fuzzy optimum adaptive control

WSEAS Transactions on Systems and Control
Q-Strategy: A Bidding Strategy for Market-Based Allocation of Grid Services

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
An Information-Theoretic Class of Stochastic Decision Processes

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Effects of chaotic exploration on reinforcement learning in target capturing task

International Journal of Knowledge-based and Intelligent Engineering Systems
Route optimization with Q-learning

ACS'08 Proceedings of the 8th conference on Applied computer scince
Improving the Exploration Strategy in Bandit Algorithms

Learning and Intelligent Optimization
Action-Based Environment Modeling for Maintaining Trust

Trust in Agent Societies
Simulation of sequential data: An enhanced reinforcement learning approach

Expert Systems with Applications: An International Journal
Sequential optimal design of neurophysiology experiments

Neural Computation
How people talk when teaching a robot

Proceedings of the 4th ACM/IEEE international conference on Human robot interaction
Motivated Learning from Interesting Events: Adaptive, Multitask Learning Agents for Complex Environments

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Simulation-optimization using a reinforcement learning approach

Proceedings of the 40th Conference on Winter Simulation
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
QL2, a simple reinforcement learning scheme for two-player zero-sum Markov games

Neurocomputing
Dialogue games that agents play within a society

Artificial Intelligence
Reinforcement Learning: A Tutorial Survey and Recent Advances

INFORMS Journal on Computing
A reinforcement learning framework for utility-based scheduling in resource-constrained systems

Future Generation Computer Systems
A policy-based framework for autonomic reconfiguration management in heterogeneous networks

Proceedings of the 7th International Conference on Mobile and Ubiquitous Multimedia
SO-antnet for improving load sharing in MANET

Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
Designing autonomous layered video coders

Image Communication
SarsaLandmark: an algorithm for learning in POMDPs with landmarks

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multiagent learning in large anonymous games

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings

Similarity-Based Clustering
Reordering Sparsification of Kernel Machines in Approximate Policy Iteration

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Opponent Modeling in Adversarial Environments through Learning Ingenuity

Proceedings of the 2005 conference on Self-Organization and Autonomic Informatics (I)
Cognitive learning with automatic goal acquisition

Proceedings of the 2006 conference on STAIRS 2006: Proceedings of the Third Starting AI Researchers' Symposium
Cognitive Architectures: Where do we go from here?

Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Transfer Learning and Intelligence: an Argument and Approach

Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Learning by Automatic Option Discovery from Conditionally Terminating Sequences

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Partially Observable Markov Decision Process Approximations for Adaptive Sensing

Discrete Event Dynamic Systems
Learning teaching strategies in an Adaptive and Intelligent Educational System through Reinforcement Learning

Applied Intelligence
Neuroevolutionary reinforcement learning for generalized helicopter control

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
A cooperative and self-adaptive metaheuristic for the facility location problem

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
From Continuous Behaviour to Discrete Knowledge

IWANN '03 Proceedings of the 7th International Work-Conference on Artificial and Natural Neural Networks: Part II: Artificial Neural Nets Problem Solving Methods
Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning*

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes

Anticipatory Behavior in Adaptive Learning Systems
Toward Rough-Granular Computing

RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Robotic Target Tracking with Approximation Space-Based Feedback During Reinforcement Learning

RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
An Efficient and Adaptive Mechanism for Parallel Simulation Replication

PADS '09 Proceedings of the 2009 ACM/IEEE/SCS 23rd Workshop on Principles of Advanced and Distributed Simulation
A q-learning based adaptive bidding strategy in combinatorial auctions

Proceedings of the 11th International Conference on Electronic Commerce
Hand grip pattern recognition for mobile user interfaces

IAAI'06 Proceedings of the 18th conference on Innovative applications of artificial intelligence - Volume 2
QUICR-learning for multi-agent coordination

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
RL-CD: dealing with non-stationarity in reinforcement learning

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Quality Enhancement Based on Reinforcement Learning and Feature Weighting for a Critiquing-Based Recommender

ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Parallel Algorithms for Solving Markov Decision Process

ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
Optimal efficient learning equilibrium: imperfect monitoring in symmetric games

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Adaptive treatment of epilepsy via batch-mode reinforcement learning

IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
Unknown rewards in finite-horizon domains

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Evaluation of a hierarchical reinforcement learning spoken dialogue system

Computer Speech and Language
Ants and reinforcement learning: a case study in routing in dynamic networks

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Optimizing dialogue management with reinforcement learning: experiments with the NJFun system

Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox

Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
PHA*: finding the shortest path with A* in an unknown physical environment

Journal of Artificial Intelligence Research
Reinforcement learning for agents with many sensors and actuators acting in categorizable environments

Journal of Artificial Intelligence Research
Closed-loop learning of visual control policies

Journal of Artificial Intelligence Research
A formal framework for speedup learning from problems and solutions

Journal of Artificial Intelligence Research
Dynamic non-Bayesian decision making

Journal of Artificial Intelligence Research
AntNet: distributed stigmergetic control for communications networks

Journal of Artificial Intelligence Research
A machine learning approach to building domain-specific search engines

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Bounding the suboptimality of reusing subproblems

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Heuristic selection of actions in multiagent reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
State similarity based approach for improving performance in RL

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Analogical learning in a turn-based strategy game

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning to count by think aloud imitation

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Direct code access in self-organizing neural networks for reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning strategies for open-domain natural language question answering

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Adapting Reinforcement Learning for Trust: Effective Modeling in Dynamic Environments

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Real-time planning for parameterized human motion

Proceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Selecting GVT interval for time-warp-based distributed simulation using reinforcement learning technique

SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques

Robotics and Autonomous Systems
Reinforcement learning in distributed domains: beyond team games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Exploiting multiple secondary reinforcers in policy gradient reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning strategies for open-domain natural language question answering

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Utility-based on-line exploration for repeated navigation in an embedded graph

Artificial Intelligence
Reinforcement learning versus model predictive control: a comparison on a power system problem

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A Q-learning approach to derive optimal consumption and investment strategies

IEEE Transactions on Neural Networks
Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit

IEEE Transactions on Neural Networks
Knowledge-based recurrent neural networks in Reinforcement Learning

ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
Speeding up reinforcement learning using recurrent neural networks in non-Markovian environments

ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
An agent structure for evaluating micro-level MAS performance

PerMIS '07 Proceedings of the 2007 Workshop on Performance Metrics for Intelligent Systems
An RL-based scheduling algorithm for video traffic in high-rate wireless personal area networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Agent Architectures for Compliance

ESAW '09 Proceedings of the 10th International Workshop on Engineering Societies in the Agents World X
Reinforcement Learning Based Web Service Compositions for Mobile Business

WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
A reinforcement learning framework for utility-based scheduling in resource-constrained systems

A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning approach to dynamic resource allocation

A reinforcement learning approach to dynamic resource allocation
Two-step recommendation based personalization for future services

ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Nash Q-learning multi-agent flow control for high-speed networks

ACC'09 Proceedings of the 2009 conference on American Control Conference
Coordinated multiple ramps metering based on neuro-fuzzy adaptive dynamic programming

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive state space partitioning for reinforcement learning

Engineering Applications of Artificial Intelligence
An enhanced reinforcement routing protocol for inter-vehicular unicast application

EuroIMSA '08 Proceedings of the IASTED International Conference on Internet and Multimedia Systems and Applications
Probabilistic fuzzy logic system: a tool to process stochastic and imprecise information

FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Introducing a round robin tournament into Blondie24

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Intercluster connection in cognitive wireless mesh networks based on intelligent network coding

EURASIP Journal on Advances in Signal Processing - Special issue on dynamic spectrum access for wireless networking
Online layered learning for cross-layer optimization of dynamic multimedia systems

MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems
Autonomous development of vergence control driven by disparity energy neuron populations

Neural Computation
A novel technique to design a fuzzy logic controller using Q(λ)-learning and genetic algorithms in the pursuit-evasion game

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A probabilistic fuzzy logic system: learning in the stochastic environment with incomplete dynamics

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Real-valued Q-learning in multi-agent cooperation

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Cooperative multi-robot reinforcement learning: a framework in hybrid state space

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Truncated fourier series formulation for bipedal walking balance control

Robotica
A Recursive Classifier System for Partially Observable Environments

Fundamenta Informaticae
A framework for the design of a military operational supply network

CISDA'09 Proceedings of the Second IEEE international conference on Computational intelligence for security and defense applications
Online reinforcement learning for dynamic multimedia systems

IEEE Transactions on Image Processing
Improving iterative repair strategies for scheduling with the SVM

Neurocomputing
Asynchronous neurocomputing for optimal control and reinforcement learning with large state spaces

Neurocomputing
A new approach to fuzzy classifier systems and its application in self-generating neuro-fuzzy systems

Neurocomputing
On agents and grids: Creating the fabric for a new generation of distributed intelligent systems

Web Semantics: Science, Services and Agents on the World Wide Web
Induction over Strategic Agents

Information Systems Research
Review article: Synergizing reinforcement learning and game theory-A new direction for control

Applied Soft Computing
Sequential anomaly detection based on temporal-difference learning: Principles, models and case studies

Applied Soft Computing
Fuzzy decision tree function approximation in reinforcement learning

International Journal of Artificial Intelligence and Soft Computing
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments

The Journal of Machine Learning Research
Convergence results for ant routing algorithms viastochastic approximation

Proceedings of the 13th ACM international conference on Hybrid systems: computation and control
Tournament selection: stable fitness pressure in XCS

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
A hybrid architecture combining reactive plan execution and reactive learning

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Knowledge discovery and emergent complexity in bioinformatics

KDECB'06 Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics
Reinforcement learning for online control of evolutionary algorithms

ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems
A theory of profit sharing in dynamic environment

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
A region selecting method which performs observation and action in the multi-resolution environment

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Constructing an autonomous agent with an interdependent heuristics

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Evolving symbolic controllers

EvoWorkshops'03 Proceedings of the 2003 international conference on Applications of evolutionary computing
Learning in groups of traffic signals

Engineering Applications of Artificial Intelligence
Heuristic search based exploration in reinforcement learning

IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Reinforcement learning scheme for grouping and anti-predator behavior

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
MBEANN: mutation-based evolving artificial neural networks

ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Efficient exploration through active learning for value function approximation in reinforcement learning

Neural Networks
Simple model-based exploration and exploitation of Markov decision processes using the elimination algorithm

MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Stochastic weights reinforcement learning for exploratory data analysis

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Clustering with reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Independent factor reinforcement learning for portfolio management

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Kernel-based online NEAT for keepaway soccer

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Two-layer networked learning control of a nonlinear HVAC system

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Toward perception based computing: a rough-granular perspective

WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
Simple algorithmic principles of discovery, subjective beauty, selective attention, curiosity & creativity

DS'07 Proceedings of the 10th international conference on Discovery science
Market-based hierarchical resource management using machine learning

DSOM'07 Proceedings of the Distributed systems: operations and management 18th IFIP/IEEE international conference on Managing virtualization of networks and services
A novel design of hidden web crawler using reinforcement learning based agents

APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
A self-optimized job scheduler for heterogeneous server clusters

JSSPP'07 Proceedings of the 13th international conference on Job scheduling strategies for parallel processing
A new Q-learning with generalized approximation spaces

ICNC'09 Proceedings of the 5th international conference on Natural computation
A state-cluster based Q-learning

ICNC'09 Proceedings of the 5th international conference on Natural computation
OBDD-based universal planning: specifying and solving planning problems for synchronized agents in non-deterministic domains

Artificial intelligence today
Reinforcement learning approaches to coordination in cooperative multi-agent systems

Adaptive agents and multi-agent systems
ONDUX: on-demand unsupervised learning for information extraction

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
On decentralized self-adaptation: lessons from the trenches and challenges for the future

Proceedings of the 2010 ICSE Workshop on Software Engineering for Adaptive and Self-Managing Systems
Brief announcement: a reinforcement learning approach for dynamic load-balancing of parallel digital logic simulation

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Cooperative communications with relay selection for QoS provisioning in wireless sensor networks

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
A policy-based sensor selection system with goal oriented singular value decomposition technique

POLICY'09 Proceedings of the 10th IEEE international conference on Policies for distributed systems and networks
Improving optimistic exploration in model-free reinforcement learning

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Probabilistic Policy Reuse for inter-task transfer learning

Robotics and Autonomous Systems
Joint path and wavelength selection using Q-learning in optical burst switching networks

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Monotonicity of constrained optimal transmission policies in correlated fading channels with ARQ

IEEE Transactions on Signal Processing
A general framework to detect unsafe system states from multisensor data stream

IEEE Transactions on Intelligent Transportation Systems
Optimizing debt collections using constrained reinforcement learning

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Using spatial hints to improve policy reuse in a reinforcement learning agent

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action selection and task sequence learning for hybrid dynamical cognitive agents

Robotics and Autonomous Systems
Learning hybridization strategies in evolutionary algorithms

Intelligent Data Analysis
MRL-CC: a novel cooperative communication protocol for QoS provisioning in wireless sensor networks

International Journal of Sensor Networks
Intelligent negotiation behaviour model for an open railway access market

Expert Systems with Applications: An International Journal
Autonomous decision making in layered and reconfigurable video coders

Asilomar'09 Proceedings of the 43rd Asilomar conference on Signals, systems and computers
The Dynamics of Multi-Agent Reinforcement Learning

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
On the scalability and dynamic load-balancing of optimistic gate level simulation

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
An adaptive Q-learning algorithm developed for agent-based computational modeling of electricity market

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Reinforcement learning with time

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
A robust and fast action selection mechanism for planning

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Evolution and sustainability of a wildlife monitoring sensor network

Proceedings of the 8th ACM Conference on Embedded Networked Sensor Systems
The neuronal replicator hypothesis

Neural Computation
A study of Q-learning considering negative rewards

Artificial Life and Robotics
Coalition-based metaheuristic: a self-adaptive metaheuristic using reinforcement learning and mimetism

Journal of Heuristics
An autonomic testing framework for IPv6 configuration protocols

AIMS'10 Proceedings of the Mechanisms for autonomous management of networks and services, and 4th international conference on Autonomous infrastructure, management and security
Time-based reward shaping in real-time strategy games

ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
AdQL - anomaly detection Q-learning in control multi-queue systems with QoS constraints

KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Why and how hippocampal transition cells can be used in reinforcement learning

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Reinforcement learning scheme for grouping and characterization of multi-agent network

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Extending context spaces theory by proactive adaptation

ruSMART/NEW2AN'10 Proceedings of the Third conference on Smart Spaces and next generation wired, and 10th international conference on Wireless networking
Generating adaptive route instructions using hierarchical reinforcement learning

SC'10 Proceedings of the 7th international conference on Spatial cognition
Evolving a single scalable controller for an octopus arm with a variable number of segments

PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Part-based feature synthesis for human detection

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Evolutionary dynamics of regret minimization

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
A View on Human Goal-Directed Activity and the Construction of Artificial Intelligence

Minds and Machines
On-line feedback-based automatic resource configuration for distributed applications

Cluster Computing
Emotion and reinforcement: affective facial expressions facilitate robot learning

ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
Self-learning fuzzy logic controllers for pursuit-evasion differential games

Robotics and Autonomous Systems
Generalized learning automata for multi-agent reinforcement learning

AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Solving multi-stage games with hierarchical learning automata that bootstrap

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Finding optimum route of electrical energy transmission line using multi-criteria with Q-learning

Expert Systems with Applications: An International Journal
Autonomous discovery of subgoals using acyclic state trajectories

ICICA'10 Proceedings of the First international conference on Information computing and applications
A reinforcement learning based framework for prediction of near likely nodes in data-centric mobile wireless networks

EURASIP Journal on Wireless Communications and Networking
Adaptation-based programming in java

Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
Structural knowledge transfer by spatial abstraction for reinforcement learning agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Supplier behavior modeling and winner determination using parallel MDP

Expert Systems with Applications: An International Journal
Rapid behavior adaptation for human-centered robots in a dynamic environment based on the integration of primitive confidences on multi-sensor elements

Artificial Life and Robotics
Continuous state/action reinforcement learning: A growing self-organizing map approach

Neurocomputing
The inverse classification problem

Journal of Computer Science and Technology
Accelerating point-based POMDP algorithms via greedy strategies

SIMPAR'10 Proceedings of the Second international conference on Simulation, modeling, and programming for autonomous robots
Reduct based Q-learning: an introduction

Proceedings of the 2011 International Conference on Communication, Computing & Security
Extended spatial and temporal learning scale in reinforcement learning

CIMMACS '10 Proceedings of the 9th WSEAS international conference on computational intelligence, man-machine systems and cybernetics
A hybrid agent architecture integrating desire, intention and reinforcement learning

Expert Systems with Applications: An International Journal
Multiobjective reinforcement learning for traffic signal control using vehicular ad hoc network

EURASIP Journal on Advances in Signal Processing - Special title on vehicular ad hoc networks
Robust high performance reinforcement learning through weighted k-nearest neighbors

Neurocomputing
Approximate dynamic programming for an inventory problem: Empirical comparison

Computers and Industrial Engineering
Cognitive Radio with Reinforcement Learning Applied to Multicast Downlink Transmission with Power Adjustment

Wireless Personal Communications: An International Journal
Spatially-aware dialogue control using hierarchical reinforcement learning

ACM Transactions on Speech and Language Processing (TSLP)
Path selection in disaster response management based on Q-learning

International Journal of Automation and Computing
A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems

Neural Processing Letters
Empirically evaluating the application of reinforcement learning to the induction of effective and adaptive pedagogical strategies

User Modeling and User-Adapted Interaction
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds

ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
Intelligent "health restoration system": reinforcement learning feedback to diagnosis and treatment planning

TELE-INFO'06 Proceedings of the 5th WSEAS international conference on Telecommunications and informatics
Supporting smart interactions with predictive analytics

The smart internet
The implementation of Q-learning for problems in continuous state and action space using SOM-based fuzzy systems

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Knowledge of opposite actions for reinforcement learning

Applied Soft Computing
Dual memory model for using pre-existing knowledge in reinforcement learning tasks

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Stochastic processes for return maximization in reinforcement learning

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Completely self-referential optimal reinforcement learners

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Supporting smart interactions with predictive analytics

The smart internet
Software agent with reinforcement learning approach for medical image segmentation

Journal of Computer Science and Technology
Data Collection in Wireless Sensor Networks with Mobile Elements: A Survey

ACM Transactions on Sensor Networks (TOSN)
A dynamic route change mechanism for mobile ad hoc networks

International Journal of Communication Networks and Distributed Systems
Optimization of heuristic search using recursive algorithm selection and reinforcement learning

Annals of Mathematics and Artificial Intelligence
Reinforcement learning techniques for the control of wastewater treatment plants

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Towards a collaborative ranking mechanism for efficient and personalized internet search service provisioning

Journal of Computational Methods in Sciences and Engineering - Intelligent Systems and Knowledge Management (Part II)
Multiagent learning in large anonymous games

Journal of Artificial Intelligence Research
Exploiting Best-Match Equations for Efficient Reinforcement Learning

The Journal of Machine Learning Research
Collaborative learning in uncertain environments

CARE@AI'09/CARE@IAT'10 Proceedings of the CARE@AI 2009 and CARE@IAT 2010 international conference on Collaborative agents - research and development
User to user QoE routing system

WWIC'11 Proceedings of the 9th IFIP TC 6 international conference on Wired/wireless internet communications
Bridging the gap between reinforcement learning and knowledge representation: a logical off- and on-policy framework

ECSQARU'11 Proceedings of the 11th European conference on Symbolic and quantitative approaches to reasoning with uncertainty
Group-agreement as a reliability measure for witness recommendations in reputation-based trust protocols

Transactions on computational science XII
SociableSense: exploring the trade-offs of adaptive sampling and computation offloading for social sensing

MobiCom '11 Proceedings of the 17th annual international conference on Mobile computing and networking
Heliza: talking dirty to the attackers

Journal in Computer Virology
An information-theoretic analysis of return maximization in reinforcement learning

Neural Networks
Learning to act optimally in partially observable Markov decision processes using hybrid probabilistic logic programs

SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
On the power of global reward signals in reinforcement learning

MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
A composite self-organisation mechanism in an agent network

WISE'11 Proceedings of the 12th international conference on Web information system engineering
An emergent approach for the control of wastewater treatment plants by means of reinforcement learning techniques

Expert Systems with Applications: An International Journal
Model based Bayesian exploration

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning finite-state controllers for partially observable environments

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning to cooperate via policy search

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
The apriori stochastic dependency detection (ASDD) algorithm for learning stochastic logic rules

CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
A multi-agent fuzzy-reinforcement learning method for continuous domains

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
An adaptive approach for the exploration-exploitation dilemma for learning agents

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
General discounting versus average reward

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Teachable characters: user studies, design principles, and learning performance

IVA'06 Proceedings of the 6th international conference on Intelligent Virtual Agents
Learning in one-shot strategic form games

ECML'06 Proceedings of the 17th European conference on Machine Learning
A sparse kernel-based least-squares temporal difference algorithm for reinforcement learning

ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part I
Unique state and automatical action abstracting based on logical MDPs with negation

ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II
Testing probabilistic equivalence through reinforcement learning

FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Network-Adaptive qos routing using local information

APNOMS'06 Proceedings of the 9th Asia-Pacific international conference on Network Operations and Management: management of Convergence Networks and Services
Cognitive agents for sense and respond logistics

DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Opponent learning for multi-agent system simulation

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Efficient gradient estimation for motor control learning

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Policy-contingent abstraction for robust robot control

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
A machine learning approach to intraday trading on foreign exchange markets

IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
ARKAQ-learning: autonomous state space segmentation and policy generation

ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
The quantitative law of effect is a robust emergent property of an evolutionary algorithm for reinforcement learning

ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Institutionalization through reciprocal habitualization and typification

WRAC'05 Proceedings of the Second international conference on Radical Agent Concepts: innovative Concepts for Autonomic and Agent-Based Systems
Fast reinforcement learning of dialogue policies using stable function approximation

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
URL: A unified reinforcement learning approach for autonomic cloud management

Journal of Parallel and Distributed Computing
Analysis and improvement of policy gradient estimation

Neural Networks
A hybrid learning strategy for discovery of policies of action

IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
Grey reinforcement learning for incomplete information processing

TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Q-Learning with FCMAC in multi-agent cooperation

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Multiagent reinforcement learning for a planetary exploration multirobot system

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Multiagent model for grid computing

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Feature extraction for decision-theoretic planning in partially observable environments

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Teamwork and simulation in hybrid cognitive architecture

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Mapping web usage patterns to MDP model and mining with reinforcement learning

FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Machine learning of plan robustness knowledge about instances

ECML'05 Proceedings of the 16th European conference on Machine Learning
Adaptive modeling: an approach and a method for implementing adaptive agents

MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
Agent based decision support system using reinforcement learning under emergency circumstances

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
Multiagent association rules mining in cooperative learning systems

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Evaluating the effectiveness of exploration and accumulated experience in automatic case elicitation

ICCBR'05 Proceedings of the 6th international conference on Case-Based Reasoning Research and Development
Optimal motion planning by reinforcement learning in autonomous mobile vehicles

Robotica
A reinforcement learning approach for host-based intrusion detection using sequences of system calls

ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Multi-Agent cooperative reinforcement learning in 3d virtual world

ICSI'10 Proceedings of the First international conference on Advances in Swarm Intelligence - Volume Part I
Reinforcement learning by chaotic exploration generator in target capturing task

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Heuristic rule induction for decision making in near-deterministic domains

SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Towards intelligent management of a student's time

SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Learning multi-modal control programs

HSCC'05 Proceedings of the 8th international conference on Hybrid Systems: computation and control
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning

Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Market-based recommender systems: learning users' interests by quality classification

AOIS'04 Proceedings of the 6th international conference on Agent-Oriented Information Systems II
Modeling the brain's operating system

BVAI'05 Proceedings of the First international conference on Brain, Vision, and Artificial Intelligence
Learning-Based spectrum selection in cognitive radio ad hoc networks

WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Emergence of flocking behavior based on reinforcement learning

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Emergent consensus in decentralised systems using collaborative reinforcement learning

Self-star Properties in Complex Information Systems
Rough sets and vague concept approximation: from sample approximation to adaptive learning

Transactions on Rough Sets V
Intelligent Social Media Indexing and Sharing Using an Adaptive Indexing Search Engine

ACM Transactions on Intelligent Systems and Technology (TIST)
Trace equivalence characterization through reinforcement learning

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Coordinating learning agents for multiple resource job scheduling

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
A user trust-based collaborative filtering recommendation algorithm

ICICS'09 Proceedings of the 11th international conference on Information and Communications Security
Effectiveness of considering state similarity for reinforcement learning

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Efficient deep web crawling using reinforcement learning

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
HYPERION: a recursive hyper-heuristic framework

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
An adaptive approach for the exploration-exploitation dilemma and its application to economic systems

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
An adaptive learning scheme for load balancing with zone partition in multi-sink wireless sensor network

Expert Systems with Applications: An International Journal
DynaMOC: a dynamic overlapping coalition-based multiagent system for coordination of mobile ad hoc devices

ICIRA'11 Proceedings of the 4th international conference on Intelligent Robotics and Applications - Volume Part I
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-agent reinforcement learning for simulating pedestrian navigation

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
On modeling the affective effect on learning

MIWAI'11 Proceedings of the 5th international conference on Multi-Disciplinary Trends in Artificial Intelligence
Reinforcement Programming

Computational Intelligence
WILDSENSING: Design and deployment of a sustainable sensor network for wildlife monitoring

ACM Transactions on Sensor Networks (TOSN)
Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems

Knowledge-Based Systems
Conflict resolution and learning probability matching in a neural cell-assembly architecture

Cognitive Systems Research
Psychological models of human and optimal performance in bandit problems

Cognitive Systems Research
Value-function reinforcement learning in Markov games

Cognitive Systems Research
When do differences matter? On-line feature extraction through cognitive economy

Cognitive Systems Research
Self-organization in an agent network: A mechanism and a potential application

Decision Support Systems
Hierarchical task decomposition through symbiosis in reinforcement learning

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Sample aware embedded feature selection for reinforcement learning

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Value function approximation through sparse bayesian modeling

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Dynamic channel selection with reinforcement learning for cognitive WLAN over fiber

International Journal of Communication Systems
Tax Collections Optimization for New York State

Interfaces
A novel feature sparsification method for kernel-based approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A rapid sparsification method for kernel machines in approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A Recursive Classifier System for Partially Observable Environments

Fundamenta Informaticae
A New Architecture for Learning Classifier Systems to Solve POMDP Problems

Fundamenta Informaticae
An online kernel-based clustering approach for value function approximation

SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems

Automatica (Journal of IFAC)
Interactive Character Animation Using Simulated Physics: A State-of-the-Art Review

Computer Graphics Forum
An improved choice function heuristic selection for cross domain heuristic search

PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Safe robot learning by energy limitation

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
Learning policies for battery usage optimization in electric vehicles

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Planning interactive task for intelligent characters

Computer Animation and Virtual Worlds
Multi-agent task division learning in hide-and-seek games

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Learning motion controllers with adaptive depth perception

EUROSCA'12 Proceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation
Learning motion controllers with adaptive depth perception

Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Building ubiquitous computing applications using the VERSAG adaptive agent framework

Journal of Systems and Software
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Observer effect from stateful resources in agent sensing

Autonomous Agents and Multi-Agent Systems
Adaptive value function approximation for continuous-state stochastic dynamic programming

Computers and Operations Research
A reinforcement learning approach to improve the argument selection effectiveness in argumentation-based negotiation

Expert Systems with Applications: An International Journal
Scheduling fighter aircraft maintenance with reinforcement learning

Proceedings of the Winter Simulation Conference
Learning classifier system with average reward reinforcement learning

Knowledge-Based Systems
Reinforcement Learning for Improving Gene Identification Accuracy by Combination of Gene-Finding Programs

International Journal of Applied Metaheuristic Computing
Enhancing the Adaptation of BDI Agents Using Learning Techniques

International Journal of Agent Technologies and Systems
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management

International Journal of Agent Technologies and Systems
Simulating Cooperative Behaviors in Dynamic Networks

International Journal of Agent Technologies and Systems
Safe exploration of state and action spaces in reinforcement learning

Journal of Artificial Intelligence Research
Convergence results for ant routing algorithms via stochastic approximation

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A state-dependent time evolving multi-constraint routing algorithm

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Non-reciprocating Sharing Methods in Cooperative Q-Learning Environments

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Abstraction in Model Based Partially Observable Reinforcement Learning Using Extended Sequence Trees

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Performance of distributed multi-agent multi-state reinforcement spectrum management using different exploration schemes

Expert Systems with Applications: An International Journal
2013 Special Issue: Modulation for emergent networks: Serotonin and dopamine

Neural Networks
GESwarm: grammatical evolution for the automatic synthesis of collective behaviors in swarm robotics

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Neuroevolution results in emergence of short-term memory in multi-goal environment

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Adding memory condition to learning classifier systems to solve partially observable environments

International Journal of Computer Applications in Technology
Learning to crawl deep web

Information Systems
Distributed dynamic data driven prediction based on reinforcement learning approach

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Generation of tests for programming challenge tasks using multi-objective optimization

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Testing probabilistic equivalence through Reinforcement Learning

Information and Computation
Utilizing query change for session search

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
INFORM: a dynamic interest forwarding mechanism for information centric networking

Proceedings of the 3rd ACM SIGCOMM workshop on Information-centric networking
Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Novel method for using Q-learning in small microcontrollers

Proceedings of the 51st ACM Southeast Conference
Real-time structured light coding for adaptive patterns

Journal of Real-Time Image Processing
Exploration in relational domains for model-based reinforcement learning

The Journal of Machine Learning Research
Towards minimizing the annotation cost of certified text classification

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Exploratory and interactive daily deals recommendation

Proceedings of the 7th ACM conference on Recommender systems
Learning policies for battery usage optimization in electric vehicles

Machine Learning
Assessing the appropriateness of using markov decision processes for RF spectrum management

Proceedings of the 16th ACM international conference on Modeling, analysis & simulation of wireless and mobile systems
SLEDGE: Sequential Labeling of Image Edges for Boundary Detection

International Journal of Computer Vision
A novel reinforcement learning architecture for continuous state and action spaces

Advances in Artificial Intelligence
Efficient batch processing of proximity queries by optimized probing

Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Backward Q-learning: The combination of Sarsa algorithm and Q-learning

Engineering Applications of Artificial Intelligence
Reinforcement learning in robotics: A survey

International Journal of Robotics Research
Analysis of cross-price effects on markdown policies by using function approximation techniques

Knowledge-Based Systems
Efficient learning in linearly solvable MDP models

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Reinforcement learning models for scheduling in wireless networks

Frontiers of Computer Science: Selected Publications from Chinese Universities
The use of partially converged simulations in building surrogate models

Advances in Engineering Software
Reinforcement learning based routing in wireless mesh networks

Wireless Networks
A comparative evaluation of multi-objective exploration algorithms for high-level design

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Review: A survey on intelligent routing protocols in wireless sensor networks

Journal of Network and Computer Applications
Construction of approximation spaces for reinforcement learning

The Journal of Machine Learning Research
Hybrid motion graph for character motion synthesis

Journal of Visual Languages and Computing
Review: Cloud computing service composition: A systematic literature review

Expert Systems with Applications: An International Journal
Goal-oriented behavior sequence generation based on semantic commands using multiple timescales recurrent neural network with initial state correction

Neurocomputing
Object search by manipulation

Autonomous Robots
Active Rare Class Discovery and Classification Using Dirichlet Processes

International Journal of Computer Vision
Reinforcement Learning for Multiple Access Control in Wireless Sensor Networks: Review, Model, and Open Issues

Wireless Personal Communications: An International Journal
MineralMiner: An active sensing simulation environment

Multiagent and Grid Systems
A reinforcement learning based solution for cognitive network cooperation between co-located, heterogeneous wireless sensor networks

Ad Hoc Networks
A game theoretic approach to swarm robotics

Applied Bionics and Biomechanics
A multi-agent control architecture for a robotic wheelchair

Applied Bionics and Biomechanics

Quantified Score

Hi-index	0.02

Visualization

Abstract

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word "reinforcement." The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.