Technical Note: \cal Q-Learning

Authors:
Christopher J. C. H. Watkins;Peter Dayan
Affiliations:
25b Framfield Road, Highbury, London N5 1UU, England;Centre for Cognitive Science, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9EH, Scotland
Venue:
Machine Learning
Year:
1992

Citing 0
Cited 648

An optimal-control application of two paradigms of on-line learning

COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Markov decision processes in large state spaces

COLT '95 Proceedings of the eighth annual conference on Computational learning theory
Learning curve bounds for a Markov decision process with undiscounted rewards

COLT '96 Proceedings of the ninth annual conference on Computational learning theory
Shifting Inductive Bias with Success-Story Algorithm, AdaptiveLevin Search, and Incremental Self-Improvement

Machine Learning - Special issue on inductive transfer
Explanation-Based Learning and Reinforcement Learning: A Unified View

Machine Learning
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Bayesian Q-learning

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Tree based discretization for continuous state space reinforcement learning

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot

Machine Learning - Special issue on learning in autonomous robots
An adaptive agent bidding strategy based on stochastic modeling

Proceedings of the third annual conference on Autonomous Agents
Fast Online Q(λ)

Machine Learning
Colearning in Differential Games

Machine Learning
Learning to Take Actions

Machine Learning
Efficient exploration for optimizing immediate reward

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Exploration of Multi-State Environments: Local Measures and Back-Propagation of Uncertainty

Machine Learning
A Nonlinear Noise-Shaping Delta-Sigma Modulator with On-Chip Reinforcement Learning^{*}

Analog Integrated Circuits and Signal Processing - Special issue on Learning on Silicon
Multi-agent reinforcement learning for planning and conflict resolution in a dynamic domain

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Adaptivity in agent-based routing for data networks

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Distributed reinforcement learning for a traffic engineering application

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
Learning sequences of actions in collectives of autonomous agents

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
An approach to the analysis and design of multiagent systems based on interaction frames

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
Designing agent collectives for systems with markovian dynamics

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Construction of a learning agent handling its rewards according to environmental situations

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Integrated learning for interactive synthetic characters

Proceedings of the 29th annual conference on Computer graphics and interactive techniques
Machine learning and inductive logic programming for multi-agent systems

Mutli-agents systems and applications
Learning Sequences of Compatible Actions Among Agents

Artificial Intelligence Review
Analog VLSI Stochastic Perturbative Learning Architectures

Analog Integrated Circuits and Signal Processing
Printed Circuit Board Design via Organizational-Learning Agents

Applied Intelligence
Reinforcement Learning in the Multi-Robot Domain

Autonomous Robots
Module-Based Reinforcement Learning: Experiments with a Real Robot

Autonomous Robots
Dynamics of a Classical Conditioning Model

Autonomous Robots
Reinforcement Learning Soccer Teams with Incomplete World Models

Autonomous Robots
Hierarchic Social Entropy: An Information Theoretic Measure of Robot Group Diversity

Autonomous Robots
Target Reaching by Using Visual Information and Q-learning Controllers

Autonomous Robots
Making Organizational Learning Operational: Implications from Learning Classifier Systems

Computational & Mathematical Organization Theory
Reinforced Genetic Programming

Genetic Programming and Evolvable Machines
An Integrated Approach of Learning, Planning, and Execution

Journal of Intelligent and Robotic Systems
Relational Reinforcement Learning

Machine Learning
Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks

Machine Learning
Kernel-Based Reinforcement Learning

Machine Learning
Near-Optimal Reinforcement Learning in Polynomial Time

Machine Learning
Learning intelligent behavior in a non-stationary and partially observable environment

Artificial Intelligence Review
Control of exploitation-exploration meta-parameter in reinforcement learning

Neural Networks - Computational models of neuromodulation
Training Reinforcement Neurocontrollers Using the Polytope Algorithm

Neural Processing Letters
The anticipatory classifier system and genetic generalization

Natural Computing: an international journal
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies

Autonomous Agents and Multi-Agent Systems
Pricing in Agent Economies Using Multi-Agent Q-Learning

Autonomous Agents and Multi-Agent Systems
Predicting the Expected Behavior of Agents that Learn About Agents: The CLRI Framework

Autonomous Agents and Multi-Agent Systems
A Framework for Learning in Search-Based Systems

IEEE Transactions on Knowledge and Data Engineering
Learning Optimal Robotic Tasks

IEEE Expert: Intelligent Systems and Their Applications
Optimal control using the transport equation: the Liouville machine

Adaptive Behavior
Learning to play strong poker

Machines that learn to play games
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Electronic Institutions as a Framework for Agents' Negotiation and Mutual Commitment

EPIA '01 Proceedings of the10th Portuguese Conference on Artificial Intelligence on Progress in Artificial Intelligence, Knowledge Extraction, Multi-agent Systems, Logic Programming and Constraint Solving
Module Based Reinforcement Learning: An Application to a Real Robot

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Robot Learning Using Gate-Level Evolvable Hardware

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Vision Based State Space Construction for Learning Mobile Robots in Multi-agent Environments

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
A Framework for Supporting Intelligent Fault and Performance Management for Communication Networks

MMNS '01 Proceedings of the 4th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
Minimizing Transmission Costs through Adaptive Marking in Differentiated Services Networks

MMNS '02 Proceedings of the 5th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
An Integrated On-Line Learning System for Evolving Programmable Logic Array Controllers

PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
A Reinforcement Learning with Condition Reduced Fuzz Rules

SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
Computational Models of the Amygdala and the Orbitofrontal Cortex: A Hierarchical Reinforcement Learning System for Robotic Control

AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Minimax Fuzzy Q-Learning in Cooperative Multi-agent Systems

ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Modular-Fuzzy Cooperation Algorithm for Multi-agent Systems

ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
MUTANT: A Genetic Learning System

AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
Machine Learning and Inductive Logic Programming for Multi-agent Systems

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Reinforcement Learning for Control of Traffic and Access Points in Intelligent Wireless ATM Networks

Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Sequential Strategy for Learning Multi-stage Multi-agent Collaborative Games

ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Market-Based Reinforcement Learning in Partially Observable Worlds

ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Learning Multi-agent Strategies in Multi-stage Collaborative Games

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Lempel-Ziv Coding in Reinforcement Learning

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
An Introduction to Learning Fuzzy Classifier Systems

Learning Classifier Systems, From Foundations to Applications
Fuzzy and Crisp Representations of Real-Valued Input for Learning Classifier Systems

Learning Classifier Systems, From Foundations to Applications
Probability-Enhanced Predictions in the Anticipatory Classifier System

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Rationality of Reward Sharing in Multi-agent Reinforcement Learning

PRIMA '99 Proceedings of the Second Pacific Rim International Workshop on Multi-Agents: Approaches to Intelligent Agents
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer

RoboCup 2001: Robot Soccer World Cup V
VQQL. Applying Vector Quantization to Reinforcement Learning

RoboCup-99: Robot Soccer World Cup III
Open Theoretical Questions in Reinforcement Learning

EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Application of Episodic Q-Learning to a Multi-agent Cooperative Task

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Sequential Decision Making Based on Direct Search

Sequence Learning - Paradigms, Algorithms, and Applications
Communication and Interaction with Learning Agents in Virtual Soccer

VW '00 Proceedings of the Second International Conference on Virtual Worlds
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning

ATAL '00 Proceedings of the 7th International Workshop on Intelligent Agents VII. Agent Theories Architectures and Languages
Implicit Negotiation in Repeated Games

ATAL '01 Revised Papers from the 8th International Workshop on Intelligent Agents VIII
Learning Rates for Q-Learning

COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
PAC Bounds for Multi-armed Bandit and Markov Decision Processes

COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning

IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Sequential cost-sensitive decision making with reinforcement learning

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of the self-organising map to reinforcement learning

Neural Networks - New developments in self-organizing maps
The design of collectives of agents to control non-Markovian systems

Eighteenth national conference on Artificial intelligence
D-Learning: what learning in dogs tells us about building characters that learn what they ought to learn

Exploring artificial intelligence in the new millennium
Social learning mechanisms compared in a simple environment

ICAL 2003 Proceedings of the eighth international conference on Artificial life
Towards a pareto-optimal solution in general-sum games

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A selection-mutation model for q-learning in multi-agent systems

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Advice-exchange in heterogeneous groups of learning agents

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A Taxonomy for artificial embryogeny

Artificial Life
The Ant Colony Optimization paradigm for combinatorial optimization

Advances in evolutionary computing
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
Reinforcement learning based on local state feature learning and policy adjustment

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Introduction to multimedia and mobile agents
On the convergence of optimistic policy iteration

The Journal of Machine Learning Research
ε-mdps: learning in varying environments

The Journal of Machine Learning Research
Adaptive Radial Basis Decomposition by Learning Vector Quantization

Neural Processing Letters
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Distributed Reinforcement Learning Control for Batch Sequencing and Sizing in Just-In-Time Manufacturing Systems

Applied Intelligence
CONVERGENCE OF SIMULATION-BASED POLICY ITERATION

Probability in the Engineering and Informational Sciences
A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL

Probability in the Engineering and Informational Sciences
Call admission control in cellular networks: a reinforcement learning solution

International Journal of Network Management
An experimental evaluation of reinforcement learning for gain scheduling

Design and application of hybrid intelligent systems
Employing OLAP mining for multiagent reinforcement learning

Design and application of hybrid intelligent systems
Learning Rates for Q-learning

The Journal of Machine Learning Research
Dynamic bipedal walking assisted by learning

Robotica
Dynamic abstraction in reinforcement learning via clustering

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Sparse cooperative Q-learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate

Web Intelligence and Agent Systems
Incremental heuristic search in AI

AI Magazine
Best-Response Multiagent Learning in Non-Stationary Environments

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Run the GAMUT: A Comprehensive Approach to Evaluating Game-Theoretic Algorithms

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Unifying Temporal and Structural Credit Assignment Problems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
An Architecture for Persistent Reactive Behavior

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning from Multiple Sources

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Planning, learning and coordination in multiagent decision processes

TARK '96 Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge
No Pervasive Computing without Intelligent Systems

BT Technology Journal
Building agents to serve customers

AI Magazine
Reliability of internal prediction/estimation and its application: I. adaptive action selection reflecting reliability of value function

Neural Networks
Basic Ideas for Event-Based Optimization of Markov Systems

Discrete Event Dynamic Systems
Learning and Exploiting Relative Weaknesses of Opponent Agents

Autonomous Agents and Multi-Agent Systems
Coordinating Multiple Agents via Reinforcement Learning

Autonomous Agents and Multi-Agent Systems
Toward Open Negotiation

IEEE Internet Computing
Teaching robots to plan through Q-learning

Robotica
An Architecture for Behavior-Based Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

Proceedings of the 2005 ACM symposium on Applied computing
Neighboring crossover to improve GA-based Q-learning method for multi-legged robot control

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
XCS with eligibility traces

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Optimal Control Using the Transport Equation: The Liouville Machine

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Q-learning of sequential attention for visual object recognition from informative local descriptors

ICML '05 Proceedings of the 22nd international conference on Machine learning
Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Online Evolution for a Self-Adapting Robotic Navigation System Using Evolvable Hardware

Artificial Life
A middleware for autonomic QoS management based on learning

SEM '05 Proceedings of the 5th international workshop on Software engineering and middleware
Local Reinforcement and Recombination in Classifier Systems

Evolutionary Computation
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games

Autonomous Agents and Multi-Agent Systems
Adaptive dialogue systems - interaction with interact

SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms

Neural Computation
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms

Neural Computation
PAC model-free reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Efficient QoS provisioning for adaptive multimedia in mobile communication networks by reinforcement learning

Mobile Networks and Applications - Special issue: Recent advances in wireless networking
Small-scale peer-to-peer overlays

ACM SIGOPS Operating Systems Review
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
A hierarchical approach to efficient reinforcement learning in deterministic domains

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Adaptive distributed resource allocation and diagnostics using cooperative information-sharing strategies

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to commit in repeated games

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
A comprehensive review of nature inspired routing algorithms for fixed telecommunication networks

Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Nature-inspired applications and systems
A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies

The Knowledge Engineering Review
The asymptotic equipartition property in reinforcement learning and its relation to return maximization

Neural Networks
A reinforcement learning approach to active camera foveation

Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
Quantum robot: structure, algorithms and applications

Robotica
Combining expert advice in reactive environments

Journal of the ACM (JACM)
Optimal Joint Session Admission Control in Integrated WLAN and CDMA Cellular Networks with Vertical Handoff

IEEE Transactions on Mobile Computing
Cooperative transportation system for humanoid robots using simulation-based learning

Applied Soft Computing
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
RLDDE: A novel reinforcement learning-based dimension and delay estimator for neural networks in time series prediction

Neurocomputing
Cooperative Transportation by Humanoid Robots - Solving Piano Movers' Problem

International Journal of Hybrid Intelligent Systems
Using multi-agent systems for learning optimal policies for complex problems

ACM-SE 45 Proceedings of the 45th annual southeast regional conference
A proposal of the learning system using the recordable multi-layer type rule base and its application for the fire panic problem

Proceedings of the 2006 international conference on Game research and development
A general criterion and an algorithmic framework for learning in multi-agent systems

Machine Learning
Temporal pattern identification using spike-timing dependent plasticity

Neurocomputing
Allocating time and location information to activity-travel patterns through reinforcement learning

Knowledge-Based Systems
If multi-agent learning is the answer, what is the question?

Artificial Intelligence
Collaborative Multiagent Reinforcement Learning by Payoff Propagation

The Journal of Machine Learning Research
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents

dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
The Multi-Agent Data Collection in HLA-Based Simulation System

Proceedings of the 21st International Workshop on Principles of Advanced and Distributed Simulation
Modeling embodied visual behaviors

ACM Transactions on Applied Perception (TAP)
On developmental mental architectures

Neurocomputing
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension

Evolutionary Computation
An Action-Selection Calculus

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of SONQL for real-time learning of robot behaviors

Robotics and Autonomous Systems
Learning with “Relevance”: Using a Third Factor to Stabilize Hebbian Learning

Neural Computation
A reinforcement agent for threshold fusion

Applied Soft Computing
Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Neural Computation
Adaptive stepsize selection for tracking in a regime-switching environment

Automatica (Journal of IFAC)
Reinforcement learning in multi-agent environment and ant colony for packet scheduling in routers

Proceedings of the 5th ACM international workshop on Mobility management and wireless access
Application of reinforcement learning to the game of Othello

Computers and Operations Research
Modeling motivation for adaptive nonplayer characters in dynamic computer game worlds

Computers in Entertainment (CIE) - Theoretical and Practical Computer Applications in Entertainment
Classifier fitness based on accuracy

Evolutionary Computation
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Batch reinforcement learning in a complex domain

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed path planning for mobile robots using a swarm of interacting reinforcement learners

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Learning the meaning of action commands based on "no news is good news" criterion

Proceedings of the 2007 workshop on Multimodal interfaces in semantic interaction
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence
Learning how to combine sensory-motor functions into a robust behavior

Artificial Intelligence
A novel framework for automatic generation of fuzzy neural networks

Neurocomputing
Teachable robots: Understanding human teaching behavior to build more effective robot learners

Artificial Intelligence
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

International Journal of Robotics Research
A Policy-based Approach for Reconfiguration Management and Enforcement in Autonomic Communication Systems

Wireless Personal Communications: An International Journal
On the convergence of stochastic iterative dynamic programming algorithms

Neural Computation
Improving generalization for temporal difference learning: The successor representation

Neural Computation
Biologically-inspired adaptive learning control strategies: A rough set approach

International Journal of Hybrid Intelligent Systems
Knowledge propagation in a distributed omnidirectional vision system

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Marco Somalvico Memorial Issue
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

International Journal of Knowledge-based and Intelligent Engineering Systems
Recursive least squares and quadratic prediction in continuous multistep problems

Proceedings of the 10th annual conference companion on Genetic and evolutionary computation
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

The Journal of Machine Learning Research
Accelerated Neural Evolution through Cooperatively Coevolved Synapses

The Journal of Machine Learning Research
Learning Agents in an Artificial Power Exchange: Tacit Collusion, Market Power and Efficiency of Two Double-auction Mechanisms

Computational Economics
Tuning continual exploration in reinforcement learning: An optimality property of the Boltzmann strategy

Neurocomputing
Norm emergence under constrained interactions in diverse societies

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Expediting RL by using graphical structures

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Adaptive Kanerva-based function approximation for multi-agent systems

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Reinforcement learning for problems with symmetrical restricted states

Robotics and Autonomous Systems
Adaptiveness in Agent Communication: Application and Adaptation of Conversation Patterns

Agent Communication II
Reinforcement Learning in Fine Time Discretization

ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
State Space Partition for Reinforcement Learning Based on Fuzzy Min-Max Neural Network

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
A Novel Method of Constructing ANN

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Competition and Coordination in Stochastic Games

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
From Actions to Goals and Vice-Versa: Theoretical Analysis and Models of the Ideomotor Principle and TOTE

Anticipatory Behavior in Adaptive Learning Systems
Combining the Best of the Two Worlds: Inheritance Versus Experience

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs

ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Making a Robot Learn to Play Soccer Using Reward and Punishment

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Perception and Developmental Learning of Affordances in Autonomous Robots

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Model-Based Reinforcement Learning in a Complex Domain

RoboCup 2007: Robot Soccer World Cup XI
A Design of Reward Function Based on Knowledge in Multi-agent Learning

ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Route Optimization Using Q-Learning for On-Demand Bus Systems

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Rational Bidding Using Reinforcement Learning

GECON '08 Proceedings of the 5th international workshop on Grid Economics and Business Models
State-Dependent Exploration for Policy Gradient Methods

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Spatial Abstraction: Aspectualization, Coarsening, and Conceptual Classification

Proceedings of the international conference on Spatial Cognition VI: Learning, Reasoning, and Talking about Space
Robot Navigation Based on Fuzzy RL Algorithm

ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
Applying Reinforcement Learning to Multi-robot Team Coordination

HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Meta-case-based reasoning: self-improvement through self-understanding

Journal of Experimental & Theoretical Artificial Intelligence
An online multi-agent co-operative learning algorithm in POMDPs

Journal of Experimental & Theoretical Artificial Intelligence
Service diffusion in the market considering consumers' subjective value

CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Experimental analysis of eligibility traces strategies in temporal difference learning

International Journal of Knowledge Engineering and Soft Data Paradigms
Reinforcement Learning on a Futures Market Simulator

KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Semantic Relatedness Measure Using Object Properties in an Ontology

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Designing Toys That Come Alive: Curious Robots for Creative Play

ICEC '08 Proceedings of the 7th International Conference on Entertainment Computing
Q-Strategy: A Bidding Strategy for Market-Based Allocation of Grid Services

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
Reinforcement Learning for Decision Making in Sequential Visual Attention

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets

Recent Advances in Reinforcement Learning
A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Learning-Rate Adjusting Q-Learning for Prisoner's Dilemma Games

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Hybridizing evolutionary computation and reinforcement learning for the design of almost universal controllers for autonomous robots

Neurocomputing
A novel Artificial Neural Network training method combined with Quantum Computational Multi-Agent System theory

International Journal of Intelligent Systems Technologies and Applications
A robot that learns in stages utilizing scaffolds: toward an active and long-term human-robot interaction

ACE '08 Proceedings of the 2008 International Conference on Advances in Computer Entertainment Technology
Effects of chaotic exploration on reinforcement learning in target capturing task

International Journal of Knowledge-based and Intelligent Engineering Systems
Opportunities for multiagent systems and multiagent reinforcement learning in traffic control

Autonomous Agents and Multi-Agent Systems
Measurement of Underlying Cooperation in Multiagent Reinforcement Learning

PRIMA '08 Proceedings of the 11th Pacific Rim International Conference on Multi-Agents: Intelligent Agents and Multi-Agent Systems
A learning classifier system for mazes with aliasing clones

Natural Computing: an international journal
Imitation guided learning in learning classifier systems

Natural Computing: an international journal
Negotiation Model Supporting Co-Allocation for Grid Scheduling

GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
Reinforcement-learning agents with different temperature parameters explain the variety of human action-selection behavior in a Markov decision process task

Neurocomputing
QL2, a simple reinforcement learning scheme for two-player zero-sum Markov games

Neurocomputing
Integrated cognitive architectures: a survey

Artificial Intelligence Review
Reinforcement distribution in fuzzy Q-learning

Fuzzy Sets and Systems
Modeling reinforcement learning algorithms for performance analysis

Proceedings of the International Conference on Advances in Computing, Communication and Control
Boosting the performance of computing systems through adaptive configuration tuning

Proceedings of the 2009 ACM symposium on Applied Computing
Basal Ganglia Models for Autonomous Behavior Learning

Creating Brain-Like Intelligence
An Optimal Approximate Dynamic Programming Algorithm for the Lagged Asset Acquisition Problem

Mathematics of Operations Research
An autonomic architecture for optimizing QoE in multimedia access networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
A policy-based framework for autonomic reconfiguration management in heterogeneous networks

Proceedings of the 7th International Conference on Mobile and Ubiquitous Multimedia
Static strategy and dynamic adjustment: An effective method for Grid task scheduling

Future Generation Computer Systems
Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning

Proceedings of the 18th ACM international symposium on High performance distributed computing
A new marketing strategy map for direct marketing

Knowledge-Based Systems
Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Generalized model learning for reinforcement learning in factored domains

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An empirical analysis of value function-based and policy search reinforcement learning

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multiagent learning in large anonymous games

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Comparing trust mechanisms for monitoring aggregator nodes in sensor networks

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Opponent Modeling in Adversarial Environments through Learning Ingenuity

Proceedings of the 2005 conference on Self-Organization and Autonomic Informatics (I)
Fast Learning in an Actor-Critic Architecture with Reward and Punishment

Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Reinforcement Learning with Classifier Selection for Focused Crawling

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games

KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Reinforcement learning for robot soccer

Autonomous Robots
EDA-RL: estimation of distribution algorithms for reinforcement learning problems

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Learning in the time-dependent minority game

Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
TEMMAS: The Electricity Market Multi-Agent Simulator

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
The kNN-TD Reinforcement Learning Algorithm

IWINAC '09 Proceedings of the 3rd International Work-Conference on The Interplay Between Natural and Artificial Computation: Part I: Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira's Scientific Legacy
Multi-agent Reinforcement Learning in Network Management

AIMS '09 Proceedings of the 3rd International Conference on Autonomous Infrastructure, Management and Security: Scalability of Networks and Services
A q-learning based adaptive bidding strategy in combinatorial auctions

Proceedings of the 11th International Conference on Electronic Commerce
QUICR-learning for multi-agent coordination

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Hard constrained semi-Markov decision processes

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Motion-based autonomous grounding: inferring external world properties from encoded internal sensory states alone

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Harnessing migrations in a market-based grid OS

GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
Markov Decision Processes with Arbitrary Reward Processes

Mathematics of Operations Research
Prediction of solar conditions by emotional learning

Intelligent Data Analysis
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
Agent-based simulation of electricity markets: a survey of tools

Artificial Intelligence Review
Machine learning in digital games: a survey

Artificial Intelligence Review
A DR algorithm based on artificial potential field method

Multimedia Tools and Applications
Parallel Algorithms for Solving Markov Decision Process

ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
Profit sharing auction

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Potential-based shaping in model-based reinforcement learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Hierarchical reinforcement learning with the MAXQ value function decomposition

Journal of Artificial Intelligence Research
Accelerating reinforcement learning by composing solutions of automatically identified subtasks

Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox

Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
Reinforcement learning for agents with many sensors and actuators acting in categorizable environments

Journal of Artificial Intelligence Research
Risk-sensitive reinforcement learning applied to control under constraints

Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems

Journal of Artificial Intelligence Research
Behavior bounding: an efficient method for high-level behavior comparison

Journal of Artificial Intelligence Research
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
On partially controlled multi-agent systems

Journal of Artificial Intelligence Research
Dynamic non-Bayesian decision making

Journal of Artificial Intelligence Research
Truncating temporal differences: on the efficient implementation of TD (λ) for reinforcement learning

Journal of Artificial Intelligence Research
Reinforcement algorithms using functional approximation for generalization and their application to cart centering and fractal compression

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Direct code access in self-organizing neural networks for reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Emergence of norms through social learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Qualitative map learning based on co-visibility of objects

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Topology and Memory Effect on Convention Emergence

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Multi-agent based modeling of liver detoxification

SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques

Robotics and Autonomous Systems
Robot weightlifting by direct policy search

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning against opponents with bounded memory

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Learning to act using real-time dynamic programming

Artificial Intelligence
Effective learning in the presence of adaptive counterparts

Journal of Algorithms
Neuroevolution strategies for episodic reinforcement learning

Journal of Algorithms
Assured end-to-end QoS through adaptive marking in multi-domain differentiated services networks

Computer Communications
Reinforcement learning-based dynamic bandwidth provisioning for quality of service in differentiated services networks

Computer Communications
Medical QoS provision based on reinforcement learning in ultrasound streaming over 3.5G wireless systems

IEEE Journal on Selected Areas in Communications - Special issue on wireless and pervasive communications for healthcare
A reward field model generation in Q-learning by dynamic programming

Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human
Ant colony optimization incorporated with fuzzy Q-learning for reinforcement fuzzy control

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
A Q-learning approach to derive optimal consumption and investment strategies

IEEE Transactions on Neural Networks
Interaction, observance or both? Study of the effects on convention emergence

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Route optimisation using evolutionary approaches for on-demand pickup problem

International Journal of Advanced Intelligence Paradigms
Reinforcement learning and adaptive dynamic programming for feedback control

IEEE Circuits and Systems Magazine
Interaction, observance or both? Study of the effects on convention emergence

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Hybridization of cognitive models using evolutionary strategies

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
A Model of Neuronal Specialization Using Hebbian Policy-Gradient with "Slow" Noise

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
A Swarm-Based Learning Method Inspired by Social Insects

ICIC '07 Proceedings of the 3rd International Conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence
Swarm Reinforcement Learning Algorithm Based on Particle Swarm Optimization Whose Personal Bests Have Lifespans

ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
A Q-learning model-independent flow controller for high-speed networks

ACC'09 Proceedings of the 2009 conference on American Control Conference
Nash Q-learning multi-agent flow control for high-speed networks

ACC'09 Proceedings of the 2009 conference on American Control Conference
Transfer of knowledge for a climbing virtual human: a reinforcement learning approach

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
On the asymptotic equivalence between differential Hebbian and temporal difference learning

Neural Computation
Associating domain-dependent knowledge and Monte Carlo approaches within a Go program

Information Sciences: an International Journal
Hybrid Q-learning algorithm about cooperation in MAS

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
An adaptive inventory control for a supply chain

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Fuzzy Q-learning in a nondeterministic environment: developing an intelligent Ms. Pac-Man agent

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Reinforcement interval type-2 fuzzy controller design by online rule generation and Q-value-aided ant colony optimization

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Autonomous development of vergence control driven by disparity energy neuron populations

Neural Computation
Real-valued Q-learning in multi-agent cooperation

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Implementation of fuzzy Q-learning based on modular fuzzy model and parallel structured learning

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
The improvement of Q-learning applied to imperfect information game

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Hardware design of autonomous snake-like robot for reinforcement learning based on environment: discussion of versatility on different tasks

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Design of semi-decentralized control laws for distributed-air-jet micromanipulators by reinforcement learning

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
The hesitation of a robot: a delay in its motion increases learning efficiency and impresses humans as teachable

Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
A new mobile robot navigation method using fuzzy logic and a modified Q-learning algorithm

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Feature Article---Merging AI and OR to Solve High-Dimensional Stochastic Optimization Problems Using Approximate Dynamic Programming

INFORMS Journal on Computing
Truncated fourier series formulation for bipedal walking balance control

Robotica
Online reinforcement learning for dynamic multimedia systems

IEEE Transactions on Image Processing
Online adaptive policies for ensemble classifiers

Neurocomputing
A new approach to fuzzy classifier systems and its application in self-generating neuro-fuzzy systems

Neurocomputing
Encoding robotic sensor states for Q-learning using the self-organizing map

Journal of Computing Sciences in Colleges
A self-organizing neural architecture integrating desire, intention and reinforcement learning

Neurocomputing
Review article: Synergizing reinforcement learning and game theory-A new direction for control

Applied Soft Computing
Reinforcement Learning in Finite MDPs: PAC Analysis

The Journal of Machine Learning Research
Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Constructing an autonomous agent with an interdependent heuristics

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Q-learning with linear function approximation

COLT'07 Proceedings of the 20th annual conference on Learning theory
Context aware life pattern prediction using fuzzy-state Q-learning

ICOST'07 Proceedings of the 5th international conference on Smart homes and health telematics
Reinforcement learning scheme for grouping and anti-predator behavior

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Virtual markets: Q-learning sellers with simple state representation

AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Online learning of task-driven object-based visual attention control

Image and Vision Computing
Skill combination for reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
A novel ANN model based on quantum computational MAS theory

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Two-layer networked learning control of a nonlinear HVAC system

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Can agents acquire human-like behaviors in a sequential bargaining game?: comparison of Roth's and Q-learning agents

MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
A k-NN based perception scheme for reinforcement learning

EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
Reinforcement learning of predictive features in affordance perception

Proceedings of the 2006 international conference on Towards affordance-based robot control
Approximate Dynamic Programming for Ambulance Redeployment

INFORMS Journal on Computing
A state-cluster based Q-learning

ICNC'09 Proceedings of the 5th international conference on Natural computation
Study on traffic signal control based on Q-learning

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 3
Cooperative learning using advice exchange

Adaptive agents and multi-agent systems
Multiagent learning for open systems: a study in opponent classification

Adaptive agents and multi-agent systems
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks

Information Sciences: an International Journal
Applying reinforcement learning to scheduling strategies in an actual grid environment

International Journal of High Performance Systems Architecture
A study on hierarchical modular reinforcement learning for multi-agent pursuit problem based on relative coordinate states

CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
A cat-like robot real-time learning to run

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Joint path and wavelength selection using Q-learning in optical burst switching networks

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Application of reinforcement learning to autonomous heading control for bionic underwater robots

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
A novel method for strategy acquisition and its application to a double-auction market game

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

IEEE Journal on Selected Areas in Communications
Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Autonomous Agents and Multi-Agent Systems
Evolutionary mechanism design: a review

Autonomous Agents and Multi-Agent Systems
Automated bidding in computational markets: an application in market-based allocation of computing services

Autonomous Agents and Multi-Agent Systems
Multi-task evolutionary shaping without pre-specified representations

Proceedings of the 12th annual conference on Genetic and evolutionary computation
An activation reinforcement based classifier system for balancing generalisation and specialisation (ARCS)

Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Decentralized Q-learning for aggregated interference control in completely and partially observable cognitive radio networks

CCNC'10 Proceedings of the 7th IEEE conference on Consumer communications and networking conference
High-level reinforcement learning in strategy games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Frequency adjusted multi-agent Q-learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Planning against fictitious players in repeated normal form games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Extending adaptive fuzzy behavior hierarchies to multiple levels of composite behaviors

Robotics and Autonomous Systems
Modeling Behavior Cycles as a Value System for Developmental Robots

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

ACM Transactions on Modeling and Computer Simulation (TOMACS)
Model-free control based on reinforcement learning for a wastewater treatment problem

Applied Soft Computing
Learning to follow navigational directions

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Computing optimal policies for partially observable decision processes using compact representations

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
From cognition to docition: The teaching radio paradigm for distributed & autonomous deployments

Computer Communications
Towards modeling the behavior of physical intruders in a region monitored by a wireless sensor network

Proceedings of the 3rd ACM workshop on Artificial intelligence and security
Docitive networks: an emerging paradigm for dynamic spectrum management

IEEE Wireless Communications
A Human-Robot Collaborative Reinforcement Learning Algorithm

Journal of Intelligent and Robotic Systems
Rule acquisition for cognitive agents by using estimation of distribution algorithms

International Journal of Knowledge Engineering and Soft Data Paradigms
Multi-goal Q-learning of cooperative teams

Expert Systems with Applications: An International Journal
Multi-policy optimization in self-organizing systems

SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Why and how hippocampal transition cells can be used in reinforcement learning

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Reinforcement learning scheme for grouping and characterization of multi-agent network

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Effects of social network topology and options on norm emergence

COIN'09 Proceedings of the 5th international conference on Coordination, organizations, institutions, and norms in agent systems
Evaluation of techniques for a learning-driven modeling methodology in multiagent simulation

MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evolutionary dynamics of regret minimization

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Towards constraint optimal control of greenhouse climate

LSMS/ICSEE'10 Proceedings of the 2010 international conference on Life system modeling and simulation and intelligent computing, and 2010 international conference on Intelligent computing for sustainable energy and environment: Part III
Exploring continuous action spaces with diffusion trees for reinforcement learning

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging

Journal of Intelligent and Robotic Systems
Auto-exploratory average reward reinforcement learning

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Reinforcement learning based resource allocation in business process management

Data & Knowledge Engineering
Self-learning fuzzy logic controllers for pursuit-evasion differential games

Robotics and Autonomous Systems
To adapt or not to adapt: consequences of adapting driver and traffic light agents

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Continuous-state reinforcement learning with fuzzy approximation

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Bifurcation analysis of reinforcement learning agents in the Selten's horse game

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Solving multi-stage games with hierarchical learning automata that bootstrap

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Web-based multi-agent system architecture in a dynamic environment

International Journal of Knowledge-based and Intelligent Engineering Systems
Adaptation-based programming in java

Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
A consideration of human immunity-based reinforcement learning with continuous states

Artificial Life and Robotics
An information-spectrum approach to analysis of return maximization in reinforcement learning

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Learning active fusion of multiple experts' decisions: An attention-based approach

Neural Computation
Predicting and compensating for lexicon access errors

Proceedings of the 16th international conference on Intelligent user interfaces
Swarm reinforcement learning method based on an actor-critic method

SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Extended Q-learning algorithm for path-planning of a mobile robot

SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Reduct based Q-learning: an introduction

Proceedings of the 2011 International Conference on Communication, Computing & Security
An innovative routing algorithm with reinforcement learning and pattern tree adjustment for wireless sensor networks

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part III
State representation with perceptual constancy based on active motion

ICSR'10 Proceedings of the Second international conference on Social robotics
Multi-level cognitive machine-learning based concept for human-like "artificial" walking: Application to autonomous stroll of humanoid robots

Neurocomputing
Robust high performance reinforcement learning through weighted k-nearest neighbors

Neurocomputing
Information Collection on a Graph

Operations Research
Self-organizing state aggregation for architecture design of Q-learning

Information Sciences: an International Journal
Path selection in disaster response management based on Q-learning

International Journal of Automation and Computing
The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems
Sampled fictitious play for approximate dynamic programming

Computers and Operations Research
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds

ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
Supporting smart interactions with predictive analytics

The smart internet
The implementation of Q-learning for problems in continuous state and action space using SOM-based fuzzy systems

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Knowledge of opposite actions for reinforcement learning

Applied Soft Computing
Supporting smart interactions with predictive analytics

The smart internet
Software agent with reinforcement learning approach for medical image segmentation

Journal of Computer Science and Technology
Ambulance redeployment: an approximate dynamic programming approach

Winter Simulation Conference
Coordination control of greenhouse environmental factors

International Journal of Automation and Computing
Using reinforcement learning for controlling an elastic web application hosting platform

Proceedings of the 8th ACM international conference on Autonomic computing
Towards a real-world scenario for investigating organic computing principles in heterogeneous societies of robots

Proceedings of the 2011 workshop on Organic computing
A dynamic route change mechanism for mobile ad hoc networks

International Journal of Communication Networks and Distributed Systems
Learning chasing behaviours of non-player characters in games using SARSA

EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
An incremental model of lexicon consensus in a population of agents by means of grammatical evolution, reinforcement learning and semantic rules

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
A Multi-State Q-Learning Approach for the Dynamic Load Balancing of Time Warp

PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Learning to manage combined energy supply systems

Proceedings of the 17th IEEE/ACM international symposium on Low-power electronics and design
A Monte-Carlo AIXI approximation

Journal of Artificial Intelligence Research
A probabilistic approach for maintaining trust based on evidence

Journal of Artificial Intelligence Research
Multiagent learning in large anonymous games

Journal of Artificial Intelligence Research
Learning in minority games with multiple resources

ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part II
What kinds of human negotiation skill can be acquired by changing negotiation order of bargaining agents?

HCII'11 Proceedings of the 1st international conference on Human interface and the management of information: interacting with information - Volume Part II
Voting in multi-agent system for improvement of partial observations

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
A distributed reinforcement learning approach for solving optimization problems

CIT'11 Proceedings of the 5th WSEAS international conference on Communications and information technology
Evolving subjective utilities: Prisoner's Dilemma game examples

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Comparing humans and AI agents

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
How to playwell in non-zero sum games: some lessons from generalized traveler's dilemma

AMT'11 Proceedings of the 7th international conference on Active media technology
Preference-based policy iteration: leveraging preference learning for reinforcement learning

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Empirical and theoretical support for lenient learning

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
The evolution of rules for conflicts resolution in self-organizing teams

Expert Systems with Applications: An International Journal
Reinforcement learning control with adaptive gain for a Saccharomyces cerevisiae fermentation process

Applied Soft Computing
SD-Q: selective discount Q learning based on new results of intertemporal choice theory

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Principled methods for biasing reinforcement learning agents

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
An information-theoretic analysis of return maximization in reinforcement learning

Neural Networks
Social welfare for automatic innovation

MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
An artificial market for efficient allocation of road transport networks

MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Value-difference based exploration: adaptive control between epsilon-greedy and softmax

KI'11 Proceedings of the 34th Annual German conference on Advances in artificial intelligence
Obstacle avoidance of redundant manipulators using neural networks based reinforcement learning

Robotics and Computer-Integrated Manufacturing
Overcoming Omniscience in Axelrod's Model

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
A self-adaptive routing paradigm for wireless mesh networks based on reinforcement learning

Proceedings of the 14th ACM international conference on Modeling, analysis and simulation of wireless and mobile systems
Correlated action effects in decision theoretic regression

UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Evaluating a reinforcement learning algorithm with a general intelligence test

CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Quantum reinforcement learning

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
An adaptive approach for the exploration-exploitation dilemma for learning agents

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
Teachable characters: user studies, design principles, and learning performance

IVA'06 Proceedings of the 6th international conference on Intelligent Virtual Agents
Efficient non-linear control through neuroevolution

ECML'06 Proceedings of the 17th European conference on Machine Learning
Using meta-level control with reinforcement learning to improve the performance of the agents

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Testing probabilistic equivalence through reinforcement learning

FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Experience cooperative sharing in cross-layer cognitive radio for real-time multimedia communication

Proceedings of the 4th International Conference on Cognitive Radio and Advanced Spectrum Management
Opponent learning for multi-agent system simulation

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Ensemble pruning using reinforcement learning

SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Toward guidelines for modeling learning agents in multiagent-based simulation: implications from Q-learning and sarsa agents

MABS'04 Proceedings of the 2004 international conference on Multi-Agent and Multi-Agent-Based Simulation
A general framework for analyzing the optimal call admission control in DS-CDMA cellular network

ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part II
A machine learning approach to intraday trading on foreign exchange markets

IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Institutionalization through reciprocal habitualization and typification

WRAC'05 Proceedings of the Second international conference on Radical Agent Concepts: innovative Concepts for Autonomic and Agent-Based Systems
Reinforcement learning based sensing policy optimization for energy efficient cognitive radio networks

Neurocomputing
Learning-based ship design optimization approach

Computer-Aided Design
Kernel-Based reinforcement learning

ICIC'06 Proceedings of the 2006 international conference on Intelligent Computing - Volume Part I
A hybrid learning strategy for discovery of policies of action

IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
AlchemistJ: a framework for self-adaptive software

EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Grey reinforcement learning for incomplete information processing

TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Self-organizing neural architecture for reinforcement learning

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Q learning based on self-organizing fuzzy radial basis function network

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Selecting actions for resource-bounded information extraction using reinforcement learning

Proceedings of the fifth ACM international conference on Web search and data mining
A hybrid cognitive/reactive intelligent agent autonomous path planning technique in a networked-distributed unstructured environment for reinforcement learning

The Journal of Supercomputing
Optimal tuning of continual online exploration in reinforcement learning

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Reward function and initial values: better choices for accelerated goal-directed reinforcement learning

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
An adaptive strategy for energy-efficient data collection in sparse wireless sensor networks

EWSN'10 Proceedings of the 7th European conference on Wireless Sensor Networks
Self-organized and evolvable cognitive architecture for intelligent agents and multi-agent systems

EvoApplicatons'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part I
Machine learning of plan robustness knowledge about instances

ECML'05 Proceedings of the 16th European conference on Machine Learning
Communication, diversity and learning: cornerstones of swarm behavior

SAB'04 Proceedings of the 2004 international conference on Swarm Robotics
S2A: secure smart household appliances

Proceedings of the second ACM conference on Data and Application Security and Privacy
Perception-Action based object detection from local descriptor combination and reinforcement learning

SCIA'05 Proceedings of the 14th Scandinavian conference on Image Analysis
Multiobjective water pinch analysis of the cuernavaca city water distribution network

EMO'05 Proceedings of the Third international conference on Evolutionary Multi-Criterion Optimization
Reinforcement learning using a grid based function approximator

Biomimetic Neural Learning for Intelligent Robots
Cost integration in multi-step viewpoint selection for object recognition

MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Multiagent association rules mining in cooperative learning systems

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Optimal motion planning by reinforcement learning in autonomous mobile vehicles

Robotica
K-Shortest paths q-routing: a new QoS routing algorithm in telecommunication networks

ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
Reduced-State SARSA featuring extended channel reassignment for dynamic channel allocation in mobile cellular networks

ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
Reinforcement learning by chaotic exploration generator in target capturing task

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning

Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Generating inspiration for agent design by reinforcement learning

Information and Software Technology
Aggressive joint access and backhaul design for distributed-cognition 1gbps/km2 system architecture

WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Multi-agent case-based reasoning for cooperative reinforcement learners

ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
Emergence of flocking behavior based on reinforcement learning

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Dialog strategy acquisition and its evaluation for efficient learning of word meanings by agents

EELC'06 Proceedings of the Third international conference on Emergence and Evolution of Linguistic Communication: symbol Grounding and Beyond
Learning automata-based approach to learn dialogue policies in large state space

International Journal of Intelligent Information and Database Systems
On the organisation of agent experience: scaling up social cognition

Socionics
Trace equivalence characterization through reinforcement learning

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Mobile p2p automatic content sharing by ontology-based and contextualized integrative negotiation

DEECS'06 Proceedings of the Second international conference on Data Engineering Issues in E-Commerce and Services
Improvement of air handling unit control performance using reinforcement learning

PKAW'06 Proceedings of the 9th Pacific Rim Knowledge Acquisition international conference on Advances in Knowledge Acquisition and Management
Efficient deep web crawling using reinforcement learning

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
A reinforcement learning approach for the flexible job shop scheduling problem

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Aspects of active norm learning and the effect of lying on norm emergence in agent societies

PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
Reinforcement distribution in continuous state action space fuzzy Q–learning: a novel approach

WILF'05 Proceedings of the 6th international conference on Fuzzy Logic and Applications
Learning pareto-optimal solutions in 2x2 conflict games

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
An adaptive approach for the exploration-exploitation dilemma and its application to economic systems

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Self-Organizing reinforcement learning model

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems

Knowledge-Based Systems
Non-intrusive policy optimization for dependable and adaptive service-oriented systems

Proceedings of the 27th Annual ACM Symposium on Applied Computing
A time-constrained SLA negotiation strategy in competitive computational grids

Future Generation Computer Systems
Strategy learning for autonomous agents in smart grid markets

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
A model of attentional impairments in autism: first steps toward a computational theory

Cognitive Systems Research
Accelerating evolution via egalitarian social learning

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Robustness of optimal channel reservation using handover prediction in multiservice wireless networks

Wireless Networks
Value function approximation through sparse bayesian modeling

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Automatic construction of temporally extended actions for MDPs using bisimulation metrics

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Compound reinforcement learning: theory and an application to finance

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Proposal and evaluation of the active course classification support system with exploitation-oriented learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reputation-Aware learning for SLA negotiation

IFIP'12 Proceedings of the 2012 international conference on Networking
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
V-MAX: tempered optimism for better PAC reinforcement learning

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Adaptive agents on evolving networks

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Task allocation in mesh structure: 2side leapfrog algorithm and q-learning based algorithm

ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IV
GRiDA: GReen Distributed Algorithm for energy-efficient IP backbone networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Selecting vision operators and fixing their optimal parameters values using reinforcement learning

ICISP'12 Proceedings of the 5th international conference on Image and Signal Processing
Multiagent learning through neuroevolution

WCCI'12 Proceedings of the 2012 World Congress conference on Advances in Computational Intelligence
A modular hierarchical reinforcement learning algorithm

ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Overhead-Controlled routing in WSNs with reinforcement learning

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
Learning Classification Programs: The Genetic Algorithm Approach

Fundamenta Informaticae
Towards a Multiple-Lookahead-Levels agent reinforcement-learning technique and its implementation in integrated circuits

The Journal of Supercomputing
Managing Femto to Macro Interference without X2 Interface Support through POMDP

Mobile Networks and Applications
Machine learning in agent-based stochastic simulation: Inferential theory and evaluation in transportation logistics

Computers & Mathematics with Applications
Learning to achieve socially optimal solutions in general-sum games

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Distributed learning of best response behaviors in concurrent iterated many-object negotiations

MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Evolutionary dynamics of ant colony optimization

MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Q-Tree: automatic construction of hierarchical state representation for reinforcement learning

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
Planning interactive task for intelligent characters

Computer Animation and Virtual Worlds
Multi-agent task division learning in hide-and-seek games

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Multi-agent learning and the reinforcement gradient

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Recognizing internal states of other agents to anticipate and coordinate interactions

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
How to design agent-based simulation models using agent learning

Proceedings of the Winter Simulation Conference
Learning classifier system with average reward reinforcement learning

Knowledge-Based Systems
Simulating plausible mechanisms for changing hepatic xenobiotic clearance patterns

Proceedings of the Winter Simulation Conference
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management

International Journal of Agent Technologies and Systems
Two-step gradient-based reinforcement learning for underwater robotics behavior learning

Robotics and Autonomous Systems
Exploiting user feedback for adapting mobile interaction obtrusiveness

UCAmI'12 Proceedings of the 6th international conference on Ubiquitous Computing and Ambient Intelligence
On measuring social intelligence: experiments on competition and cooperation

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
A social network-based trust-aware propagation model for P2P systems

Knowledge-Based Systems
Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control

Information Sciences: an International Journal
Robust convention emergence in social networks through self-reinforcing structures dissolution

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
On Finding and Learning Effective Strategies for Complex Non-zero-sum Repeated Games

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Behavior Abstraction Robustness in Agent Modeling

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Knowledge-Based Exploration for Reinforcement Learning in Self-Organizing Neural Networks

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
A two-pheromone trail ant colony system--tabu search approach for the heterogeneous vehicle routing problem with time windows and multiple products

Journal of Heuristics
Performance of distributed multi-agent multi-state reinforcement spectrum management using different exploration schemes

Expert Systems with Applications: An International Journal
From model-based control to data-driven control: Survey, classification and perspective

Information Sciences: an International Journal
Learning with configurable operators and RL-based heuristics

NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
Learning to crawl deep web

Information Systems
An investigation into the development of service-oriented robotic systems

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Testing probabilistic equivalence through Reinforcement Learning

Information and Computation
Emergence of social norms through collective learning in networked agent societies

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
iCO2: multi-user eco-driving training environment based on distributed constraint optimization

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Smart exploration in reinforcement learning using absolute temporal difference errors

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Object focused q-learning for autonomous agents

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
A slotted CSMA based reinforcement learning approach for extending the lifetime of underwater acoustic wireless sensor networks

Computer Communications
Dynamic policy programming

The Journal of Machine Learning Research
Reinforcement learning for cooperative sensing gain in cognitive radio ad hoc networks

Wireless Networks
Distributed Q-Learning for Interference Mitigation in Self-Organised Femtocell Networks: Synchronous or Asynchronous?

Wireless Personal Communications: An International Journal
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Backward Q-learning: The combination of Sarsa algorithm and Q-learning

Engineering Applications of Artificial Intelligence
Online learning of timeout policies for dynamic power management

ACM Transactions on Embedded Computing Systems (TECS)
Resource-bounded machines are motivated to be effective, efficient, and curious

AGI'13 Proceedings of the 6th international conference on Artificial General Intelligence
Toward nonlinear local reinforcement learning rules through neuroevolution

Neural Computation
The dynamics of reinforcement social learning in cooperative multiagent systems

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Q-learning Reward Propagation Method for Reducing the Transmission Power of Sensor Nodes in Wireless Sensor Networks

Wireless Personal Communications: An International Journal
Gait Pattern Based on CMAC Neural Network for Robotic Applications

Neural Processing Letters
Reinforcement learning based routing in wireless mesh networks

Wireless Networks
Full-range adaptive cruise control based on supervised adaptive dynamic programming

Neurocomputing
An actor-critic algorithm for multi-agent learning in queue-based stochastic games

Neurocomputing
Adaptive learning algorithm of self-organizing teams

Expert Systems with Applications: An International Journal
Reinforcement learning algorithms with function approximation: Recent advances and applications

Information Sciences: an International Journal
The arcade learning environment: an evaluation platform for general agents

Journal of Artificial Intelligence Research
Construction of approximation spaces for reinforcement learning

The Journal of Machine Learning Research
Scheduling sensors for monitoring sentient spaces using an approximate POMDP policy

Pervasive and Mobile Computing
Frontal theta oscillatory activity is a common mechanism for the computation of unexpected outcomes and learning rate

Journal of Cognitive Neuroscience
Hybrid motion graph for character motion synthesis

Journal of Visual Languages and Computing
A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation

Artificial Life and Robotics
Multiagent learning in the presence of memory-bounded agents

Autonomous Agents and Multi-Agent Systems
Learning potential functions and their representations for multi-task reinforcement learning

Autonomous Agents and Multi-Agent Systems
Embodied imitation-enhanced reinforcement learning in multi-agent systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Active tracking and pursuit under different levels of occlusion: a two-layer approach

Machine Vision and Applications
Self-healing in transparent optical packet switching mesh networks: A reinforcement learning perspective

Computer Networks: The International Journal of Computer and Telecommunications Networking
Hierarchical control of traffic signals using Q-learning with tile coding

Applied Intelligence
Analysis of emission right prices in greenhouse gas emission trading via agent-based model

Multiagent and Grid Systems
A reinforcement learning based solution for cognitive network cooperation between co-located, heterogeneous wireless sensor networks

Ad Hoc Networks
Self-organized femtocells: a Fuzzy Q-Learning approach

Wireless Networks
Adaptive function approximation in reinforcement learning with an interpolating growing neural gas

International Journal of Hybrid Intelligent Systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

\cal Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states.This paper presents and proves in detail a convergence theorem for \cal Q-learning based on that outlined in Watkins (1989). We show that \cal Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many \cal Q values can be changed each iteration, rather than just one.