Introduction to Reinforcement Learning

Authors:
Richard S. Sutton;Andrew G. Barto
Affiliations:
-;-
Venue:
Introduction to Reinforcement Learning
Year:
1998

Citing 0
Cited 2533

Multi-modal stereognosis

Proceedings of the third annual conference on Autonomous Agents
Elevator Group Control Using Multiple Reinforcement Learning Agents

Machine Learning
Reinforcement learning and mistake bounded algorithms

COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
Convergence analysis of temporal-difference learning algorithms with linear function approximation

COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
Efficient exploration for optimizing immediate reward

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Learning state features from policies to bias exploration in reinforcement learning

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
A cerebellar model of timing and prediction in the control of reaching

Neural Computation
Toward a Model of Intelligence as an Economy of Agents

Machine Learning
A reinforcement learning agent for personalized information filtering

Proceedings of the 5th international conference on Intelligent user interfaces
Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Congestion-dependent pricing of network services

IEEE/ACM Transactions on Networking (TON)
Automated strategy searches in an electronic goods market: learning and complex price schedules

Proceedings of the 1st ACM conference on Electronic commerce
Learning user's preferences by analyzing Web-browsing behaviors

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Adaptivity in agent-based routing for data networks

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Ant algorithms for discrete optimization

Artificial Life
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
Learning to Play Chess Using Temporal Differences

Machine Learning
Relevance and reinforcement in interactive browsing

Proceedings of the ninth international conference on Information and knowledge management
On verifying game designs and playing strategies using reinforcement learning

Proceedings of the 2001 ACM symposium on Applied computing
On the Convergence of Temporal-Difference Learning with Linear Function Approximation

Machine Learning
On-line analysis of the TCP acknowledgment delay problem

Journal of the ACM (JACM)
Hierarchical multi-agent reinforcement learning

Proceedings of the fifth international conference on Autonomous agents
An architecture for action selection in robotic soccer

Proceedings of the fifth international conference on Autonomous agents
A social reinforcement learning agent

Proceedings of the fifth international conference on Autonomous agents
A reinforcement learning model of selective visual attention

Proceedings of the fifth international conference on Autonomous agents
Pricing information bundles in a dynamic environment

Proceedings of the 3rd ACM conference on Electronic Commerce
Searching in metric spaces

ACM Computing Surveys (CSUR)
Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network

Neural Processing Letters
Programming backgammon using self-teaching neural nets

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Adaptive dynamic scene analysis

Imaging and vision systems
Multiagent learning using a variable learning rate

Artificial Intelligence
Learning classifier systems: a complete introduction, review, and roadmap

Journal of Artificial Evolution and Applications
Robustness of reputation-based trust: boolean case

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Learning sequences of actions in collectives of autonomous agents

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Strategic sequential bidding in auctions using dynamic programming

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
Designing agent collectives for systems with markovian dynamics

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to weigh basic behaviors in scalable agents

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Integrated learning for interactive synthetic characters

Proceedings of the 29th annual conference on Computer graphics and interactive techniques
Agents' advanced features for negotiation and coordination

Mutli-agents systems and applications
Relational reinforcement learning

Mutli-agents systems and applications
A study of the simulated evolution of the spectral sensitivity of visual agent receptors

Artificial Life
Iterated Phantom Induction: A Knowledge-Based Approach to Learning Control

Machine Learning
Adaptive mirroring of system of systems architectures

WOSS '02 Proceedings of the first workshop on Self-healing systems
Learning Sequences of Compatible Actions Among Agents

Artificial Intelligence Review
The Brain-Like Sensorimotor Control System

Journal of Intelligent and Robotic Systems
Efficient and inefficient ant coverage methods

Annals of Mathematics and Artificial Intelligence
Ant colony optimization and stochastic gradient descent

Artificial Life
Planning and Control in Artificial Intelligence: A Unifying Perspective

Applied Intelligence
Rapid Concept Learning for Mobile Robots

Autonomous Robots
Dynamics of a Classical Conditioning Model

Autonomous Robots
Target Reaching by Using Visual Information and Q-learning Controllers

Autonomous Robots
Certain Principles of Biomorphic Robots

Autonomous Robots
Making Organizational Learning Operational: Implications from Learning Classifier Systems

Computational & Mathematical Organization Theory
Reinforced Genetic Programming

Genetic Programming and Evolvable Machines
Rollout Algorithms for Combinatorial Optimization

Journal of Heuristics
Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Introduction

Machine Learning
Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks

Machine Learning
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

Machine Learning
Near-Optimal Reinforcement Learning in Polynomial Time

Machine Learning
Technical Update: Least-Squares Temporal Difference Learning

Machine Learning
Continuous-Action Q-Learning

Machine Learning
Structure in the Space of Value Functions

Machine Learning
Classifiers that approximate functions

Natural Computing: an international journal
Robot learning driven by emotions

Adaptive Behavior
A perspective view and survey of meta-learning

Artificial Intelligence Review
Learning intelligent behavior in a non-stationary and partially observable environment

Artificial Intelligence Review
Reinforcement Learning Rules in a Repeated Game

Computational Economics
Metalearning and neuromodulation

Neural Networks - Computational models of neuromodulation
TD Models of reward predictive responses in dopamine neurons

Neural Networks - Computational models of neuromodulation
Dopamine: generalization and bonuses

Neural Networks - Computational models of neuromodulation
Opponent interactions between serotonin and dopamine

Neural Networks - Computational models of neuromodulation
Control of exploitation-exploration meta-parameter in reinforcement learning

Neural Networks - Computational models of neuromodulation
Neuromodulation, theta rhythm and rat spatial navigation

Neural Networks - Computational models of neuromodulation
The anticipatory classifier system and genetic generalization

Natural Computing: an international journal
Neural computing increases robot adaptivity

Natural Computing: an international journal
Relative Loss Bounds for Temporal-Difference Learning

Machine Learning
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning

Discrete Event Dynamic Systems
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
Robots With Humanoid Features in Public Places: A Case Study

IEEE Intelligent Systems
Tracing Patterns and Attention: Humanoid Robot Cognition

IEEE Intelligent Systems
Jijo-2: An Office Robot that Communicates and Learns

IEEE Intelligent Systems
Room Service, AI-Style

IEEE Intelligent Systems
Optimal control using the transport equation: the Liouville machine

Adaptive Behavior
Learning cost-sensitive active classifiers

Artificial Intelligence
Learning of plan execution policies for indoor navigation

AI Communications - Special issue on KI-2001
Multiple model-based reinforcement learning

Neural Computation
Designing guide-path networks for automated guided vehicle system by using the Q-learning technique

Computers and Industrial Engineering
ZCS redux

Evolutionary Computation
A personalized and integrative comparison-shopping engine and its applications

Decision Support Systems - Special issue: Agents and e-commerce business models
Optimizing hypervideo navigation using a Markov decision process approach

Proceedings of the tenth ACM international conference on Multimedia
Using a time-delay actor-critic neural architecture with dopamine-like reinforcement signal for learning in autonomous robots

Emergent neural computational architectures based on neuroscience
Learning to play strong poker

Machines that learn to play games
Formalizing the Ant Algorithms in Terms of Reinforcement Learning

ECAL '99 Proceedings of the 5th European Conference on Advances in Artificial Life
An Information-Theoretic Approach for the Quantification of Relevance

ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Towards a Universal Theory of Artificial Intelligence Based on Algorithmic Probability and Sequential Decisions

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Learning While Exploring: Bridging the Gaps in the Eligibility Traces

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Speeding Up Relational Reinforcement Learning through the Use of an Incremental First Order Decision Tree Learner

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Social Agents Playing a Periodical Policy

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
On-Line Support Vector Machine Regression

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Propagation of Q-values in Tabular TD(lambda)

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Characterizing Markov Decision Processes

ECML '02 Proceedings of the 13th European Conference on Machine Learning
A Multi-agent System for Electronic Commerce including Adaptive Strategic Behaviours

EPIA '99 Proceedings of the 9th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Learning a Navigation Task in Changing Environments by Multi-task Reinforcement Learning

EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Reinforcement Learning in Situated Agents: Theoretical and Practical Solutions

EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Selection of Behavior in Social Situations

Proceedings of the EvoWorkshops on Applications of Evolutionary Computing
From the Sea to the Sidewalk: The Evolution of Hexapod Walking Gaits by a Genetic Algorithm

ICES '00 Proceedings of the Third International Conference on Evolvable Systems: From Biology to Hardware
Solving Partially Observable Problems by Evolution and Learning of Finite State Machines

ICES '01 Proceedings of the 4th International Conference on Evolvable Systems: From Biology to Hardware
Enhancing Multi-Agent Based Simulation with Human-Like Decision Making Strategies

MABS '00 Proceedings of the Second International Workshop on Multi-Agent-Based Simulation-Revised and Additional Papers
A Framework for Supporting Intelligent Fault and Performance Management for Communication Networks

MMNS '01 Proceedings of the 4th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
An Overview of MAXQ Hierarchical Reinforcement Learning

SARA '02 Proceedings of the 4th International Symposium on Abstraction, Reformulation, and Approximation
Learning Options in Reinforcement Learning

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Model Minimization in Hierarchical Reinforcement Learning

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Language as a Complex Adaptive System

PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
An Integrated On-Line Learning System for Evolving Programmable Logic Array Controllers

PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
On Using Constructivism in Neural Classifier Systems

PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
TCS Learning Classifier System Controller on a Real Robot

PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
Using and Evaluating Adaptive Agents for Electronic Commerce Negotiation

IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Application of Reinforcement Learning to Electrical Power System Closed-Loop Emergency Control

PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Self-Similar Layered Hidden Markov Models

PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Reinforcement Learning: Past, Present and Future

SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
Least-Squares Methods in Reinforcement Learning for Control

SETN '02 Proceedings of the Second Hellenic Conference on AI: Methods and Applications of Artificial Intelligence
Modelling Intelligent Behaviour: The Markov Decision Process Approach

IBERAMIA '98 Proceedings of the 6th Ibero-American Conference on AI: Progress in Artificial Intelligence
An Analysis of the Pheromone Q-Learning Algorithm

IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
Learning to Reach the Pareto Optimal Nash Equilibrium as a Team

AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Q-Learning in Continuous State and Action Spaces

AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
Neurofuzzy Learning of Mobile Robot Behaviours

AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
A Classification Scheme for Negotiation in Electronic Commerce

Agent Mediated Electronic Commerce, The European AgentLink Perspective.
Agents Advanced Features for Negotiation in Electronic Commerce and Virtual Organisations Formation Processes

Agent Mediated Electronic Commerce, The European AgentLink Perspective.
Selection of Tasks and Delegation of Responsibility in a Multiagent System for Emergent Process Management

AI '01 Proceedings of the 14th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Relational Reinforcement Learning

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Agents' Advanced Features for Negotiation and Coordination

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
SVD Reduction in Continuos Environment Reinforcement Learning

Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Reinforcement Learning for Control of Traffic and Access Points in Intelligent Wireless ATM Networks

Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Reinforcement Learning for Biped Locomotion

ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Intraday FX Trading: An Evolutionary Reinforcement Learning Approach

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Lempel-Ziv Coding in Reinforcement Learning

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Coordinating Learning Agents via Utility Assignment

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Variance-Penalized Reinforcement Learning for Risk-Averse Asset Allocation

IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
Learning to Predict Variable-Delay Rewards and Its Role in Autonomous Developmental Robotics

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
What Is a Learning Classifier System?

Learning Classifier Systems, From Foundations to Applications
Strength or Accuracy? Fitness Calculation in Learning Classifier Systems

Learning Classifier Systems, From Foundations to Applications
State of XCS Classifier System Research

Learning Classifier Systems, From Foundations to Applications
An Introduction to Learning Fuzzy Classifier Systems

Learning Classifier Systems, From Foundations to Applications
The Fighter Aircraft LCS: A Case of Different LCS Goals and Techniques

Learning Classifier Systems, From Foundations to Applications
A Roadmap to the Last Decade of Learning Classifier System Research

Learning Classifier Systems, From Foundations to Applications
An Artificial Economy of Post Production Systems

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Learning Classifier Systems Meet Multiagent Environments

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
An Algorithmic Description of XCS

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Mining Oblique Data with XCS

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Using Classifier Systems as Adaptive Expert Systems for Control

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
YACS: Combining Dynamic Programming with Generalization in Classifier Systems

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Biasing Exploration in an Anticipatory Learning Classifier System

IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
Two Views of Classifier Systems

IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
Fast Function Approximation with Hierarchical Neural Networks and Their Application to a Reinforcement Learning Agent

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Information Integration for Robot Learning Using Neural Fuzzy Systems

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Hybrid Framework for Neuro-Dynamic Programming Application to Water Supply Networks

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Game Theory and Artificial Intelligence

Selected papers from the UKMAS Workshop on Foundations and Applications of Multi-Agent Systems
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer

RoboCup 2001: Robot Soccer World Cup V
Evolutionary Behavior Selection with Activation/Termination Constraints

RoboCup 2001: Robot Soccer World Cup V
Multiple Reward Criterion for Cooperative Behavior Acquisition in a Muliagent Environment

RoboCup-99: Robot Soccer World Cup III
Learning to Behave by Environment Reinforcement

RoboCup-99: Robot Soccer World Cup III
Reinforcement Learning for 3 vs. 2 Keepaway

RoboCup 2000: Robot Soccer World Cup IV
Learning Mutual Trust

Proceedings of the workshop on Deception, Fraud, and Trust in Agent Societies held during the Autonomous Agents Conference: Trust in Cyber-societies, Integrating the Human and Artificial Perspectives
Toward the Formal Foundation of Ant Programming

ANTS '02 Proceedings of the Third International Workshop on Ant Algorithms
An Improved Q-Learning Algorithm Using Synthetic Pheromones

CEEMAS '01 Revised Papers from the Second International Workshop of Central and Eastern Europe on Multi-Agent Systems: From Theory to Practice in Multi-Agent Systems
Reactive and Memory-Based Genetic Programming for Robot Control

Proceedings of the Second European Workshop on Genetic Programming
On the Relationship between Learning Capability and the Boltzmann-Formula

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Value Prediction in Engineering Applications

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Learning from Human Decision-Making Behaviors - An Application to RoboCup Software Agents

IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence
On the Asymptotic Behaviour of a Constant Stepsize Temporal-Difference Learning Algorithm

EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Open Theoretical Questions in Reinforcement Learning

EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Application of Episodic Q-Learning to a Multi-agent Cooperative Task

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Preliminary Results

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Learning in Character: Building Autonomous Animated Characters That Learn What They Ought to Learn

ICVS '01 Proceedings of the International Conference on Virtual Storytelling: Using Virtual Reality Technologies for Storytelling
Introduction to Sequence Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making

Sequence Learning - Paradigms, Algorithms, and Applications
Similarity between Fuzzy Multi-objective Control and Eligibility

AFSS '02 Proceedings of the 2002 AFSS International Conference on Fuzzy Systems. Calcutta: Advances in Soft Computing
Using a Time-Delay Actor-Critic Neural Architecture with Dopamine-Like Reinforcement Signal for Learning in Autonomous Robots

Emergent Neural Computational Architectures Based on Neuroscience - Towards Neuroscience-Inspired Computing
Decision-Theoretic Control of Planetary Rovers

Revised Papers from the International Seminar on Advances in Plan-Based Control of Robotic Agents,
Adaptive Representation Methods for Reinforcement Learning

AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Learning as a Consequence of Selection

Selected Papers from the 5th European Conference on Artificial Evolution
Abstraction Methods for Game Theoretic Poker

CG '00 Revised Papers from the Second International Conference on Computers and Games
Learning Time Allocation Using Neural Networks

CG '00 Revised Papers from the Second International Conference on Computers and Games
Chess Neighborhoods, Function Combination, and Reinforcement Learning

CG '00 Revised Papers from the Second International Conference on Computers and Games
Logic, Knowledge Representation, and Bayesian Decision Theory

CL '00 Proceedings of the First International Conference on Computational Logic
MINERVA: A Tour-Guide Robot that Learns

KI '99 Proceedings of the 23rd Annual German Conference on Artificial Intelligence: Advances in Artificial Intelligence
Dynamic Pricing of Information Products Based on Reinforcement Learning: A Yield-Management Approach

KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Using Document Structures for Personal Ontologies and User Modeling

UM '01 Proceedings of the 8th International Conference on User Modeling 2001
Faster Near-Optimal Reinforcement Learning: Adding Adaptiveness to the E3 Algorithm

ALT '99 Proceedings of the 10th International Conference on Algorithmic Learning Theory
Feedforward Neural Networks in Reinforcement Learning Applied to High-Dimensional Motor Control

ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
Invention vs. Discovery

DS '02 Proceedings of the 5th International Conference on Discovery Science
To Collect or Not to Collect? Machine Learning for Memory Management

Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
High-Level Student Modeling with Machine Learning

ITS '00 Proceedings of the 5th International Conference on Intelligent Tutoring Systems
A Comparison of Decision Making Criteria and Optimization Methods for Active Robotic Sensing

NMA '02 Revised Papers from the 5th International Conference on Numerical Methods and Applications
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning

ATAL '00 Proceedings of the 7th International Workshop on Intelligent Agents VII. Agent Theories Architectures and Languages
Autonomous Spacecraft Resource Management: A Multi-agent Approach

AI*IA '99 Proceedings of the 6th Congress of the Italian Association for Artificial Intelligence on Advances in Artificial Intelligence
A Platform for Electronic Commerce with Adaptive Agents

Agent-Mediated Electronic Commerce III, Current Issues in Agent-Based Electronic Commerce Systems (includes revised papers from AMEC 2000 Workshop)
An Adaptive, Maintable, Extensible Process Agent

DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
Learning Rates for Q-Learning

COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Optimizing Average Reward Using Discounted Rewards

COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Self-Optimizing and Pareto-Optimal Policies in General Environments Based on Bayes-Mixtures

COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
A Multi-agent Q-learning Framework for Optimizing Stock Trading Systems

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Agents for Industry Process Management

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Reinforcement Learning to Drive a Car by Pattern Matching

Proceedings of the 24th DAGM Symposium on Pattern Recognition
Some Effects of Individual Learning on the Evolution of Sensors

ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Spatiotemporal Abstraction of Stochastic Sequential Processes

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Anticipation-Based Control Architecture for a Mobile Robot

ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning

IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
Incorporating Perception-Based Information in Reinforcement Learning Using Computing with Words

IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Using ILP to Improve Planning in Hierarchical Reinforcement Learning

ILP '00 Proceedings of the 10th International Conference on Inductive Logic Programming
Dynamic balance of a biped robot using fuzzy reinforcement learning agents

Fuzzy Sets and Systems - Special issue: Fuzzy set techniques for intelligent robotic systems
Learning fuzzy rules from iterative execution of games

Fuzzy Sets and Systems - Theme: Modeling and learning
A context-based architecture for general problem solving

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Timed delivery of reward signals in an autonomous robot

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Using Markovian decision problems to analyze animal performance in random and variable ratio schedules of reinforcement

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Levels of dynamics and adaptive behavior in evolutionary neural controllers

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Memetic-neural scheduler of jobs in identical parallel machines

Second international workshop on Intelligent systems design and application
Nonlinear credit assignment for musical sequences

Second international workshop on Intelligent systems design and application
Sequential cost-sensitive decision making with reinforcement learning

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Evolution of reinforcement learning in uncertain environments: a simple explanation for complex foraging behaviors

Adaptive Behavior
Applications of the self-organising map to reinforcement learning

Neural Networks - New developments in self-organizing maps
Reinforcement learning for POMDPs based on action values and stochastic optimization

Eighteenth national conference on Artificial intelligence
The design of collectives of agents to control non-Markovian systems

Eighteenth national conference on Artificial intelligence
Machine learning

Handbook of data mining and knowledge discovery
References

Neural networks and the financial markets
D-Learning: what learning in dogs tells us about building characters that learn what they ought to learn

Exploring artificial intelligence in the new millennium
A System for Building Intelligent Agents that Learn to Retrieve and Extract Information

User Modeling and User-Adapted Interaction
Anticipations control behavior: animal behavior in an anticipatory learning classifier system

Adaptive Behavior
A Bi-Recursive Neural Network Architecture for the Prediction of Protein Coarse Contact Maps

CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Soccer strategies that live in the B2B world of negotiation and decision-making

Decision Support Systems
A taxonomy for spatiotemporal connectionist networks revisited: the unsupervised case

Neural Computation
A non-computationally-intensive neurocontroller for autonomous mobile robot navigation

Biologically inspired robot behavior engineering
A bio-inspired robotic mechanism for autonomous locomotion in unconventional environments

Autonomous robotic systems
Integration of soft computing towards autonomous legged robots

Autonomous robotic systems
SOS++: finding smart behaviors using learning and evolution

ICAL 2003 Proceedings of the eighth international conference on Artificial life
Walverine: a Walrasian trading agent

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Adaptive policy gradient in multiagent learning

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A selection-mutation model for q-learning in multi-agent systems

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Introducing an agent of a certain persuasion

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
How to calm hyperactive agents

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
An introduction to reinforcement learning theory: value function methods

Advanced lectures on machine learning
Offline learning and the role of autogenous speech: new suggestions from birdsong research

Speech Communication - Special issue on the nature of speech perception (the psychophysics of speech perception III)
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
A reinforcement learning adaptive fuzzy controller for robots

Fuzzy Sets and Systems - Theme: Modeling and control
Reinforcement learning based on local state feature learning and policy adjustment

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Introduction to multimedia and mobile agents
Autonomous mental development in high dimensional context and action spaces

Neural Networks - 2003 Special issue: Advances in neural networks research — IJCNN'03
On the convergence of optimistic policy iteration

The Journal of Machine Learning Research
ε-mdps: learning in varying environments

The Journal of Machine Learning Research
R-max - a general polynomial time algorithm for near-optimal reinforcement learning

The Journal of Machine Learning Research
Using confidence bounds for exploitation-exploration trade-offs

The Journal of Machine Learning Research
Learning behavior-selection by emotions and cognition in a multi-goal robot task

The Journal of Machine Learning Research
Accuracy-based learning classifier systems: models, analysis and applications to classification tasks

Evolutionary Computation
Adaptive Radial Basis Decomposition by Learning Vector Quantization

Neural Processing Letters
Mining Plans for Customer-Class Transformation

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
To buy or not to buy: mining airfare data to minimize ticket purchase price

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Adding numbers to text classification

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Learning systems and their engineering: a project proposal

Practicing software engineering in the 21st century
Learning-assisted automated planning: looking back, taking stock, going forward

AI Magazine
Reinforcing reachable routes

Computer Networks: The International Journal of Computer and Telecommunications Networking
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Least-squares policy iteration

The Journal of Machine Learning Research
Distributed Reinforcement Learning Control for Batch Sequencing and Sizing in Just-In-Time Manufacturing Systems

Applied Intelligence
Interpretation by Implementation for Understanding a Multiagent Organization

Computational & Mathematical Organization Theory
Inter-module credit assignment in modular reinforcement learning

Neural Networks
Combining Hebbian and reinforcement learning in a minibrain model

Neural Networks
Combining importance sampling and temporal difference control variates to simulate Markov Chains

ACM Transactions on Modeling and Computer Simulation (TOMACS)
The domestic robot—a friendly cognitive system takes care of your home

Ambient intelligence
Autonomous Learning Architecture for Environmental Mapping

Journal of Intelligent and Robotic Systems
Frontal plane algorithms for dynamic bipedal walking

Robotica
Development and the Baldwin effect

Artificial Life
Learning obstacle avoidance with an operant behavior model

Artificial Life
Choosing search heuristics by non-stationary reinforcement learning

Metaheuristics
A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL

Probability in the Engineering and Informational Sciences
Reinforcement learning with via-point representation

Neural Networks
An experimental evaluation of reinforcement learning for gain scheduling

Design and application of hybrid intelligent systems
Rated MCRDR: finding non-linear relationships between classifications in MCRDR

Design and application of hybrid intelligent systems
Employing OLAP mining for multiagent reinforcement learning

Design and application of hybrid intelligent systems
Policy gradient methods in multi-agent systems: pursuit problem

Design and application of hybrid intelligent systems
A Reinforcement Learning Framework for Parameter Control in Computer Vision Applications

CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
Q(")-Based Image Thresholding

CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
Learning Rates for Q-learning

The Journal of Machine Learning Research
A Geometric Approach to Multi-Criterion Reinforcement Learning

The Journal of Machine Learning Research
A generic architecture for adaptive agents based on reinforcement learning

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Bio-inspired systems (BIS)
Self-organized load balancing in proxy servers: algorithms and performance

Journal of Intelligent Information Systems - Special issue on web intelligence
Representing von Neumann–Morgenstern Games in the Situation Calculus

Annals of Mathematics and Artificial Intelligence
Dynamic bipedal walking assisted by learning

Robotica
Transfer of Experience Between Reinforcement Learning Environments with Progressive Difficulty

Artificial Intelligence Review
Utile distinction hidden Markov models

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Using relative novelty to identify useful temporal abstractions in reinforcement learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Bellman goes relational

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Sparse cooperative Q-learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
P3VI: a partitioned, prioritized, parallel value iterator

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate

Web Intelligence and Agent Systems
Reinforcement Learning with Factored States and Actions

The Journal of Machine Learning Research
Recommender Systems Research: A Connection-Centric Survey

Journal of Intelligent Information Systems
Cross channel optimized marketing by reinforcement learning

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Integrating Guidance into Relational Reinforcement Learning

Machine Learning
Best-Response Multiagent Learning in Non-Stationary Environments

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Unifying Temporal and Structural Credit Assignment Problems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Hierarchical Reinforcement Learning in Communication-Mediated Multiagent Coordination

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Multi-Agent Patrolling with Reinforcement Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Organization-Based Coalition Formation

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Improving the Learning Rate by Inducing a Transition Model

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Text Adaptation for Mobile Digital Teletext

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Learning to play games in extensive form by valuation

TARK '01 Proceedings of the 8th conference on Theoretical aspects of rationality and knowledge
Precomputing avatar behavior from human motion data

SCA '04 Proceedings of the 2004 ACM SIGGRAPH/Eurographics symposium on Computer animation
General methodology 1: optimising discrete event simulation models using a reinforcement learning agent

Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Market-based recommendation: Agents that compete for consumer attention

ACM Transactions on Internet Technology (TOIT)
Affective Learning — A Manifesto

BT Technology Journal
Exploitation vs. exploration: choosing a supplier in an environment of incomplete information

Decision Support Systems
Learning diagnostic policies from examples by systematic search

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Knowledge-Based Kernel Approximation

The Journal of Machine Learning Research
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

The Journal of Machine Learning Research
Reliability of internal prediction/estimation and its application: I. adaptive action selection reflecting reliability of value function

Neural Networks
Coordinating Multiple Agents via Reinforcement Learning

Autonomous Agents and Multi-Agent Systems
Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection

Genetic Programming and Evolvable Machines
Online learning of aggregate knowledge about non-linear preferences applied to negotiating prices and bundles

ICEC '04 Proceedings of the 6th international conference on Electronic commerce
Teaching robots to plan through Q-learning

Robotica
A theory of epineuronal memory

Neural Networks
Using Optimal Foraging Models to Evaluate Learned Robotic Foraging Behavior

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An Architecture for Behavior-Based Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Fast multi-level adaptation for interactive autonomous characters

ACM Transactions on Graphics (TOG)
Research challenges of autonomic computing

Proceedings of the 27th international conference on Software engineering
System for foreign exchange trading using genetic algorithms and reinforcement learning

International Journal of Systems Science
QoS Control Strategies for High-Quality Video Processing

Real-Time Systems
Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The Cyber Rodent Project: Exploration of Adaptive Mechanisms for Self-Preservation and Self-Reproduction

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Walverine: a Walrasian trading agent

Decision Support Systems - Special issue: Decision theory and game theory in agent design
Agent learning in supplier selection models

Decision Support Systems - Special issue: Decision theory and game theory in agent design
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

Proceedings of the 2005 ACM symposium on Applied computing
A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game

Machine Learning
An adaptive pursuit strategy for allocating operator probabilities

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
XCS with eligibility traces

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
XCS with computed prediction in multistep environments

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
An abstraction agorithm for genetics-based reinforcement learning

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
GAMM: genetic algorithms with meta-models for vision

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Self-managed decentralised systems using K-components and collaborative reinforcement learning

WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Online model-based adaptation for optimizing performance and dependability

WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Layering and heterogeneity as design principles for animated embedded agents

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent embedded agents
Computational intelligence for structured learning of a partner robot based on imitation

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent embedded agents
Using Agents and Simulation to Develop Adequate Thinking Styles

ICALT '05 Proceedings of the Fifth IEEE International Conference on Advanced Learning Technologies
Behavior transfer for value-function-based reinforcement learning

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Multi-agent reward analysis for learning in noisy domains

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Modeling task allocation using a decision theoretic model

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Automatic computer game balancing: a reinforcement learning approach

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Improving reinforcement learning function approximators via neuroevolution

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Novel runtime systems support for adaptive compositional modeling in PSEs

Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
E-commerce intelligent agent: personalization travel support agent using Q Learning

ICEC '05 Proceedings of the 7th international conference on Electronic commerce
Reinforcement learning for active model selection

UBDM '05 Proceedings of the 1st international workshop on Utility-based data mining
Contextual recommender problems [extended abstract]

UBDM '05 Proceedings of the 1st international workshop on Utility-based data mining
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Optimal Control Using the Transport Equation: The Liouville Machine

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Thesis: relational reinforcement learning

AI Communications
Teaching virtual characters how to use body language

Lecture Notes in Computer Science
Implementing Temporal-Difference Learning with the Scaled Conjugate Gradient Algorithm

Neural Processing Letters
Adaptive value function approximations in classifier systems

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Learning classifier system equivalent with reinforcement learning with function approximation

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Counter example for Q-bucket-brigade under prediction problem

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
An autonomous explore/exploit strategy

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Relating reinforcement learning performance to classification performance

ICML '05 Proceedings of the 22nd international conference on Machine learning
Proto-value functions: developmental reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
High speed obstacle avoidance using monocular vision and reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Coarticulation: an approach for generating concurrent plans in Markov decision processes

ICML '05 Proceedings of the 22nd international conference on Machine learning
A theoretical analysis of Model-Based Interval Estimation

ICML '05 Proceedings of the 22nd international conference on Machine learning
Bayesian sparse sampling for on-line reward optimization

ICML '05 Proceedings of the 22nd international conference on Machine learning
The Development of Embodied Cognition: Six Lessons from Babies

Artificial Life
Agent-Based Computational Economics: Growing Economies From the Bottom Up

Artificial Life
A middleware for autonomic QoS management based on learning

SEM '05 Proceedings of the 5th international workshop on Software engineering and middleware
Local Reinforcement and Recombination in Classifier Systems

Evolutionary Computation
Rule Fitness and Pathology in Learning Classifier Systems

Evolutionary Computation
Emergence of Cooperation: State of the Art

Artificial Life
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games

Autonomous Agents and Multi-Agent Systems
Hybrid least-squares methods for reinforcement learning

IEA/AIE'2003 Proceedings of the 16th international conference on Developments in applied artificial intelligence
GenSo-FDSS: a neural-fuzzy decision support system for pediatric ALL cancer subtype identification using gene expression data

Artificial Intelligence in Medicine
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents

Computer Networks: The International Journal of Computer and Telecommunications Networking
Context awarable self-configuration system for distributed resource management

IEA/AIE'2005 Proceedings of the 18th international conference on Innovations in Applied Artificial Intelligence
NJFun: a reinforcement learning spoken dialogue system

ANLP/NAACL-ConvSyst '00 Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems - Volume 3
Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia

Neural Computation
Adaptive dialogue systems - interaction with interact

SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
An Ensemble of Cooperative Extended Kohonen Maps for Complex Robot Motion Tasks

Neural Computation
Stochastic Optimal Control and Estimation Methods Adapted to the Noise Characteristics of the Sensorimotor System

Neural Computation
Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms

Neural Computation
A Computational Model of the Functional Role of the Ventral-Striatal D2 Receptor in the Expression of Previously Acquired Behaviors

Neural Computation
Attention-Gated Reinforcement Learning of Internal Representations for Classification

Neural Computation
Temporal Difference Model Reproduces Anticipatory Neural Activity

Neural Computation
Developing adaptive auction mechanisms

ACM SIGecom Exchanges
The concept of a universal learning system as a basis for creating a general mathematical theory of learning

Minds and Machines - Machine learning as experimental philosophy of science
Evolution of Cooperative Problem Solving in an Artificial Economy

Neural Computation
A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

Neural Computation
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms

Neural Computation
Reinforcement Learning in Continuous Time and Space

Neural Computation
Finding optimal satisficing strategies for and-or trees

Artificial Intelligence
Pedagogical possibilities for the dice game pig

Journal of Computing Sciences in Colleges
Sequence-Learning Algorithm Based on Backward Chaining

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes

Machine Learning
Combining metric and topological navigation of simulated robots

Acta Cybernetica
Efficient Discriminant Viewpoint Selection for Active Bayesian Recognition

International Journal of Computer Vision
Playing games in many possible worlds

EC '06 Proceedings of the 7th ACM conference on Electronic commerce
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle

IEEE Transactions on Dependable and Secure Computing
Adaptive game AI with dynamic scripting

Machine Learning
Universal parameter optimisation in games based on SPSA

Machine Learning
SHAGE: a framework for self-managed robot software

Proceedings of the 2006 international workshop on Self-adaptation and self-managing systems
Simulating sellers in online exchanges

Decision Support Systems
A short tutorial on reinforcement learning: review and applications

Intelligent information processing II
Precomputing avatar behavior from human motion data

Graphical Models - Special issue on SCA 2004
Agent-based buddy-finding methodology for knowledge sharing

Information and Management
Using inaccurate models in reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Relational temporal difference learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

ICML '06 Proceedings of the 23rd international conference on Machine learning
Qualitative reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
PAC model-free reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Combining gradient techniques for numerical multi-objective evolutionary optimization

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Comparing evolutionary and temporal difference methods in a reinforcement learning domain

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Standard and averaging reinforcement learning in XCS

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Classifier prediction based on tile coding

Proceedings of the 8th annual conference on Genetic and evolutionary computation
A Bayesian approach to learning classifier systems in uncertain environments

Proceedings of the 8th annual conference on Genetic and evolutionary computation
On-line evolutionary computation for reinforcement learning in stochastic domains

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Genetic algorithms for action set selection across domains: a demonstration

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Reward allotment in an event-driven hybrid learning classifier system for online soccer games

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Une description probabiliste de la communication parlée entre homme et machine

IHM 2004 Proceedings of the 16th conference on Association Francophone d'Interaction Homme-Machine
The role of multisensor data fusion in neuromuscular control of a sagittal arm with a pair of muscles using actor-critic reinforcement learning method

Technology and Health Care
Adaptive mechanism design: a metalearning approach

ICEC '06 Proceedings of the 8th international conference on Electronic commerce: The new e-commerce: innovations for conquering current barriers, obstacles and limitations to conducting successful business on the internet
Graph kernels and Gaussian processes for relational reinforcement learning

Machine Learning
Division of labor in a group of robots inspired by ants' foraging behavior

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Design patterns from biology for distributed computing

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only

Neural Computation
A Model of Prefrontal Cortical Mechanisms for Goal-directed Behavior

Journal of Cognitive Neuroscience
Learnable behavioural model for autonomous virtual agents: low-level learning

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
A hierarchical approach to efficient reinforcement learning in deterministic domains

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Efficient agents for cliff-edge environments with a large set of decision options

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning from induced changes in opponent (re)actions in multi-agent games

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Rule value reinforcement learning for cognitive agents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
On the relationship between MDPs and the BDI architecture

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Coordinating simple and unreliable agents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Haloperidol Impairs Learning and Error-related Negativity in Humans

Journal of Cognitive Neuroscience
Representation and timing in theories of the dopamine system

Neural Computation
QoS dynamic routing for wireless sensor networks

Proceedings of the 2nd ACM international workshop on Quality of service & security for wireless and mobile networks
Evolving classifiers on field programmable gate arrays: migrating XCS to FPGAs

Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Nature-inspired applications and systems
Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context

International Journal of Robotics and Automation
Fuzzy and tile coding function approximation in agent coevolution

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Kernel rewards regression: an information efficient batch policy iteration approach

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Turning lights out with DQ-learning

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Economy-like reward distribution for division of labor

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
The self-organizing relationship (SOR) network employing fuzzy inference based heuristic evaluation

Neural Networks - 2006 Special issue: Advances in self-organizing maps--WSOM'05
Integrate and conquer: the next generation of intelligent avatars

Proceedings of the 2005 ACM SIGCHI International Conference on Advances in computer entertainment technology
Game design through self-play experiments

Proceedings of the 2005 ACM SIGCHI International Conference on Advances in computer entertainment technology
OMax brothers: a dynamic yopology of agents for improvization learning

Proceedings of the 1st ACM workshop on Audio and music computing multimedia
A Neuro-Dynamic Programming-Based Optimal Controller for Tomato Seedling Growth in Greenhouse Systems

Neural Processing Letters
Quantum robot: structure, algorithms and applications

Robotica
Building autonomic systems using collaborative reinforcement learning

The Knowledge Engineering Review
TAUPE: towards understanding program comprehension

CASCON '06 Proceedings of the 2006 conference of the Center for Advanced Studies on Collaborative research
Modeling energy constrained routing in selfish ad hoc networks

GameNets '06 Proceeding from the 2006 workshop on Game theory for communications and networks
The Role of Problem Classification in Online Meta-cognition

IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Approximate Reasoning in MAS: Rough Set Approach

IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Properties and mechanisms of self-organizing MANET and P2P systems

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Neural mechanism for stochastic behaviour during a competitive game

Neural Networks - 2006 Special issue: Neurobiology of decision making
Modeling of autonomous problem solving process by dynamic construction of task models in multiple tasks environment

Neural Networks - 2006 Special issue: Neurobiology of decision making
Effects of reward expectancy on sequential eye movements in monkeys

Neural Networks - 2006 Special issue: Neurobiology of decision making
Multi-agent learning model with bargaining

Proceedings of the 38th conference on Winter simulation
A reinforcement learning algorithm to minimize the mean tardiness of a single machine with controlled capacity

Proceedings of the 38th conference on Winter simulation
Learning what to talk about in descriptive games

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Learning from imperfect data

Applied Soft Computing
Neural-based downlink scheduling algorithm for broadband wireless networks

Computer Communications
Rough Sets and Vague Concepts

Fundamenta Informaticae - Contagious Creativity - In Honor of the 80th Birthday of Professor Solomon Marcus
$-Calculus of Bounded Rational Agents: Flexible Optimization as Search under Bounded Resources in Interactive Systems

Fundamenta Informaticae
Creating significant learning experiences in introductory artificial intelligence

Proceedings of the 38th SIGCSE technical symposium on Computer science education
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
Binet-Cauchy Kernels on Dynamical Systems and its Application to the Analysis of Dynamic Scenes

International Journal of Computer Vision
RLDDE: A novel reinforcement learning-based dimension and delay estimator for neural networks in time series prediction

Neurocomputing
Behavioral Pattern Identification Through Rough Set Modeling

Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Calculi of Approximation Spaces

Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Rough Set Approach to Behavioral Pattern Identification

Fundamenta Informaticae - New Frontiers in Scientific Discovery - Commemorating the Life and Work of Zdzislaw Pawlak
Dimensions of complexity of intelligent agents

PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
A hybrid system of abductive tactical decision making

International Journal of Hybrid Intelligent Systems
Performance analysis of the AntNet algorithm

Computer Networks: The International Journal of Computer and Telecommunications Networking
Using multi-agent systems for learning optimal policies for complex problems

ACM-SE 45 Proceedings of the 45th annual southeast regional conference
A proposal of the learning system using the recordable multi-layer type rule base and its application for the fire panic problem

Proceedings of the 2006 international conference on Game research and development
Robust automatic target recognition using learning classifier systems

Information Fusion
Aggregation of web search engines based on users' preferences in WebFusion

Knowledge-Based Systems
A document retrieval support system with term relationship

Web Intelligence and Agent Systems
Gradient descent for symmetric and asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Adaptive load balancing of parallel applications with multi-agent reinforcement learning on heterogeneous systems

Scientific Programming - Distributed Computing and Applications
Improved stability and convergence with three factor learning

Neurocomputing
Allocating time and location information to activity-travel patterns through reinforcement learning

Knowledge-Based Systems
Using the XCS Classifier System for Multi-objective Reinforcement Learning Problems

Artificial Life
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies

Mathematics of Operations Research
Synergies Between Intrinsic and Synaptic Plasticity Mechanisms

Neural Computation
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

Neural Computation
The Basal Ganglia and Cortex Implement Optimal Decision Making Between Alternative Actions

Neural Computation
Reinforcement Learning State Estimator

Neural Computation
If multi-agent learning is the answer, what is the question?

Artificial Intelligence
Policy Gradient in Continuous Time

The Journal of Machine Learning Research
Evolutionary Function Approximation for Reinforcement Learning

The Journal of Machine Learning Research
Collaborative Multiagent Reinforcement Learning by Payoff Propagation

The Journal of Machine Learning Research
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents

dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
Causal Graph Based Decomposition of Factored MDPs

The Journal of Machine Learning Research
Point-Based Value Iteration for Continuous POMDPs

The Journal of Machine Learning Research
Approximate Reasoning in MAS: Rough Set Approach

WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Decentralized, adaptive resource allocation for sensor networks

NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Motivated reinforcement learning for adaptive characters in open-ended simulation games

Proceedings of the international conference on Advances in computer entertainment technology
Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exploring selfish reinforcement learning in repeated games with stochastic rewards

Autonomous Agents and Multi-Agent Systems
Learning to communicate in a decentralized environment

Autonomous Agents and Multi-Agent Systems
Local strategy learning in networked multi-agent team formation

Autonomous Agents and Multi-Agent Systems
Knowledge acquisition for adaptive game AI

Science of Computer Programming
Modeling embodied visual behaviors

ACM Transactions on Applied Perception (TAP)
On developmental mental architectures

Neurocomputing
Editorial: New trends in Cognitive Science: Integrative approaches to learning and development

Neurocomputing
To each his own: The caregiver's role in a computational model of gaze following

Neurocomputing
Chaotic time series prediction for the game, Rock-Paper-Scissors

Applied Soft Computing
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

Neural Computation
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension

Evolutionary Computation
Research Issues in Multiple Policy Optimization Using Collaborative Reinforcement Learning

SEAMS '07 Proceedings of the 2007 International Workshop on Software Engineering for Adaptive and Self-Managing Systems
Combining online and offline knowledge in UCT

Proceedings of the 24th international conference on Machine learning
Bayesian actor-critic algorithms

Proceedings of the 24th international conference on Machine learning
Constructing basis functions from directed graphs for value function approximation

Proceedings of the 24th international conference on Machine learning
Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

Proceedings of the 24th international conference on Machine learning
Cross-domain transfer for reinforcement learning

Proceedings of the 24th international conference on Machine learning
Multi-task reinforcement learning: a hierarchical Bayesian approach

Proceedings of the 24th international conference on Machine learning
MILCS: a mutual information learning classifier system

Proceedings of the 9th annual conference companion on Genetic and evolutionary computation
Learning classifier systems

Proceedings of the 9th annual conference companion on Genetic and evolutionary computation
Learning and Cooperation in Sequential Games

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Empirical Studies in Action Selection with Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An Action-Selection Calculus

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Emergence of Mirror Neurons in a Model of Gaze Following

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Grounded Sensorimotor Interaction Histories in an Information Theoretic Metric Space for Robot Ontogeny

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The Neural Basis for Visual Selective Attention in Young Infants: A Computational Account

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Responsive characters from motion fragments

ACM SIGGRAPH 2007 papers
Initial results from the use of learning classifier systems to control in vitro neuronal networks

Proceedings of the 9th annual conference on Genetic and evolutionary computation
Empirical analysis of generalization and learning in XCS with gradient descent

Proceedings of the 9th annual conference on Genetic and evolutionary computation
XCSF with computed continuous action

Proceedings of the 9th annual conference on Genetic and evolutionary computation
Practical learning from one-sided feedback

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning and adaptivity in interactive recommender systems

Proceedings of the ninth international conference on Electronic commerce
Learning to trade with insider information

Proceedings of the ninth international conference on Electronic commerce
Data acquisition and cost-effective predictive modeling: targeting offers for electronic commerce

Proceedings of the ninth international conference on Electronic commerce
On the use of hybrid reinforcement learning for autonomic resource allocation

Cluster Computing
Shaping multi-agent systems with gradient reinforcement learning

Autonomous Agents and Multi-Agent Systems
Metric embedding of view-graphs

Autonomous Robots
A Graph-Based Evolutionary Algorithm: Genetic Network Programming (GNP) and Its Extension Using Reinforcement Learning

Evolutionary Computation
A reinforcement agent for threshold fusion

Applied Soft Computing
Introduction and control of subgoals in reinforcement learning

AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence
Generalized multiagent learning with performance bound

Autonomous Agents and Multi-Agent Systems
Design of a peer-to-peer system for optimized content replication

Computer Communications
Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Neural Computation
Elman Backpropagation as Reinforcement for Simple Recurrent Networks

Neural Computation
Usage-based web recommendations: a reinforcement learning approach

Proceedings of the 2007 ACM conference on Recommender systems
Reinforcement learning in multi-agent environment and ant colony for packet scheduling in routers

Proceedings of the 5th ACM international workshop on Mobility management and wireless access
Affect, Anticipation, and Adaptation: Affect-Controlled Selection of Anticipatory Simulation in Artificial Adaptive Agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of reinforcement learning to the game of Othello

Computers and Operations Research
Analysis and optimization of service availability in a HA cluster with load-dependent machine availability

IEEE Transactions on Parallel and Distributed Systems
Self-organization for search in peer-to-peer networks: the exploitation-exploration dilemma

Proceedings of the 1st international conference on Bio inspired models of network, information and computing systems
Hardware architecture of reinforcement learning scheme for dynamic power management in embedded systems

EURASIP Journal on Embedded Systems
Policy-driven autonomic management of multi-component systems

CASCON '07 Proceedings of the 2007 conference of the center for advanced studies on Collaborative research
Adaptive evolutionary programming based on reinforcement learning

Information Sciences: an International Journal
Universal Intelligence: A Definition of Machine Intelligence

Minds and Machines
A formal framework and extensions for function approximation in learning classifier systems

Machine Learning
Modeling dopamine activity by Reinforcement Learning methods: implications from two recent models

Artificial Intelligence Review
Transfer via inter-task mappings in policy search reinforcement learning

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Parallel reinforcement learning with linear function approximation

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
An incentive mechanism for message relaying in unstructured peer-to-peer systems

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Model-based function approximation in reinforcement learning

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Reinforcement learning with utility-aware agents for market-based resource allocation

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Towards reinforcement learning representation transfer

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed path planning for mobile robots using a swarm of interacting reinforcement learners

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Convergence and rate of convergence of a simple ant model

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
IFSA: incremental feature-set augmentation for reinforcement learning tasks

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed agent-based air traffic flow management

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
A reinforcement learning framework for online data migration in hierarchical storage systems

The Journal of Supercomputing
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence
Learning how to combine sensory-motor functions into a robust behavior

Artificial Intelligence
Simulating interactions of avatars in high dimensional state space

Proceedings of the 2008 symposium on Interactive 3D graphics and games
A novel framework for automatic generation of fuzzy neural networks

Neurocomputing
From schemas to neural networks: A multi-level modelling approach to biologically-inspired autonomous robotic systems

Robotics and Autonomous Systems
A study of mechanisms for improving robotic group performance

Artificial Intelligence
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

International Journal of Robotics Research
Million Module March: Scalable Locomotion for Large Self-Reconfiguring Robots

International Journal of Robotics Research
Learning to Move in Modular Robots using Central Pattern Generators and Online Optimization

International Journal of Robotics Research
Adaptive building of decision trees by reinforcement learning

AIC'07 Proceedings of the 7th Conference on 7th WSEAS International Conference on Applied Informatics and Communications - Volume 7
An approach to fully automatic aircraft collision avoidance and navigation

ACS'07 Proceedings of the 7th Conference on 7th WSEAS International Conference on Applied Computer Science - Volume 7
Active audition using the parameter-less self-organising map

Autonomous Robots
Learning polite behavior with situation models

Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction
Optimizing time warp simulation with reinforcement learning techniques

Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come
Application of stochastic learning automata to intelligent vehicle control

ICCOMP'07 Proceedings of the 11th WSEAS International Conference on Computers
An ASML model for an intelligent vehicle control system

ICCOMP'07 Proceedings of the 11th WSEAS International Conference on Computers
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

Machine Learning
Radial basis networks for the simulation of stand alone AC generators during no-break power transfer

Proceedings of the 2007 Summer Computer Simulation Conference
RL-MAC: a reinforcement learning based MAC protocol for wireless sensor networks

International Journal of Sensor Networks
Dynamic learning of action patterns for object acquisition

International Journal of Intelligent Systems Technologies and Applications
Controlling an autonomous agent using internal value based action selection

International Journal of Intelligent Systems Technologies and Applications
Workstation capacity tuning using reinforcement learning

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Biologically-inspired robot spatial cognition based on rat neurophysiological studies

Autonomous Robots
Design of multi agent adaptive neuro-fuzzy based intelligent controllers for multi-objective nonlinear system

AIKED'05 Proceedings of the 4th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering Data Bases
Deciding what to observe next: adaptive variable selection for regression in multivariate data streams

Proceedings of the 2008 ACM symposium on Applied computing
A hybrid web recommender system based on Q-learning

Proceedings of the 2008 ACM symposium on Applied computing
Extremal search of decision policies for scalable distributed applications

Proceedings of the 2nd international conference on Scalable information systems
Biologically-inspired adaptive learning control strategies: A rough set approach

International Journal of Hybrid Intelligent Systems
Knowledge propagation in a distributed omnidirectional vision system

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Marco Somalvico Memorial Issue
Adaptivity at every layer: a modular approach for evolving societies of learning autonomous systems

Proceedings of the 2008 international workshop on Software engineering for adaptive and self-managing systems
Artificial Intelligence techniques: An introduction to their use for modelling environmental systems

Mathematics and Computers in Simulation
Cooperation learning in Multi-Agent Systems with annotation and reward

International Journal of Knowledge-based and Intelligent Engineering Systems
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

International Journal of Knowledge-based and Intelligent Engineering Systems
The Metaphysical Character of the Criticisms Raised Against the Use of Probability for Dealing with Uncertainty in Artificial Intelligence

Minds and Machines
Error bounds of optimization algorithms for semi-Markov decision processes

International Journal of Systems Science
Investigation of Q-learning in the context of a virtual learning environment

Informatics in education
Real-time dynamic fuzzy Q-learning and control of mobile robots

ICECS'03 Proceedings of the 2nd WSEAS International Conference on Electronics, Control and Signal Processing
A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

Future Generation Computer Systems
Fuzzy Q-Learning with the modified fuzzy ART neural network

Web Intelligence and Agent Systems
A survey of autonomic computing—degrees, models, and applications

ACM Computing Surveys (CSUR)
Advancing the Layered Approach to Agent-Based Crowd Simulation

Proceedings of the 22nd Workshop on Principles of Advanced and Distributed Simulation
An adaptive approach for ensuring reliability in event based middleware

Proceedings of the second international conference on Distributed event-based systems
Learning classifier systems

Proceedings of the 10th annual conference companion on Genetic and evolutionary computation
Scaling ant colony optimization with hierarchical reinforcement learning partitioning

Proceedings of the 10th annual conference on Genetic and evolutionary computation
Towards efficient online reinforcement learning using neuroevolution

Proceedings of the 10th annual conference on Genetic and evolutionary computation
Genetic algorithms for mentor-assisted evaluation function optimization

Proceedings of the 10th annual conference on Genetic and evolutionary computation
Learning all optimal policies with multiple criteria

Proceedings of the 25th international conference on Machine learning
An object-oriented representation for efficient reinforcement learning

Proceedings of the 25th international conference on Machine learning
Active reinforcement learning

Proceedings of the 25th international conference on Machine learning
Hierarchical model-based reinforcement learning: R-max + MAXQ

Proceedings of the 25th international conference on Machine learning
Non-parametric policy gradients: a unified treatment of propositional and relational domains

Proceedings of the 25th international conference on Machine learning
A worst-case comparison between temporal difference and residual gradient with linear function approximation

Proceedings of the 25th international conference on Machine learning
Online kernel selection for Bayesian reinforcement learning

Proceedings of the 25th international conference on Machine learning
The many faces of optimism: a unifying approach

Proceedings of the 25th international conference on Machine learning
A semiparametric statistical approach to model-free policy evaluation

Proceedings of the 25th international conference on Machine learning
Preconditioned temporal difference learning

Proceedings of the 25th international conference on Machine learning
A bayesian logistic regression model for active relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

The Journal of Machine Learning Research
Accelerated Neural Evolution through Cooperatively Coevolved Synapses

The Journal of Machine Learning Research
Rollout sampling approximate policy iteration

Machine Learning
An agent-based system for simulating dynamic choice-sets

Proceedings of the 2008 Spring simulation multiconference
Simulating new markets by introducing new accepting policies for the conventional continuous double auction

Proceedings of the 2008 Spring simulation multiconference
Letters: Synaptic plasticity model of a spiking neural network for reinforcement learning

Neurocomputing
Tuning continual exploration in reinforcement learning: An optimality property of the Boltzmann strategy

Neurocomputing
A novel recurrent neural network-based prediction system for option trading and hedging

Applied Intelligence
A combined tactical and strategic hierarchical learning framework in multi-agent games

Sandbox '08 Proceedings of the 2008 ACM SIGGRAPH symposium on Video games
On updates that constrain the features' connections during learning

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Regulating air traffic flow with coupled agents

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Aligning social welfare and agent preferences to alleviate traffic congestion

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Autonomous transfer for reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Analysis of an evolutionary reinforcement learning method in a multiagent domain

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
The utility of temporal abstraction in reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Switching dynamics of multi-agent learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Sigma point policy iteration

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Dynamics based control with PSRs

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Controlling deliberation in a Markov decision process-based agent

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Expediting RL by using graphical structures

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Transfer of task representation in reinforcement learning using policy-based proto-value functions

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
A new perspective to the keepaway soccer: the takers

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Graph Laplacian based transfer learning in reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Autonomous agent learning using an actor-critic algorithm and behavior models

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Social reward shaping in the prisoner's dilemma

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Identifying beneficial teammates using multi-dimensional trust

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Evolutionary dynamics for designing multi-period auctions

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Sensitivity derivatives for flexible sensorimotor learning

Neural Computation
Network effects and embedded options: decision-making under uncertainty for network technology investments

Information Technology and Management
Adapting the interaction state model in conversational recommender systems

Proceedings of the 10th international conference on Electronic commerce
On the possibility of learning in reactive environments with arbitrary dependence

Theoretical Computer Science
Reinforcement learning of recurrent neural network for temporal coding

Neurocomputing
Efficient Exploration in Reinforcement Learning Based on Utile Suffix Memory

Informatica
Automating cyber-defense management

Proceedings of the 2nd workshop on Recent advances on intrusiton-tolerant systems
State space optimization using plan recognition and reinforcement learning on RTS game

AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
Application of the self organizing maps for visual reinforcement learning of mobile robot

AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
Reinforcement learning for appearance based visual servoing in robotic manipulation

ROCOM'08 Proceedings of the 8th WSEAS International Conference on Robotics, Control and Manufacturing Technology
Geodesic Gaussian kernels for value function approximation

Autonomous Robots
Incremental Learning of Planning Operators in Stochastic Domains

SOFSEM '07 Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science
Real World Multi-agent Systems: Information Sharing, Coordination and Planning

Logic, Language, and Computation
Checking Liveness Properties of Concurrent Systems by Reinforcement Learning

Model Checking and Artificial Intelligence
Reinforcement Learning in Fine Time Discretization

ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
Postural Control of Two-Stage Inverted Pendulum Using Reinforcement Learning and Self-organizing Map

ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part II
Autonomous Learning of Ball Trapping in the Four-Legged Robot League

RoboCup 2006: Robot Soccer World Cup X
Fuzzy Q-Map Algorithm for Reinforcement Learning

Computational Intelligence and Security
Towards Real-Time Distributed Signal Modeling for Brain-Machine Interfaces

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Reinforcement Learning Reward Functions for Unsupervised Learning

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
A Hierarchical Learning System Incorporating with Supervised, Unsupervised and Reinforcement Learning

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
An Extremely Simple Reinforcement Learning Rule for Neural Networks

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Online Dynamic Value System for Machine Learning

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Intelligence Through Interaction: Towards a Unified Theory for Learning

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
State Space Partition for Reinforcement Learning Based on Fuzzy Min-Max Neural Network

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Reinforcement Learning in Nonstationary Environment Navigation Tasks

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Anticipations, Brains, Individual and Social Behavior: An Introduction to Anticipatory Systems

Anticipatory Behavior in Adaptive Learning Systems
Neural Correlates of Anticipation in Cerebellum, Basal Ganglia, and Hippocampus

Anticipatory Behavior in Adaptive Learning Systems
The Role of Anticipation in the Emergence of Language

Anticipatory Behavior in Adaptive Learning Systems
From Actions to Goals and Vice-Versa: Theoretical Analysis and Models of the Ideomotor Principle and TOTE

Anticipatory Behavior in Adaptive Learning Systems
Project "Animat Brain": Designing the Animat Control System on the Basis of the Functional Systems Theory

Anticipatory Behavior in Adaptive Learning Systems
Anticipatory Model of Musical Style Imitation Using Collaborative and Competitive Reinforcement Learning

Anticipatory Behavior in Adaptive Learning Systems
On Affect and Self-adaptation: Potential Benefits of Valence-Controlled Action-Selection

IWINAC '07 Proceedings of the 2nd international work-conference on The Interplay Between Natural and Artificial Computation, Part I: Bio-inspired Modeling of Cognitive Tasks
Feed-Forward Learning: Fast Reinforcement Learning of Controllers

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Combining the Best of the Two Worlds: Inheritance Versus Experience

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Strategies for Affect-Controlled Action-Selection in Soar-RL

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Toward Approximate Adaptive Learning

RSEISP '07 Proceedings of the international conference on Rough Sets and Intelligent Systems Paradigms
Variable Selection for Optimal Decision Making

AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs

ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Co-operative Co-evolutionary System for Solving Dynamic VRPTW Problems with Crisis Situations

HoloMAS '07 Proceedings of the 3rd international conference on Industrial Applications of Holonic and Multi-Agent Systems: Holonic and Multi-Agent Systems for Manufacturing
Cognitive Technical Systems -- What Is the Role of Artificial Intelligence?

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Making a Robot Learn to Play Soccer Using Reward and Punishment

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Multi-agent Learning Dynamics: A Survey

CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
Graph-Based Domain Mapping for Transfer Learning in General Games

ECML '07 Proceedings of the 18th European conference on Machine Learning
Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

ECML '07 Proceedings of the 18th European conference on Machine Learning
Planning and Learning in Environments with Delayed Feedback

ECML '07 Proceedings of the 18th European conference on Machine Learning
Policy Gradient Critics

ECML '07 Proceedings of the 18th European conference on Machine Learning
Sequence Labeling with Reinforcement Learning and Ranking Algorithms

ECML '07 Proceedings of the 18th European conference on Machine Learning
Imitation Learning Using Graphical Models

ECML '07 Proceedings of the 18th European conference on Machine Learning
Uncovering Fraud in Direct Marketing Data with a Fraud Auditing Case Builder

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

AI*IA '07 Proceedings of the 10th Congress of the Italian Association for Artificial Intelligence on AI*IA 2007: Artificial Intelligence and Human-Oriented Computing
Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin

Neural Information Processing
Estimating Internal Variables of a Decision Maker's Brain: A Model-Based Approach for Neuroscience

Neural Information Processing
Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

Neural Information Processing
Computational Modeling of Human-Robot Interaction Based on Active Intention Estimation

Neural Information Processing
Task Learning Based on Reinforcement Learning in Virtual Environment

Neural Information Processing
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

RoboCup 2007: Robot Soccer World Cup XI
Model-Based Reinforcement Learning in a Complex Domain

RoboCup 2007: Robot Soccer World Cup XI
A Framework for Learning in Humanoid Simulated Robots

RoboCup 2007: Robot Soccer World Cup XI
Implementing Parametric Reinforcement Learning in Robocup Rescue Simulation

RoboCup 2007: Robot Soccer World Cup XI
Adaptive Power Management Based on Reinforcement Learning for Embedded System

IEA/AIE '08 Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence
Flexible Control Mechanism for Multi-DOF Robotic Arm Based on Biological Fluctuation

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Integrating Epistemic Action (Active Vision) and Pragmatic Action (Reaching): A Neural Architecture for Camera-Arm Robots

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Toward a Theory of Embodied Statistical Learning

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Mutual Development of Behavior Acquisition and Recognition Based on Value System

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
A Computational Model of the Amygdala Nuclei's Role in Second Order Conditioning

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Scheduling for Reliable Execution in Autonomic Systems

ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Simulation-Based Optimization Approach for Software Cost Model with Rejuvenation

ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Neural Approximation of Monte Carlo Policy Evaluation Deployed in Connect Four

ANNPR '08 Proceedings of the 3rd IAPR workshop on Artificial Neural Networks in Pattern Recognition
Toward Automatic Hint Generation for Logic Proof Tutoring Using Historical Student Data

ITS '08 Proceedings of the 9th international conference on Intelligent Tutoring Systems
Teaching Machine Learning to Design Students

Edutainment '08 Proceedings of the 3rd international conference on Technologies for E-Learning and Digital Entertainment
An Empirical Analysis of the Impact of Prioritised Sweeping on the DynaQ's Performance

ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Epoch-Incremental Queue-Dyna Algorithm

ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
On Using Reinforcement Learning to Solve Sparse Linear Systems

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
RLTE: Reinforcement Learning for Traffic-Engineering

AIMS '08 Proceedings of the 2nd international conference on Autonomous Infrastructure, Management and Security: Resilient Networks and Services
Online Phase-Adaptive Data Layout Selection

ECOOP '08 Proceedings of the 22nd European conference on Object-Oriented Programming
Mixture of Expert Used to Learn Game Play

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Multigrid Reinforcement Learning with Reward Shaping

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Robust Population Coding in Free-Energy-Based Reinforcement Learning

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
A Continuous Internal-State Controller for Partially Observable Markov Decision Processes

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Modular Neural Networks for Model-Free Behavioral Learning

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
From Exploration to Planning

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Mimicking Go Experts with Convolutional Neural Networks

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
A Computational Model of Cortico-Striato-Thalamic Circuits in Goal-Directed Behaviour

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
A Learning Automata Approach to Multi-agent Policy Gradient Learning

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
QFCS: A Fuzzy LCS in Continuous Multi-step Environments with Continuous Vector Actions

Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Evolution Strategies for Direct Policy Search

Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Reinforcement Learning: Insights from Interesting Failures in Parameter Selection

Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Evolving Neural Networks for Online Reinforcement Learning

Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
A Steady-State Genetic Algorithm with Resampling for Noisy Inventory Control

Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Learning Smooth, Human-Like Turntaking in Realtime Dialogue

IVA '08 Proceedings of the 8th international conference on Intelligent Virtual Agents
A New Natural Policy Gradient by Stationary Distribution Metric

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
State-Dependent Exploration for Policy Gradient Methods

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Transferring Instances for Model-Based Reinforcement Learning

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Rule-Based Analysis of Behaviour Learned by Evolutionary and Reinforcement Algorithms

ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Recognizing the Enemy: Combining Reinforcement Learning with Strategy Selection Using Case-Based Reasoning

ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
Forgetting Reinforced Cases

ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
Agent Learning Instead of Behavior Implementation for Simulations --- A Case Study Using Classifier Systems

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Robustness Analysis of SARSA(λ): Different Models of Reward and Initialisation

AIMSA '08 Proceedings of the 13th international conference on Artificial Intelligence: Methodology, Systems, and Applications
Robot Navigation Based on Fuzzy RL Algorithm

ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
Applying Reinforcement Learning to Multi-robot Team Coordination

HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Automated Generation of Knowledge Plane Components for Multimedia Access Networks

MACE '08 Proceedings of the 3rd IEEE international workshop on Modelling Autonomic Communications Environments
A Logical Framework to Reinforcement Learning Using Hybrid Probabilistic Logic Programs

SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
A comparison between ATNoSFERES and Learning Classifier Systems on non-Markov problems

Information Sciences: an International Journal
Value Function Based Reinforcement Learning in Changing Markovian Environments

The Journal of Machine Learning Research
Towards adaptive programming: integrating reinforcement learning into a programming language

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
A reinforcement learning model for supply chain ordering management: An application to the beer game

Decision Support Systems
A reinforced learning control using iterative error compensation for uncertain dynamical systems

International Journal of Computer Mathematics
INFLUENCE OF TEMPERATURE ON SWARMBOTS THAT LEARN

Cybernetics and Systems
REINFORCEMENT LEARNING FOR POMDP USING STATE CLASSIFICATION

Applied Artificial Intelligence
Agent's actions as a classification criteria for the state space in a learning from rewards system

Journal of Experimental & Theoretical Artificial Intelligence
Computational memory architectures for autobiographic agents interacting in a complex virtual environment: a working model

Connection Science
Hierarchical pathfinding and AI-based learning approach in strategy game design

International Journal of Computer Games Technology - Joint International Conference on Cyber Games and Interactive Entertainment 2006
State space segmentation for acquisition of agent behavior

Web Intelligence and Agent Systems
Itinerary determination of imprecise mobile agents with firm deadline

Web Intelligence and Agent Systems
CCMAC: coordinated cooperative MAC for wireless LANs

Proceedings of the 11th international symposium on Modeling, analysis and simulation of wireless and mobile systems
A self-adaptive placement protocol for mobile directories in MANETs

Proceedings of the 11th international symposium on Modeling, analysis and simulation of wireless and mobile systems
Implementation of a neural-based navigation approach on indoor and outdoor mobile robots

CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The Neuromodulatory System: A Framework for Survival and Adaptive Behavior in a Challenging World

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Stimulus representation and the timing of reward-prediction errors in models of the dopamine system

Neural Computation
Eavesdropping: audience interaction in networked audio performance

MM '08 Proceedings of the 16th ACM international conference on Multimedia
A Cultural Algorithm for POMDPs from Stochastic Inventory Control

HM '08 Proceedings of the 5th International Workshop on Hybrid Metaheuristics
Optimal channel selection for spectrum-agile low-power wireless packet switched networks in unlicensed band

EURASIP Journal on Wireless Communications and Networking - Cognitive Radio and Dynamic Spectrum Sharing Systems
ART2 neural network interacting with environment

Neurocomputing
Using temporal-difference learning for multi-agent bargaining

Electronic Commerce Research and Applications
Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes

Simulation
Approximating Arbitrary Reinforcement Signal by Learning Classifier Systems using Micro Genetic Algorithm

Fundamenta Informaticae
Visual reinforcement learning algorithm using self organizing maps and its simulation in OpenGL environment

WSEAS Transactions on Information Science and Applications
Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets

Computational Linguistics
Reinforcement Learning Based Precise Positioning Method for a Millimeters-Sized Omnidirectional Mobile Microrobot

ICIRA '08 Proceedings of the First International Conference on Intelligent Robotics and Applications: Part I
Comparing active vision models

Image and Vision Computing
Experimental Analysis of Sample-Based Maps for Long-Term SLAM

International Journal of Robotics Research
Actor Critic Learning: A Near Set Approach

RSCTC '08 Proceedings of the 6th International Conference on Rough Sets and Current Trends in Computing
Revisiting UCS: Description, Fitness Sharing, and Comparison with XCS

Learning Classifier Systems
A Learning Classifier System with Mutual-Information-Based Fitness

Learning Classifier Systems
Individual and Social Behaviour in the IPA Market with RL

SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Reinforcement Learning with Markov Logic Networks

MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Learning the Filling Policy of a Biodegradation Process by Fuzzy Actor---Critic Learning Methodology

MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Reinforcement Learning on a Futures Market Simulator

KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Proposal of Exploitation-Oriented Learning PS-r#

IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
Simulating Interactions of Characters

Motion in Games
Learning to Attend -- From Bottom-Up to Top-Down

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Biologically Inspired Framework for Learning and Abstract Representation of Attention Control

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
On the Role of Dopamine in Cognitive Vision

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
A tutorial on adaptive MCMC

Statistics and Computing
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

Recent Advances in Reinforcement Learning
Bayesian Reward Filtering

Recent Advances in Reinforcement Learning
Basis Expansion in Natural Actor Critic Methods

Recent Advances in Reinforcement Learning
Reinforcement Learning with the Use of Costly Features

Recent Advances in Reinforcement Learning
Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem

Recent Advances in Reinforcement Learning
Optimistic Planning of Deterministic Systems

Recent Advances in Reinforcement Learning
Policy Iteration for Learning an Exercise Policy for American Options

Recent Advances in Reinforcement Learning
Tile Coding Based on Hyperplane Tiles

Recent Advances in Reinforcement Learning
Use of Reinforcement Learning in Two Real Applications

Recent Advances in Reinforcement Learning
Applications of Reinforcement Learning to Structured Prediction

Recent Advances in Reinforcement Learning
New Error Bounds for Approximations from Projected Linear Equations

Recent Advances in Reinforcement Learning
Partial Order Hierarchical Reinforcement Learning

AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Making Financial Trading by Recurrent Reinforcement Learning

KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
A Study of Reinforcement Learning in a New Multiagent Domain

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Formalizing Multi-state Learning Dynamics

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Learning-Rate Adjusting Q-Learning for Prisoner's Dilemma Games

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
An Information-Theoretic Class of Stochastic Decision Processes

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Towards a Self-Organising Mechanism for Learning Adaptive Decision-Making Rules

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Optimal Local Basis: A Reinforcement Learning Approach for Face Recognition

International Journal of Computer Vision
Learning and planning in environments with delayed feedback

Autonomous Agents and Multi-Agent Systems
Learning to trust in the competence and commitment of agents

Autonomous Agents and Multi-Agent Systems
Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies

PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Hybridizing evolutionary computation and reinforcement learning for the design of almost universal controllers for autonomous robots

Neurocomputing
Implementing plastic weights in neural networks using low precision arithmetic

Neurocomputing
A reinforcement learning based neural network architecture for obstacle avoidance in multi-fingered grasp synthesis

Neurocomputing
Towards end-to-end quality of service: controlling I/O interference in shared storage servers

Proceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware
A novel Artificial Neural Network training method combined with Quantum Computational Multi-Agent System theory

International Journal of Intelligent Systems Technologies and Applications
Simulation and reinforcement learning with soccer agents

Multiagent and Grid Systems - Innovations in intelligent agent technology
A New Learning Algorithm for Optimal Stopping

Discrete Event Dynamic Systems
Strategy-acquisition system for video trading card game

ACE '08 Proceedings of the 2008 International Conference on Advances in Computer Entertainment Technology
Evolutionary computation using reinforced learning on image compression

ISTASC'08 Proceedings of the 8th conference on Systems theory and scientific computation
Unsupervised learning based feature points detection in ECG

ISTASC'08 Proceedings of the 8th conference on Systems theory and scientific computation
Evolutionary computation using reinforced learning on image compression

SSIP'08 Proceedings of the 8th conference on Signal, Speech and image processing
Unsupervised learning based feature points detection in ECG

SSIP'08 Proceedings of the 8th conference on Signal, Speech and image processing
Intentional learning agent architecture

Autonomous Agents and Multi-Agent Systems
General Game Playing with Ants

SEAL '08 Proceedings of the 7th International Conference on Simulated Evolution and Learning
Performance Evaluation of an Adaptive Ant Colony Optimization Applied to Single Machine Scheduling

SEAL '08 Proceedings of the 7th International Conference on Simulated Evolution and Learning
Improving the Exploration Strategy in Bandit Algorithms

Learning and Intelligent Optimization
Tuning Local Search by Average-Reward Reinforcement Learning

Learning and Intelligent Optimization
Hierarchical Classifiers for Complex Spatio-temporal Concepts

Transactions on Rough Sets IX
An adaptive middleware for supporting time-critical event response

Cluster Computing
Imitation guided learning in learning classifier systems

Natural Computing: an international journal
A Machine Learning Method for Dynamic Traffic Control and Guidance on Freeway Networks

CAR '09 Proceedings of the 2009 International Asia Conference on Informatics in Control, Automation and Robotics
A spiking neural network model of an actor-critic learning agent

Neural Computation
The factored policy-gradient planner

Artificial Intelligence
A new evolutionary reinforcement scheme for stochastic learning automata

ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
How people talk when teaching a robot

Proceedings of the 4th ACM/IEEE international conference on Human robot interaction
Motivated Learning from Interesting Events: Adaptive, Multitask Learning Agents for Complex Environments

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Factored value iteration converges

Acta Cybernetica
Factored temporal difference learning in the new ties environment

Acta Cybernetica
A role-oriented BDI framework for real-time multiagent teaming

Intelligent Decision Technologies
A hypercube-based encoding for evolving large-scale neural networks

Artificial Life
Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Theoretical advances of intelligent paradigms
Some topics for simulation optimization

Proceedings of the 40th Conference on Winter Simulation
Predictive models in the brain

Connection Science
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
Reinforcement-learning agents with different temperature parameters explain the variety of human action-selection behavior in a Markov decision process task

Neurocomputing
Letters: On the bias of batch Bellman residual minimisation

Neurocomputing
QL2, a simple reinforcement learning scheme for two-player zero-sum Markov games

Neurocomputing
Gaussian process dynamic programming

Neurocomputing
Cognitive agents - a procedural perspective relying on the predictability of Object-Action-Complexes (OACs)

Robotics and Autonomous Systems
Reinforcement distribution in fuzzy Q-learning

Fuzzy Sets and Systems
A tractable hybrid ddn–pomdp approach to affective dialogue modeling for probabilistic frame-based dialogue systems

Natural Language Engineering
Does this list contain what you were searching for? Learning adaptive dialogue strategies for interactive question answering

Natural Language Engineering
Reinforcement Learning with Orthonormal Basis Adaptation Based on Activity-Oriented Index Allocation

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Modeling reinforcement learning algorithms for performance analysis

Proceedings of the International Conference on Advances in Computing, Communication and Control
Development of two-level decision tree-based real-time scheduling system under product mix variety environment

Robotics and Computer-Integrated Manufacturing
An Adaptable Oscillator-Based Controller for Autonomous Robots

Journal of Intelligent and Robotic Systems
Boosting the performance of computing systems through adaptive configuration tuning

Proceedings of the 2009 ACM symposium on Applied Computing
Comparing Learning Attention Control in Perceptual and Decision Space

Attention in Cognitive Systems
A case-based approach for coordinated action selection in robot soccer

Artificial Intelligence
A study of secondary spectrum use using agent-based computational economics

Netnomics
Linear Bellman combination for control of character animation

ACM SIGGRAPH 2009 papers
Learning Actions through Imitation and Exploration: Towards Humanoid Robots That Learn from Humans

Creating Brain-Like Intelligence
Co-evolution of Rewards and Meta-parameters in Embodied Evolution

Creating Brain-Like Intelligence
Basal Ganglia Models for Autonomous Behavior Learning

Creating Brain-Like Intelligence
A Probabilistic Approach for Mining Drifting User Interest

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Principal-agent learning

Decision Support Systems
COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS

Cybernetics and Systems
An Optimal Approximate Dynamic Programming Algorithm for the Lagged Asset Acquisition Problem

Mathematics of Operations Research
Reoptimization Approaches for the Vehicle-Routing Problem with Stochastic Demands

Operations Research
Dynamic Pricing with Online Learning and Strategic Consumers: An Application of the Aggregating Algorithm

Operations Research
Dorsal striatal-midbrain connectivity in humans predicts how reinforcements are used to guide decisions

Journal of Cognitive Neuroscience
Simultaneous Optimal Control and Discrete Stochastic Sensor Selection

HSCC '09 Proceedings of the 12th International Conference on Hybrid Systems: Computation and Control
Reinforcement Learning: A Tutorial Survey and Recent Advances

INFORMS Journal on Computing
Color learning and illumination invariance on mobile robots: A survey

Robotics and Autonomous Systems
Multi-robot task allocation through vacancy chain scheduling

Robotics and Autonomous Systems
Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems

Organization Science
An Approximate Dynamic Programming Algorithm for Large-Scale Fleet Management: A Case Application

Transportation Science
Fuzzy CMAC with automatic state partition for reinforcementlearning

Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
An Improved Hierarchical Markovian Target Tracking (I-HMTT) Algorithm for Energy Efficient Wireless Sensor Networks

CNSR '09 Proceedings of the 2009 Seventh Annual Communication Networks and Services Research Conference
Static strategy and dynamic adjustment: An effective method for Grid task scheduling

Future Generation Computer Systems
Designing autonomous layered video coders

Image Communication
Learning the IPA market with individual and social rewards

Web Intelligence and Agent Systems
Analysis and improvement of the genetic discovery component of XCS

International Journal of Hybrid Intelligent Systems - Data Mining and Hybrid Intelligent Systems
A model for the dynamic coordination of multiple competing goals

Journal of Experimental & Theoretical Artificial Intelligence
A new marketing strategy map for direct marketing

Knowledge-Based Systems
Dynamic analysis of multiagent Q-learning with ε-greedy exploration

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Hoeffding and Bernstein races for selecting policies in evolutionary direct policy search

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Regularization and feature selection in least-squares temporal difference learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Binary action search for learning continuous-action control policies

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Constraint relaxation in approximate linear programs

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning when to stop thinking and do something!

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Model-free reinforcement learning as mixture learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Discovering options from example trajectories

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
VCONF: a reinforcement learning approach to virtual machines auto-configuration

ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Automatic exploration of datacenter performance regimes

ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
Responsive elastic computing

GMAC '09 Proceedings of the 6th international conference industry session on Grids meets autonomic computing
Training a real-world POMDP-based dialogue system

NAACL-HLT-Dialog '07 Proceedings of the Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technologies
Technical support dialog systems: issues, problems, and solutions

NAACL-HLT-Dialog '07 Proceedings of the Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technologies
Improving recommender systems with adaptive conversational strategies

Proceedings of the 20th ACM conference on Hypertext and hypermedia
Stronger CDA strategies through empirical game-theoretic analysis and reinforcement learning

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Adaptive learning in evolving task allocation networks

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
SarsaLandmark: an algorithm for learning in POMDPs with landmarks

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Solving multiagent assignment Markov decision processes

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Generalized model learning for reinforcement learning in factored domains

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Online exploration in least-squares policy iteration

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An empirical analysis of value function-based and policy search reinforcement learning

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
State-coupled replicator dynamics

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
A task specification language for bootstrap learning

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Modelling the dynamics of multiagent Q-learning with ε-greedy exploration

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Fuzzy Kanerva-based function approximation for reinforcement learning

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Scheduling policy design for autonomic systems

International Journal of Autonomous and Adaptive Communications Systems
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings

Similarity-Based Clustering
Using Strongly Connected Components as a Basis for Autonomous Skill Acquisition in Reinforcement Learning

ISNN '09 Proceedings of the 6th International Symposium on Neural Networks on Advances in Neural Networks
Reordering Sparsification of Kernel Machines in Approximate Policy Iteration

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Automatic control based on wasp behavioral model and stochastic learning automata

MAMECTIS'08 Proceedings of the 10th WSEAS international conference on Mathematical methods, computational techniques and intelligent systems
Demonstration of a POMDP voice dialer

HLT-Demonstrations '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session
An Inductive Logic Programming Approach to Statistical Relational Learning

Proceedings of the 2005 conference on An Inductive Logic Programming Approach to Statistical Relational Learning
Improving Batch Reinforcement Learning Performance through Transfer of Samples

Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
Direct Policy Search Reinforcement Learning for Robot Control

Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Transfer Learning and Intelligence: an Argument and Approach

Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Artificial general intelligence: an organism and level based position statement

Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
On the Broad Implications of Reinforcement Learning based AGI

Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Towards an Intelligent Tutoring System for Propositional Proof Construction

Proceedings of the 2008 conference on Current Issues in Computing and Philosophy
Evolving Computer Game Playing via Human-Computer Interaction: Machine Learning Tools in the Knowledge Engineering Life-Cycle

Proceedings of the 2008 conference on Knowledge-Based Software Engineering: Proceedings of the Eighth Joint Conference on Knowledge-Based Software Engineering
Fast Learning in an Actor-Critic Architecture with Reward and Punishment

Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Towards Automatic Model Generation by Optimization

Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Learning by Automatic Option Discovery from Conditionally Terminating Sequences

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Least Squares SVM for Least Squares TD Learning

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Multi-Agent Least-Squares Policy Iteration

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Reinforcement Learning with Classifier Selection for Focused Crawling

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Dynamic Multi-Armed Bandit with Covariates

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Reinforcement Learning with the Use of Costly Features

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Multi-Agent Reinforcement Learning for Intrusion Detection: A case study and evaluation

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning to Select Object Recognition Methods for Autonomous Mobile Robots

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games

KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Design and performance analysis of an inductive QoS routing algorithm

Computer Communications
Reinforcement learning for robot soccer

Autonomous Robots
Learning teaching strategies in an Adaptive and Intelligent Educational System through Reinforcement Learning

Applied Intelligence
Neuroevolutionary reinforcement learning for generalized helicopter control

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Novelty of behaviour as a basis for the neuro-evolution of operant reward learning

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
EDA-RL: estimation of distribution algorithms for reinforcement learning problems

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Evolving an autonomous agent for non-Markovian reinforcement learning

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Uncertainty handling CMA-ES for reinforcement learning

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
uQFCS: QFCS with unfixed fuzzy sets in continuous multi-step environments with continuous vector actions

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Simulating human grandmasters: evolution and coevolution of evaluation functions

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
On the characteristics of sequential decision problems and their impact on evolutionary computation

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Learning in the time-dependent minority game

Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
Reinforcement learning for games: failures and successes

Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
TEMMAS: The Electricity Market Multi-Agent Simulator

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Two Steps Reinforcement Learning in Continuous Reinforcement Learning Tasks

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Social and Cognitive System for Learning Negotiation Strategies with Incomplete Information

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning*

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Contribution to the Control of a MAS's Global Behaviour: Reinforcement Learning Tools

Engineering Societies in the Agents World IX
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes

Anticipatory Behavior in Adaptive Learning Systems
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

Anticipatory Behavior in Adaptive Learning Systems
The kNN-TD Reinforcement Learning Algorithm

IWINAC '09 Proceedings of the 3rd International Work-Conference on The Interplay Between Natural and Artificial Computation: Part I: Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira's Scientific Legacy
Multi-agent Reinforcement Learning in Network Management

AIMS '09 Proceedings of the 3rd International Conference on Autonomous Infrastructure, Management and Security: Scalability of Networks and Services
Finding Errors of Hybrid Systems by Optimising an Abstraction-Based Quality Estimate

TAP '09 Proceedings of the 3rd International Conference on Tests and Proofs
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Using machine learning in a cooperative hybrid parallel strategy of metaheuristics

Information Sciences: an International Journal
Metastable Walking Machines

International Journal of Robotics Research
Toward Rough-Granular Computing

RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Robotic Target Tracking with Approximation Space-Based Feedback During Reinforcement Learning

RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
A q-learning based adaptive bidding strategy in combinatorial auctions

Proceedings of the 11th International Conference on Electronic Commerce
Randomized shortest-path problems: Two related models

Neural Computation
Performance bounded reinforcement learning in strategic interactions

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
An instance-based state representation for network repair

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Reinforcement learning for a CPG-driven biped robot

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Machine learning for adaptive image interpretation

IAAI'04 Proceedings of the 16th conference on Innovative applications of artifical intelligence
Towards autonomic computing: adaptive job routing and scheduling

IAAI'04 Proceedings of the 16th conference on Innovative applications of artifical intelligence
Incremental least squares policy iteration for POMDPs

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Learning representation and control in continuous Markov decision processes

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
QUICR-learning for multi-agent coordination

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Real-time evolution of neural networks in the NERO video game

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Curiosity-driven exploration with planning trajectories

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Inter-task action correlation for reinforcement learning tasks

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Modeling human decision making in cliff-edge environments

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Incremental least-squares temporal difference learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
A simple and effective method for incorporating advice into kernel methods

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Using Homomorphisms to transfer options across continuous reinforcement learning domains

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Sample-efficient evolutionary function approximation for reinforcement learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Hard constrained semi-Markov decision processes

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Learning partially observable action schemas

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Interactively shaping agents via human reinforcement: the TAMER framework

Proceedings of the fifth international conference on Knowledge capture
A GeoAgent-based framework for knowledge-oriented representation: Embracing social rules in GIS

International Journal of Geographical Information Science
Online Markov Decision Processes

Mathematics of Operations Research
Case-Based Reasoning in Transfer Learning

ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Quality Enhancement Based on Reinforcement Learning and Feature Weighting for a Critiquing-Based Recommender

ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Development of Symbiotic Brain-Machine Interfaces Using a Neurophysiology Cyberworkstation

Proceedings of the 13th International Conference on Human-Computer Interaction. Part II: Novel Interaction Methods and Techniques
NJFun: a reinforcement learning spoken dialogue system

ConversationalSys '00 Proceedings of the ANLP-NAACL 2000 Workshop on Conversational Systems
Natural language generation as planning under uncertainty for spoken dialogue systems

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Predicting investment behavior: An augmented reinforcement learning model

Neurocomputing
Emerging motor behaviors: Learning joint coordination in articulated mobile robots

Neurocomputing
Prediction of solar conditions by emotional learning

Intelligent Data Analysis
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
Learning lexical alignment policies for generating referring expressions in spoken dialogue systems

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Machine learning in digital games: a survey

Artificial Intelligence Review
Hybrid least-squares algorithms for approximate policy evaluation

Machine Learning
A DR algorithm based on artificial potential field method

Multimedia Tools and Applications
Layered Intelligence for Agent-based Crowd Simulation

Simulation
A Study on Real-Time Scheduling for Holonic Manufacturing Systems --- Determination of Utility Values Based on Multi-agent Reinforcement Learning

HoloMAS '09 Proceedings of the 4th International Conference on Industrial Applications of Holonic and Multi-Agent Systems: Holonic and Multi-Agent Systems for Manufacturing
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Optimal Online Learning Procedures for Model-Free Policy Evaluation

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Learning the Difference between Partially Observable Dynamical Systems

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Extending the Strada Framework to Design an AI for ORTS

ICEC '09 Proceedings of the 8th International Conference on Entertainment Computing
Reinforcement Learning for Blackjack

ICEC '09 Proceedings of the 8th International Conference on Entertainment Computing
Efficient Sample Reuse in EM-Based Policy Search

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Feature Selection for Value Function Approximation Using Bayesian Model Selection

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Considering Unseen States as Impossible in Factored Reinforcement Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Learning to become an expert: Reinforcement learning and the acquisition of perceptual expertise

Journal of Cognitive Neuroscience
Robust task-based control policies for physics-based characters

ACM SIGGRAPH Asia 2009 papers
Efficient no-regret multiagent learning

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Non-stationary policy learning in 2-player zero sum games

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Value functions for RL-based behavior transfer: a comparative study

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Samuel meets Amarel: automating value function approximation using global state space analysis

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Error bounds for approximate value iteration

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Improving action selection in MDP's via knowledge transfer

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Lazy approximation for solving continuous finite-horizon MDPs

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
The max K-armed bandit: a new model of exploration applied to search heuristic selection

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Adaptive modeling and planning for reactive agents

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Towards competence in autonomous agents

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Improving reinforcement learning function approximators via neuroevolution

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Compact spectral bases for value function approximation using Kronecker factorization

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Efficient reinforcement learning with relocatable action models

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Efficient structure learning in factored-state MDPs

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Thresholded rewards: acting optimally in timed, zero-sum games

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Temporal difference and policy search methods for reinforcement learning: an empirical comparison

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
On policy learning in restricted policy spaces

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Autonomous inter-task transfer in reinforcement learning domains

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Coordination and multi-tasking using EMT

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Markov decision processes for control of a sensor network-based health monitoring system

IAAI'05 Proceedings of the 17th conference on Innovative applications of artificial intelligence - Volume 3
RETALIATE: learning winning policies in first-person shooter games

IAAI'07 Proceedings of the 19th national conference on Innovative applications of artificial intelligence - Volume 2
Reinforcement learning for vulnerability assessment in peer-to-peer networks

IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
Adaptive treatment of epilepsy via batch-mode reinforcement learning

IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
A case study on the critical role of geometric regularity in machine learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Adaptive importance sampling with automatic model selection in value function approximation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Fast spectral learning using Lanczos eigenspace projections

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Adaptive management of air traffic flow: a multiagent coordination approach

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Autonomous robot skill acquisition

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Evaluation of a hierarchical reinforcement learning spoken dialogue system

Computer Speech and Language
Searching for grammar right

ScaNaLU '06 Proceedings of the Third Workshop on Scalable Natural Language Understanding
An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email

Journal of Artificial Intelligence Research
OBDD-based universal planning for synchronized agents in non-deterministic domains

Journal of Artificial Intelligence Research
Hierarchical reinforcement learning with the MAXQ value function decomposition

Journal of Artificial Intelligence Research
Accelerating reinforcement learning by composing solutions of automatically identified subtasks

Journal of Artificial Intelligence Research
Optimizing dialogue management with reinforcement learning: experiments with the NJFun system

Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox

Journal of Artificial Intelligence Research
Potential-based shaping and Q-value initialization are equivalent

Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
Reinforcement learning for agents with many sensors and actuators acting in categorizable environments

Journal of Artificial Intelligence Research
Risk-sensitive reinforcement learning applied to control under constraints

Journal of Artificial Intelligence Research
Perseus: randomized point-based value iteration for POMDPs

Journal of Artificial Intelligence Research
Integrating learning from examples into the search for diagnostic policies

Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems

Journal of Artificial Intelligence Research
Learning in real-time search: a unifying framework

Journal of Artificial Intelligence Research
Solving factored MDPs with hybrid state and action variables

Journal of Artificial Intelligence Research
Anytime point-based approximations for large POMDPs

Journal of Artificial Intelligence Research
Closed-loop learning of visual control policies

Journal of Artificial Intelligence Research
Learning to play using low-complexity rule-based policies: illustrations through Ms. Pac-Man

Journal of Artificial Intelligence Research
Optimal and approximate Q-value functions for decentralized POMDPs

Journal of Artificial Intelligence Research
Adaptive stochastic resource control: a machine learning approach

Journal of Artificial Intelligence Research
Learning partially observable deterministic action models

Journal of Artificial Intelligence Research
Learning to reach agreement in a continuous ultimatum game

Journal of Artificial Intelligence Research
Interactive policy learning through confidence-based autonomy

Journal of Artificial Intelligence Research
Infinite-horizon policy-gradient estimation

Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation

Journal of Artificial Intelligence Research
Autonomous concept formation

IJCAI'99 Proceedings of the 16th international joint conference on Artifical intelligence - Volume 1
Reinforcement algorithms using functional approximation for generalization and their application to cart centering and fractal compression

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Learning and multiagent reasoning for autonomous agents

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A call admission control scheme using neuroevolution algorithm in cellular networks

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
General game learning using knowledge transfer

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Online learning and exploiting relational models in reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Utile distinctions for relational reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
State similarity based approach for improving performance in RL

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Building portable options: skill transfer in reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Transfer learning in real-time strategy games using hybrid CBR/RL

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Direct code access in self-organizing neural networks for reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Dynamics of temporal difference learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning policies for embodied virtual agents through demonstration

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning to walk through imitation

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Using linear programming for Bayesian exploration in Markov decision processes

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
An analysis of Laplacian methods for value function approximation in MDPs

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Bayesian inverse reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Detecting and forecasting economic regimes in multi-agent automated exchanges

Decision Support Systems
Simultaneous adversarial multi-robot learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Generalizing plans to new environments in relational MDPs

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Use of off-line dynamic programming for efficient image interpretation

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Modular self-organization for a long-living autonomous agent

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Topology selection for stream mining systems

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Adaptive Learning Based on Exercises Fitness Degree

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Reinforcement Learning in RoboCup KeepAway with Partial Observability

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Tank War Using Online Reinforcement Learning

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Real-time planning for parameterized human motion

Proceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Customizing directions in an automated wayfinding system for individuals with cognitive impairment

Proceedings of the 11th international ACM SIGACCESS conference on Computers and accessibility
Selecting GVT interval for time-warp-based distributed simulation using reinforcement learning technique

SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques

Robotics and Autonomous Systems
Operant matching as a nash equilibrium of an intertemporal game

Neural Computation
A neurocomputational model for cocaine addiction

Neural Computation
Learning Bayesian network equivalence classes with Ant Colony optimization

Journal of Artificial Intelligence Research
Towards a general framework for cross-layer decision making in multimedia systems

IEEE Transactions on Circuits and Systems for Video Technology
Temporal difference learning applied to a high-performance game-playing program

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Reinforcement learning in distributed domains: beyond team games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Fast concurrent reinforcement learners

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Multi-agent systems by incremental gradient reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Exploiting multiple secondary reinforcers in policy gradient reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Rational and convergent learning in stochastic games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
State abstraction discovery from irrelevant state variables

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Using predictive representations to improve generalization in reinforcement learning

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Two-sided bandits and the dating market

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Robust planning with (L)RTDP

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Motivated agents

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Evolutionary behavior learning for action-based environment modeling by a mobile robot

Applied Soft Computing
Dynamic Customer Management and the Value of One-to-One Marketing

Marketing Science
Intelligence Dynamics: a concept and preliminary experiments for open-ended learning agents

Autonomous Agents and Multi-Agent Systems
Learning classifier systems: a complete introduction, review, and roadmap

Journal of Artificial Evolution and Applications
Finding optimal satisficing strategies for and-or trees

Artificial Intelligence
Effective learning in the presence of adaptive counterparts

Journal of Algorithms
Learning of shared attention in sociable robotics

Journal of Algorithms
Neuroevolution strategies for episodic reinforcement learning

Journal of Algorithms
Intensional dynamic programming. A Rosetta stone for structured dynamic programming

Journal of Algorithms
Short survey: Taxonomy and survey of RFID anti-collision protocols

Computer Communications
States representations with a hierarchical dependency in reinforcement learning

ISC '07 Proceedings of the 10th IASTED International Conference on Intelligent Systems and Control
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents

Computer Networks: The International Journal of Computer and Telecommunications Networking
Remote patient monitoring service using heterogeneous wireless access networks: architecture and optimization

IEEE Journal on Selected Areas in Communications - Special issue on wireless and pervasive communications for healthcare
Fuzzy-UCS: a Michigan-style learning fuzzy-classifier system for supervised learning

IEEE Transactions on Evolutionary Computation
Interactive evolution of particle systems for computer graphics and animation

IEEE Transactions on Evolutionary Computation
A reward field model generation in Q-learning by dynamic programming

Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human
Modeling the influences of cyclic top-down and bottom-up processes for reinforcement learning in eye movements

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Ant colony optimization incorporated with fuzzy Q-learning for reinforcement fuzzy control

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Optimal contraction theorem for exploration-exploitation tradeoff in search and optimization

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Keeping the resident in the loop: adapting the smart home to the user

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Reinforcement learning versus model predictive control: a comparison on a power system problem

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Neural network output optimization using interval analysis

IEEE Transactions on Neural Networks
A Q-learning approach to derive optimal consumption and investment strategies

IEEE Transactions on Neural Networks
Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit

IEEE Transactions on Neural Networks
Learning Deep Architectures for AI

Foundations and Trends® in Machine Learning
Coordination motion-tasks using actual robot dynamics

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
To Elicit Or To Tell: Does It Matter?

Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Utility in hint generation: Selection of hints from a corpus of student work

Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Using neural gas for a better machine identity description

ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
DCOPs meet the realworld: exploring unknown reward matrices with applications to mobile sensor networks

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Active policy iteration: efficient exploration through active learning for value function approximation in reinforcement learning

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Efficient skill learning using abstraction selection

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Autonomously learning an action hierarchy using a learned qualitative state representation

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Solving POMDPs: RTDP-bel vs. point-based algorithms

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Learning hierarchical task networks for nondeterministic planning domains

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
An RL-based scheduling algorithm for video traffic in high-rate wireless personal area networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
FICA: A novel intelligent crawling algorithm based on reinforcement learning

Web Intelligence and Agent Systems
Enaction-Based Artificial Intelligence: Toward Co-evolution with Humans in the Loop

Minds and Machines
Structured prediction with reinforcement learning

Machine Learning
Imitation as a mechanism of cultural transmission

Artificial Life
Reinforcement learning and adaptive dynamic programming for feedback control

IEEE Circuits and Systems Magazine
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach

Fundamenta Informaticae - Swarm Intelligence
Coordination motion-tasks using actual robot dynamics

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A rules-based approach for configuring chains of classifiers in real-time stream mining systems

EURASIP Journal on Advances in Signal Processing
An efficient MAC protocol for throughput enhancement in dense RFID system

ISWPC'09 Proceedings of the 4th international conference on Wireless pervasive computing
Customized learning algorithms for episodic tasks withacyclic state spaces

CASE'09 Proceedings of the fifth annual IEEE international conference on Automation science and engineering
MDP based active localization for multiple robots

CASE'09 Proceedings of the fifth annual IEEE international conference on Automation science and engineering
A computational neuroscience model of working memory with application to robot perceptual learning

CI '07 Proceedings of the Third IASTED International Conference on Computational Intelligence
Exploration and exploitation balance management in fuzzy reinforcement learning

Fuzzy Sets and Systems
Beyond economics for guiding large public policy issues: Lessons from the Bell System divestiture and the California electricity crisis

Decision Support Systems
SIMBA: A simulator for business education and research

Decision Support Systems
Stochastic model for outcome prediction in acute illness

Computers in Biology and Medicine
Reinforcement learning for mapping instructions to actions

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
A novel framework for dynamic spectrum management in multicell OFDMA networks based on reinforcement learning

WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Markov decision process frameworks for cooperative retransmission in wireless networks

WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Q-learning for joint access decision in heterogeneous networks

WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Coevolving intelligent game players in a cultural framework

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Overcoming the bootstrap problem in evolutionary robotics using behavioral diversity

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
On-line neuroevolution applied to the open racing car simulator

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Investigating the effect of pruning on the diversity and fitness of robot controllers based on MDL2Ɛ during genetic programming

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Memory-enhanced evolutionary robotics: the echo state network approach

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
An Additive Reinforcement Learning

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Learning Automata Based Intelligent Tutorial-like System

KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part I
Inference and Learning in Planning (Extended Abstract)

DS '09 Proceedings of the 12th International Conference on Discovery Science
EcoSimNet: A Multi-Agent System for Ecological Simulation and Optimization

EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Anytime Self-play Learning to Satisfy Functional Optimality Criteria

ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Effectiveness of Intrinsically Motivated Adaptive Agent for Sustainable Human-Agent Interaction

ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

PRIMA '09 Proceedings of the 12th International Conference on Principles of Practice in Multi-Agent Systems
Dynamic tuning of online data migration policies in hierarchical storage systems using reinforcement learning

Dynamic tuning of online data migration policies in hierarchical storage systems using reinforcement learning
A reinforcement learning framework for utility-based scheduling in resource-constrained systems

A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning approach to dynamic resource allocation

A reinforcement learning approach to dynamic resource allocation
Adaptive data-aware utility-based scheduling in resource-constrained systems

Adaptive data-aware utility-based scheduling in resource-constrained systems
A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments
Dynamic adaptation of user migration policies in distributed virtual environments

Dynamic adaptation of user migration policies in distributed virtual environments
Scalable approach for effective control of gene regulatory networks

Artificial Intelligence in Medicine
An artificial immune network approach for pinyin-to- character conversion

VECIMS'09 Proceedings of the 2009 IEEE international conference on Virtual Environments, Human-Computer Interfaces and Measurement Systems
Simulating sellers in online exchanges

Decision Support Systems
Approximate dynamic programming using Bellman residual elimination and Gaussian process regression

ACC'09 Proceedings of the 2009 conference on American Control Conference
Fuzzy ant colony optimization for optimal control

ACC'09 Proceedings of the 2009 conference on American Control Conference
Robust adaptive Markov decision processes in multi-vehicle applications

ACC'09 Proceedings of the 2009 conference on American Control Conference
A Q-learning model-independent flow controller for high-speed networks

ACC'09 Proceedings of the 2009 conference on American Control Conference
Multiresolution state-space discretization method for Q-learning

ACC'09 Proceedings of the 2009 conference on American Control Conference
Nash Q-learning multi-agent flow control for high-speed networks

ACC'09 Proceedings of the 2009 conference on American Control Conference
Which landmark is useful?: learning selection policies for navigation in unknown environments

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Least absolute policy iteration for robust value function approximation

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Architecture of behavior-based and robotics self-optimizing memory controller

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Transfer of knowledge for a climbing virtual human: a reinforcement learning approach

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Learning motor primitives for robotics

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Adaptive autonomous control using online value iteration with Gaussian processes

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Smoothed Sarsa: reinforcement learning for robot delivery tasks

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A hybrid search algorithm in a multi-agent system environment for multicriteria optimization of products design

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A motor learning neural model based on Bayesian network and reinforcement learning

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Prerequisites for integrating unsupervised and reinforcement learning in a single network of spiking neurons

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Relational reinforcement learning applied to shared attention

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Using continuous action spaces to solve discrete problems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Coordinated multiple ramps metering based on neuro-fuzzy adaptive dynamic programming

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Reinforcement learning of multiple tasks using parametric bias

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A dynamical connectionist model of idea generation

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A parallel hybrid implementation using genetic algorithm, GRASP and reinforcement learning

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Generalized policy iteration for continuous-time systems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Reconfigurable disruption tolerant routing via reinforcement learning

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Improving management of Anemia in end stage renal disease using reinforcement learning

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Goal-directed feature learning

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
From mirror neurons to computational neurolinguistics

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Dialogue act prediction using stochastic context-free grammar induction

CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
On the asymptotic equivalence between differential Hebbian and temporal difference learning

Neural Computation
Novel runtime systems support for adaptive compositional modeling in PSEs

Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
k-nearest neighbor Monte-Carlo control algorithm for POMDP-based dialogue systems

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Agent-based buddy-finding methodology for knowledge sharing

Information and Management
Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning

Neural Computation
Associating domain-dependent knowledge and Monte Carlo approaches within a Go program

Information Sciences: an International Journal
Layering and heterogeneity as design principles for animated embedded agents

Information Sciences: an International Journal
Computational intelligence for structured learning of a partner robot based on imitation

Information Sciences: an International Journal
Switching between different state representations in reinforcement learning

AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
Impacts of team size on role learning in multiagent systems

AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
RL-based superframe order adaptation algorithm for IEEE 802.15.4 networks

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
An adaptive inventory control for a supply chain

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
SI-CCMAC: sender initiating concurrent cooperative MAC for wireless LANs

WiOPT'09 Proceedings of the 7th international conference on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks
Bridging the gap between feature- and grid-based SLAM

Robotics and Autonomous Systems
An energy-efficient data gathering algorithm to prolong lifetime of wireless sensor networks

Computer Communications
Probabilistic fuzzy logic system: a tool to process stochastic and imprecise information

FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
View estimation learning based on value system

FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Human instruction recognition and self behavior acquisition based on state value

FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Protecting buying agents in e-marketplaces by direct experience trust modelling

Knowledge and Information Systems
Temporal difference learning with interpolated table value functions

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Improving temporal difference game agent control using a dynamic exploration rate during control learning

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Evolution versus temporal difference learning for learning to play Ms. Pac-Man

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Coevolutionary temporal difference learning for Othello

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Introducing a round robin tournament into Blondie24

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Fuzzy Q-learning in a nondeterministic environment: developing an intelligent Ms. Pac-Man agent

CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Facetwise analysis of XCS for problems with class imbalances

IEEE Transactions on Evolutionary Computation
Reinforcement interval type-2 fuzzy controller design by online rule generation and Q-value-aided ant colony optimization

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Emotion Learning: Solving a Shortest Path Problem in an Arbitrary Deterministic Environment in Linear Time with an Emotional Agent

International Journal of Applied Mathematics and Computer Science - Selected Problems of Computer Science and Control
An optimal warning-zone-length assignment algorithm for real-time and multiple-QoS on-chip bus arbitration

ACM Transactions on Embedded Computing Systems (TECS)
Adaptive dynamic programming: an introduction

IEEE Computational Intelligence Magazine
A survey of collaborative filtering techniques

Advances in Artificial Intelligence
Intercluster connection in cognitive wireless mesh networks based on intelligent network coding

EURASIP Journal on Advances in Signal Processing - Special issue on dynamic spectrum access for wireless networking
Autonomous development of vergence control driven by disparity energy neuron populations

Neural Computation
Multi-agent Q-learning of channel selection in multi-user cognitive radio systems: a two by two case

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Generation of roles in reinforcement learning considering redistribution of reward between agents

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A novel technique to design a fuzzy logic controller using Q(λ)-learning and genetic algorithms in the pursuit-evasion game

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A novel hybrid learning technique applied to a self-learning multi-robot system

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Dimensionality effects on the Markov property in shape memory alloy hysteretic environment

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Multiresolution state-space discretization method for Q-learning with function approximation and policy iteration

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Learning intialized by topologically correct representation

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A probabilistic fuzzy logic system: learning in the stochastic environment with incomplete dynamics

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Real-valued Q-learning in multi-agent cooperation

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Behavioral-fusion control based on reinforcement learning

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Implementation of fuzzy Q-learning based on modular fuzzy model and parallel structured learning

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Prerequesites for symbiotic brain-machine interfaces

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Planning-based prediction for pedestrians

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Consideration on robotic giant-swing motion generated by reinforcement learning

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Decision-theoretic robot guidance for active cooperative perception

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Robot task switching under diminishing returns

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Using eigenposes for lossless periodic human motion imitation

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Hardware design of autonomous snake-like robot for reinforcement learning based on environment: discussion of versatility on different tasks

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Design of semi-decentralized control laws for distributed-air-jet micromanipulators by reinforcement learning

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
A learning approach to integration of layers of a hybrid control architecture

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
From manipulation to communicative gesture

Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
Collective iterative allocation: Enabling fast and optimal group decision making: The role of group knowledge, optimism, and decision policies in distributed coordination

Web Intelligence and Agent Systems
Solving multiconstraint assignment problems using learning automata

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A new mobile robot navigation method using fuzzy logic and a modified Q-learning algorithm

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Feature Article---Merging AI and OR to Solve High-Dimensional Stochastic Optimization Problems Using Approximate Dynamic Programming

INFORMS Journal on Computing
Rejoinder---The Languages of Stochastic Optimization

INFORMS Journal on Computing
Truncated fourier series formulation for bipedal walking balance control

Robotica
Approximate dynamic programming techniques for the control of time-varying queuing systems applied to call centers with abandonments and retrials

Probability in the Engineering and Informational Sciences
Review:

The Knowledge Engineering Review
A framework for the design of a military operational supply network

CISDA'09 Proceedings of the Second IEEE international conference on Computational intelligence for security and defense applications
Online reinforcement learning for dynamic multimedia systems

IEEE Transactions on Image Processing
Reference traces by simulation for tracking control-logic

ETFA'09 Proceedings of the 14th IEEE international conference on Emerging technologies & factory automation
Data mining with agent gaming

Information Technology and Management
Online adaptive policies for ensemble classifiers

Neurocomputing
Improving iterative repair strategies for scheduling with the SVM

Neurocomputing
Asynchronous neurocomputing for optimal control and reinforcement learning with large state spaces

Neurocomputing
Model-based reinforcement learning: a computational model and an fMRI study

Neurocomputing
A new approach to fuzzy classifier systems and its application in self-generating neuro-fuzzy systems

Neurocomputing
Reinforcement learning combined with a fuzzy adaptive learning control network (FALCON-R) for pattern classification

Pattern Recognition
On Evaluating Information Revelation Policies in Procurement Auctions: A Markov Decision Process Approach

Information Systems Research
Induction over Strategic Agents

Information Systems Research
CCMAC: Coordinated cooperative MAC for wireless LANs

Computer Networks: The International Journal of Computer and Telecommunications Networking
A model of portfolio optimization using time adapting genetic network programming

Computers and Operations Research
A self-organizing neural architecture integrating desire, intention and reinforcement learning

Neurocomputing
Review article: Synergizing reinforcement learning and game theory-A new direction for control

Applied Soft Computing
Managing Adaptive Versatile environments

Pervasive and Mobile Computing
Fuzzy decision tree function approximation in reinforcement learning

International Journal of Artificial Intelligence and Soft Computing
A Least-squares Approach to Direct Importance Estimation

The Journal of Machine Learning Research
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
Provably Efficient Learning with Typed Parametric Models

The Journal of Machine Learning Research
RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments

The Journal of Machine Learning Research
Reinforcement Learning in Finite MDPs: PAC Analysis

The Journal of Machine Learning Research
A Convergent Online Single Time Scale Actor Critic Algorithm

The Journal of Machine Learning Research
Bounding the population size in XCS to ensure reproductive opportunities

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Designing efficient exploration with MACS: modules and function approximation

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Simulating sellers' behavior in a reverse auction B2B exchange

ICCS'03 Proceedings of the 2003 international conference on Computational science
Reinforcement learning as a means of dynamic aggregate QoS provisioning

Art-QoS'03 Proceedings of the 2003 international conference on Architectures for quality of service in the internet
Learning and evolution affected by spatial structure

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
An adaptive inventory control model for a supply chain with nonstationary customer demands

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
A learning autonomous driver system on the basis of image classification and evolutional learning

MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition
Model-based least-squares policy evaluation

AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Knowledge discovery and emergent complexity in bioinformatics

KDECB'06 Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics
Integration of genetic programming and reinforcement learning for real robots

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartI
Analyzing parameter sensitivity and classifier representations for real-valued XCS

IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Counter example for Q-bucket-brigade under prediction problem

IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
An experimental comparison between ATNoSFERES and ACS

IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Adaptive value function approximations in classifier systems

IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Three architectures for continuous action

IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Parallelizing parallel rollout algorithm for solving Markov decision processes

WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
A functional spiking neuron hardware oriented model

IWANN'03 Proceedings of the Artificial and natural neural networks 7th international conference on Computational methods in neural modeling - Volume 1
Reinforcement learning for online control of evolutionary algorithms

ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems
Defending DDoS attacks using hidden Markov models and cooperative reinforcement learning

PAISI'07 Proceedings of the 2007 Pacific Asia conference on Intelligence and security informatics
Unified criterion of state generalization for reactive autonomous agents

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Generating hierarchical structure in reinforcement learning from state variables

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Automatic development of robot behaviour using Monte Carlo methods

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Evolving symbolic controllers

EvoWorkshops'03 Proceedings of the 2003 international conference on Applications of evolutionary computing
On a dynamical analysis of reinforcement learning in games: emergence of Occam's Razor

CEEMAS'03 Proceedings of the 3rd Central and Eastern European conference on Multi-agent systems
Evolving reinforcement learning-like abilities for robots

ICES'03 Proceedings of the 5th international conference on Evolvable systems: from biology to hardware
Using genetic programming to generate protocol adaptors for interprocess communication

ICES'03 Proceedings of the 5th international conference on Evolvable systems: from biology to hardware
Acceleration of game learning with prediction-based reinforcement learning: toward the emergence of planning behavior

ICANN/ICONIP'03 Proceedings of the 2003 joint international conference on Artificial neural networks and neural information processing
Adaptive focused crawling

The adaptive web
Heuristic search based exploration in reinforcement learning

IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Learning autonomous behaviours for non-holonomic vehicles

IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
A novel burst assembly algorithm for optical burst switched networks based on learning automata

ONDM'07 Proceedings of the 11th international IFIP TC6 conference on Optical network design and modeling
Decentralized information aggregation and central control in networked production environments

HCI'07 Proceedings of the 12th international conference on Human-computer interaction: applications and services
Computing and using lower and upper bounds for action elimination in MDP planning

SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Model-based exploration in continuous state spaces

SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Active learning of dynamic Bayesian networks in Markov decision processes

SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Field-based coordination of mobile intelligent agents: an evolutionary game theoretic analysis

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
Reinforcement learning of competitive skills with soccer agents

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
Optimal convergence in multi-agent MDPs

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Reinforcement learning scheme for grouping and anti-predator behavior

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Learning evaluation functions of Shogi positions from different sets of games

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Grounding action-selection in event-based anticipation

ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Evolution and learning in an intrinsically motivated reinforcement learning robot

ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Efficient learning of neural networks with evolutionary algorithms

Proceedings of the 29th DAGM conference on Pattern recognition
Decomposition principles and online learning in cross-layer optimization for delay-sensitive applications

IEEE Transactions on Signal Processing
Virtual markets: Q-learning sellers with simple state representation

AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
On-line agent teamwork training using immunological network model

AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Reinforcement learning-based load shared sequential routing

NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
Online learning of task-driven object-based visual attention control

Image and Vision Computing
Plan-based control of robotic agents: improving the capabilities of autonomous robots

Plan-based control of robotic agents: improving the capabilities of autonomous robots
Posterior weighted reinforcement learning with state uncertainty

Neural Computation
Convergence analysis on approximate reinforcement learning

KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management
Simple model-based exploration and exploitation of Markov decision processes using the elimination algorithm

MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Learning models of relational MDPs using graph kernels

MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Improving optimality of neural rewards regression for data-efficient batch near-optimal policy identification

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Reinforcement learning for cooperative actions in a partially observable multi-agent system

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Stochastic weights reinforcement learning for exploratory data analysis

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Cooperation between multiple agents based on partially sharing policy

ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
Feature construction for reinforcement learning in hearts

CG'06 Proceedings of the 5th international conference on Computers and games
Skill combination for reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Clustering with reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Independent factor reinforcement learning for portfolio management

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
A novel ANN model based on quantum computational MAS theory

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
A novel neural network based reinforcement learning

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Reinforcement learning algorithms based on mGA and EA with policy iterations

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Toward perception based computing: a rough-granular perspective

WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
Optimizing walking controllers for uncertain inputs and environments

ACM SIGGRAPH 2010 papers
Reducing trials by thinning-out in skill discovery

DS'07 Proceedings of the 10th international conference on Discovery science
Safe state abstraction and reusable continuing subtasks in hierarchical reinforcement learning

AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Validation of a reinforcement learning policy for dosage optimization of erythropoietin

AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Generalization and transfer learning in noise-affected robot navigation tasks

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Heuristic Q-learning soccer players: a new reinforcement learning approach to RoboCup simulation

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Intelligent farmer agent for multi-agent ecological simulations optimization

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Learning to use a perishable good as money

MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
Can agents acquire human-like behaviors in a sequential bargaining game?: comparison of Roth's and Q-learning agents

MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
A k-NN based perception scheme for reinforcement learning

EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
A practical learning-based approach for dynamic storage bandwidth allocation

IWQoS'03 Proceedings of the 11th international conference on Quality of service
Semi-supervised speaker identification under covariate shift

Signal Processing
The MACS project: an approach to affordance-inspired robot control

Proceedings of the 2006 international conference on Towards affordance-based robot control
Temporal difference learning and simulated annealing for optimal control: a case study

KES-AMSTA'08 Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications
Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
Bio-inspired self-organizing relationship network as knowledge acquisition tool and fuzzy inference engine

WCCI'08 Proceedings of the 2008 IEEE world conference on Computational intelligence: research frontiers
Feature discovery in reinforcement learning using genetic programming

EuroGP'08 Proceedings of the 11th European conference on Genetic programming
Opportunistic transmission for wireless sensor networks under delay constraints

ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III
Learning relational options for inductive transfer in relational reinforcement learning

ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Relational macros for transfer in reinforcement learning

ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Building relational world models for reinforcement learning

ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Relational sequence learning

Probabilistic inductive logic programming
The evolution of cognition: from first order to second order embodiment

ZiF'06 Proceedings of the Embodied communication in humans and machines, 2nd ZiF research group international conference on Modeling communication with robots and virtual humans
Seeing the forest despite the trees: large scale spatial-temporal decision making

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Regret-based reward elicitation for Markov decision processes

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Temporal-difference networks for dynamical systems with continuous observations and actions

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Exploring compact reinforcement-learning representations with linear regression

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
A state-cluster based Q-learning

ICNC'09 Proceedings of the 5th international conference on Natural computation
Urban traffic signal learning control using fuzzy actor-critic methods

ICNC'09 Proceedings of the 5th international conference on Natural computation
Unifying perceptual and behavioral learning with a correlative subspace learning rule

Neurocomputing
Using cognition and learning to improve agents' reactions

Adaptive agents and multi-agent systems
Relational reinforcement learning for agents in worlds with objects

Adaptive agents and multi-agent systems
Character animation in two-player adversarial games

ACM Transactions on Graphics (TOG)
2006: celebrating 75 years of AI - history and outlook: the next 25 years

50 years of artificial intelligence
Adaptive multi-modal sensors

50 years of artificial intelligence
Intrinsically motivated machines

50 years of artificial intelligence
Reward-modulated hebbian learning of decision making

Neural Computation
Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs

Artificial Intelligence
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks

Information Sciences: an International Journal
Smartlocks: lock acquisition scheduling for self-aware synchronization

Proceedings of the 7th international conference on Autonomic computing
Applying reinforcement learning to scheduling strategies in an actual grid environment

International Journal of High Performance Systems Architecture
Reinforcement learning for training a computer program of Chinese chess

International Journal of Intelligent Information and Database Systems
Planning of diverse complex cooperative robot actions using multi-stage genetic algorithm

CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
Use of the knowledge which is independence on reward in reinforcement learning

CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
A study on hierarchical modular reinforcement learning for multi-agent pursuit problem based on relative coordinate states

CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
Opportunistic exploitation of bandwidth resources through reinforcement learning

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
A reinforcement learning-based lightpath establishment for service differentiation in all-optical WDM networks

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
Cooperative communications with relay selection for QoS provisioning in wireless sensor networks

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
On balancing exploration vs. exploitation in a cognitive engine for multi-antenna systems

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
A common-neural-pattern based reasoning for mobile robot cognitive mapping

ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Learning of action generation from raw camera images in a real-world-like environment by simple coupling of reinforcement learning and a neural network

ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Brain-inspired emergence of behaviors based on the desire for existence by reinforcement learning

ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Improving optimistic exploration in model-free reinforcement learning

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
A cat-like robot real-time learning to run

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Bounds for multistage stochastic programs using supervised learning strategies

SAGA'09 Proceedings of the 5th international conference on Stochastic algorithms: foundations and applications
Forward chaining algorithm for solving the shortest path problem in arbitrary deterministic environment in linear time: applied for the tower of Hanoi problem

KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Emulation and behavior understanding through shared values

Robotics and Autonomous Systems
Q-learning for opportunistic spectrum access

Proceedings of the 6th International Wireless Communications and Mobile Computing Conference
Joint path and wavelength selection using Q-learning in optical burst switching networks

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
A multi-agent reinforcement learning approach to path selection in optical burst switching networks

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
A self-organized spectrum assignment strategy in next generation OFDMA networks providing secondary spectrum access

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Needle target-insertion trajectory planning based on reforcement learning expert's skill

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Interaction of culture-based learning and cooperative co-evolution and its application to automatic behavior-based system design

IEEE Transactions on Evolutionary Computation
Impedance learning for robotic contact tasks using natural actor-critic algorithm

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Monotonicity of constrained optimal transmission policies in correlated fading channels with ARQ

IEEE Transactions on Signal Processing
On-line learning and optimization for wireless video transmission

IEEE Transactions on Signal Processing
A systematic framework for dynamically optimizing multi-user wireless video transmission

IEEE Journal on Selected Areas in Communications
MLeXAI: A Project-Based Application-Oriented Model

ACM Transactions on Computing Education (TOCE)
Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Autonomous Agents and Multi-Agent Systems
What the 2007 TAC Market Design Game tells us about effective auction mechanisms

Autonomous Agents and Multi-Agent Systems
Efficient vision-based navigation

Autonomous Robots
Finding and transferring policies using stored behaviors

Autonomous Robots
Non-parametric Learning to Aid Path Planning over Slopes

International Journal of Robotics Research
Evolving agent behavior in multiobjective domains using fitness-based shaping

Proceedings of the 12th annual conference on Genetic and evolutionary computation
An activation reinforcement based classifier system for balancing generalisation and specialisation (ARCS)

Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Learning classifier systems

Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
How "Authentic Intentionality" can be Enabled: a Neurocomputational Hypothesis

Minds and Machines
Neural mechanisms of the mind, Aristotle, Zadeh, and fMRI

IEEE Transactions on Neural Networks
A MDP approach to fault-tolerant routing

WD'09 Proceedings of the 2nd IFIP conference on Wireless days
Exploitation and exploration in a performance based contextual advertising system

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Optimizing debt collections using constrained reinforcement learning

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Error Bounds for Approximations from Projected Linear Equations

Mathematics of Operations Research
Spectrum management of cognitive radio using multi-agent reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Industry track
Combining manual feedback with subsequent MDP reward signals for reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
High-level reinforcement learning in strategy games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
To teach or not to teach?: decision making under uncertainty in ad hoc teams

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Frequency adjusted multi-agent Q-learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using spatial hints to improve policy reuse in a reinforcement learning agent

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
PAC-MDP learning with knowledge-based admissible models

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using graph analysis to study networks of adaptive agent

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Optimal policy switching algorithms for reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning multi-agent state space representations

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Strategy generation in multi-agent imperfect-information pursuit games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Cultivating desired behaviour: policy teaching via environment-dynamics tweaks

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
A reward function generation method using genetic algorithms: a robot soccer case study

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Online model learning in adversarial Markov decision processes

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action discovery for reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Model-based direct policy search

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Adaptive Auction Mechanism Design and the Incorporation of Prior Knowledge

INFORMS Journal on Computing
Action selection and task sequence learning for hybrid dynamical cognitive agents

Robotics and Autonomous Systems
Combining active learning and reactive control for robot grasping

Robotics and Autonomous Systems
Adaptive data-aware utility-based scheduling in resource-constrained systems

Journal of Parallel and Distributed Computing
Reinforcement learning of interface mapping for interactivity enhancement of robot control in assistive environments

Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments
An application of reinforcement learning for efficient spectrum usage in next-generation mobile cellular networks

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
A step toward an adaptive composition of query suggestion approaches

Proceedings of the third symposium on Information interaction in context
MRL-CC: a novel cooperative communication protocol for QoS provisioning in wireless sensor networks

International Journal of Sensor Networks
Learning and Reversal Learning in the Subcortical Limbic System: A Computational Model

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Self-Organizing Sensorimotor Maps Plus Internal Motivations Yield Animal-Like Behavior

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Self-configured multipath routing using path lifetime for video-streaming services over Ad Hoc networks

Computer Communications
Coordinated learning in multiagent MDPs with infinite state-space

Autonomous Agents and Multi-Agent Systems
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

ACM Transactions on Modeling and Computer Simulation (TOMACS)
Decision-theoretic design space exploration of multiprocessor platforms

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
A survey of Tactile Human-Robot Interactions

Robotics and Autonomous Systems
A learning automata based scheduling solution to the dynamic point coverage problem in wireless sensor networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Reinforcement learning intellectual agent of protection for adapting to surrounding environment

Proceedings of the 3rd international conference on Security of information and networks
An adaptive link layer for heterogeneous multi-radio mobile sensor networks

IEEE Journal on Selected Areas in Communications - Special issue on simple wireless sensor networking solutions
Model-free control based on reinforcement learning for a wastewater treatment problem

Applied Soft Computing
Reinforcement learning of competitive and cooperative skills in soccer agents

Applied Soft Computing
Learning to adapt to unknown users: referring expression generation in spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Importance-Driven Turn-Bidding for spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Learning to follow navigational directions

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Optimising information presentation for spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Reading between the lines: learning to map high-level instructions to commands

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Near-optimal Regret Bounds for Reinforcement Learning

The Journal of Machine Learning Research
Evolving Static Representations for Task Transfer

The Journal of Machine Learning Research
EA2: The Winning Strategy for the Inaugural Lemonade Stand Game Tournament

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Uncertainty Propagation for Efficient Exploration in Reinforcement Learning

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
The Dynamics of Multi-Agent Reinforcement Learning

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
A NEAT Way for Evolving Echo State Networks

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
ANTIPA: an agent architecture for intelligent information assistance

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
A reinforcement learning with switching controllers for a continuous action space

Artificial Life and Robotics
Intelligent agent construction using the attentive characteristic patterns of chaotic neural networks

Artificial Life and Robotics
Control of unknown nonlinear systems with efficient transient performance using concurrent exploitation and exploration

IEEE Transactions on Neural Networks
Adapting and evaluating distributed real-time and embedded systems in dynamic environments

Proceedings of the First International Workshop on Data Dissemination for Large Scale Complex Critical Infrastructures
Optimizing a new nonlinear reinforcement scheme with Breeder genetic algorithm

NN'10/EC'10/FS'10 Proceedings of the 11th WSEAS international conference on nural networks and 11th WSEAS international conference on evolutionary computing and 11th WSEAS international conference on Fuzzy systems
Individual differences in nucleus accumbens dopamine receptors predict development of addiction-like behavior: A computational approach

Neural Computation
Motion fields for interactive character locomotion

ACM SIGGRAPH Asia 2010 papers
Towards modeling the behavior of physical intruders in a region monitored by a wireless sensor network

Proceedings of the 3rd ACM workshop on Artificial intelligence and security
An adaptive Q-learning algorithm developed for agent-based computational modeling of electricity market

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Cognitive engine design for link adaptation: an application to multi-antenna systems

IEEE Transactions on Wireless Communications
Using reinforcement learning to create communication channel management strategies for diverse users

SLPAT '10 Proceedings of the NAACL HLT 2010 Workshop on Speech and Language Processing for Assistive Technologies
Learning in closed-loop brain-machine interfaces: modeling and experimental validation

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
The neuronal replicator hypothesis

Neural Computation
Functional Optimization Through Semilocal Approximate Minimization

Operations Research
Measuring universal intelligence: Towards an anytime intelligence test

Artificial Intelligence
MDP-based lightpath establishment for service differentiation in all-optical WDM networks with wavelength conversion capability

Photonic Network Communications
A Human-Robot Collaborative Reinforcement Learning Algorithm

Journal of Intelligent and Robotic Systems
Reinforcement learning using Voronoi space division

Artificial Life and Robotics
A study of Q-learning considering negative rewards

Artificial Life and Robotics
Coalition-based metaheuristic: a self-adaptive metaheuristic using reinforcement learning and mimetism

Journal of Heuristics
Hierarchical reinforcement learning for adaptive text generation

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Towards a programmable instrumented generator

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
End-to-end stochastic scheduling of scalable video overtime-varying channels

Proceedings of the international conference on Multimedia
Rule acquisition for cognitive agents by using estimation of distribution algorithms

International Journal of Knowledge Engineering and Soft Data Paradigms
Leveling-up in heroes of might and magic III

FUN'10 Proceedings of the 5th international conference on Fun with algorithms
An autonomic testing framework for IPv6 configuration protocols

AIMS'10 Proceedings of the Mechanisms for autonomous management of networks and services, and 4th international conference on Autonomous infrastructure, management and security
An algorithmic game theory study of wholesale electricity markets based on central auction

Integrated Computer-Aided Engineering - Multi-Agent Systems for Energy Management
Agent-based coordination techniques for matching supply and demand in energy networks

Integrated Computer-Aided Engineering - Multi-Agent Systems for Energy Management
Unsupervised learning of background modeling parameters in multicamera systems

Computer Vision and Image Understanding
Learning adaptive referring expression generation policies for spoken dialogue systems

Empirical methods in natural language generation
Natural language generation as planning under uncertainty for spoken dialogue systems

Empirical methods in natural language generation
Time-based reward shaping in real-time strategy games

ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
Multi-policy optimization in self-organizing systems

SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Adaptive ε-greedy exploration in reinforcement learning based on value differences

KI'10 Proceedings of the 33rd annual German conference on Advances in artificial intelligence
Bayesian reasoning for software testing

Proceedings of the FSE/SDP workshop on Future of software engineering research
Learning to coordinate in complex networks

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
On the characteristics of sequential decision problems and their impact on evolutionary computation and reinforcement learning

EA'09 Proceedings of the 9th international conference on Artificial evolution
AdQL - anomaly detection Q-learning in control multi-queue systems with QoS constraints

KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Three-subagent adapting architecture for fighting videogames

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
The penalty avoiding rational policy making algorithm in continuous action spaces

IDEAL'10 Proceedings of the 11th international conference on Intelligent data engineering and automated learning
Tug-of-war model for multi-armed bandit problem

UC'10 Proceedings of the 9th international conference on Unconventional computation
From mirror writing to mirror neurons

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Learning to look in different environments: an active-vision model which learns and readapts visual routines

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Minimal model of strategy switching in the plus-maze navigation task

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
A computational model of integration between reinforcement learning and task monitoring in the prefrontal cortex

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Noisy-or nodes for conditioning models

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
TeXDYNA: hierarchical reinforcement learning in factored MDPs

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
A novel information measure for predictive learning in a social system setting

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Anticipation as a strategy: a design paradigm for robotics

KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
Improving reinforcement learning agents using genetic algorithms

AMT'10 Proceedings of the 6th international conference on Active media technology
A model of basal ganglia in saccade generation

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part I
Generating adaptive route instructions using hierarchical reinforcement learning

SC'10 Proceedings of the 7th international conference on Spatial cognition
Evolving a single scalable controller for an octopus arm with a variable number of segments

PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Social conformity and its convergence for reinforcement learning

MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evaluation of techniques for a learning-driven modeling methodology in multiagent simulation

MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Smarter sampling in model-based Bayesian reinforcement learning

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Adaptive bases for reinforcement learning

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Feature selection for reinforcement learning: evaluating implicit state-reward dependency via conditional mutual information

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Evolutionary dynamics of regret minimization

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Hidden Markov model for human decision process in a partially observable environment

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
An incremental probabilistic neural network for regression and reinforcement learning tasks

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Exploring continuous action spaces with diffusion trees for reinforcement learning

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
An alternative approach to the revision of ordinal conditional functions in the context of multi-valued logic

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
One-shot supervised reinforcement learning for multi-targeted tasks: RL-SAS

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
An oscillatory neural network model for birdsong learning and generation

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Incorporating domain models into Bayesian optimization for RL

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
On the potential of process simulation in software project schedule optimization

COMPSAC-W'05 Proceedings of the 29th annual international conference on Computer software and applications conference
Simultaneous learning of perception and action in mobile robots

Robotics and Autonomous Systems
ACE (Actor-Critic-Explorer) paradigm for reinforcement learning in basal ganglia: Highlighting the role of subthalamic and pallidal nuclei

Neurocomputing
A View on Human Goal-Directed Activity and the Construction of Artificial Intelligence

Minds and Machines
Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming

Journal of Scheduling
Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging

Journal of Intelligent and Robotic Systems
Reducing reinforcement learning to KWIK online regression

Annals of Mathematics and Artificial Intelligence
Resource-driven mission-phasing techniques for constrained agents in stochastic environments

Journal of Artificial Intelligence Research
A minimum relative entropy principle for learning and acting

Journal of Artificial Intelligence Research
Automatic induction of bellman-error features for probabilistic planning

Journal of Artificial Intelligence Research
Pagerank optimization in polynomial time by stochastic shortest path reformulation

ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Prediction with expert advice under discounted loss

ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Optimality issues of universal greedy agents with static priors

ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Consistency of feature Markov processes

ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Developing strategies for the ART domain

CAEPIA'09 Proceedings of the Current topics in artificial intelligence, and 13th conference on Spanish association for artificial intelligence
Transfer learning via relational templates

ILP'09 Proceedings of the 19th international conference on Inductive logic programming
Policy transfer via Markov logic networks

ILP'09 Proceedings of the 19th international conference on Inductive logic programming
Algorithm selection as a bandit problem with unbounded losses

LION'10 Proceedings of the 4th international conference on Learning and intelligent optimization
Coaching to enhance the online behavior learning of a robotic agent

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part I
Fuzzy Q(λ)-learning algorithm

ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
Emotion and reinforcement: affective facial expressions facilitate robot learning

ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
An unsupervised, online learning framework for moving object detection

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Reinforcement learning based resource allocation in business process management

Data & Knowledge Engineering
Self-learning fuzzy logic controllers for pursuit-evasion differential games

Robotics and Autonomous Systems
Generalized learning automata for multi-agent reinforcement learning

AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Continuous-state reinforcement learning with fuzzy approximation

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Parallel reinforcement learning with linear function approximation

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Bifurcation analysis of reinforcement learning agents in the Selten's horse game

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Solving multi-stage games with hierarchical learning automata that bootstrap

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Multi-agent reinforcement learning for intrusion detection

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
A new feature for approximate dynamic programming traffic light controller

Proceedings of the Second International Workshop on Computational Transportation Science
Generating three binary addition algorithms using reinforcement programming

Proceedings of the 48th Annual Southeast Regional Conference
Adaptive case-based reasoning using retention and forgetting strategies

Knowledge-Based Systems
Web-based multi-agent system architecture in a dynamic environment

International Journal of Knowledge-based and Intelligent Engineering Systems
User and noise adaptive dialogue management using hybrid system actions

IWSDS'10 Proceedings of the Second international conference on Spoken dialogue systems for ambient environments
Autonomous discovery of subgoals using acyclic state trajectories

ICICA'10 Proceedings of the First international conference on Information computing and applications
Teaching a robot to perform tasks with voice commands

MICAI'10 Proceedings of the 9th Mexican international conference on Advances in artificial intelligence: Part I
On-line adaptive algorithms in autonomic restart control

ATC'10 Proceedings of the 7th international conference on Autonomic and trusted computing
Agent-augmented co-space: toward merging of real world and cyberspace

ATC'10 Proceedings of the 7th international conference on Autonomic and trusted computing
Multiagent Q-learning for aloha-like spectrum access in cognitive radio systems

EURASIP Journal on Wireless Communications and Networking
Adaptation-based programming in java

Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
Improving space representation in multiagent learning via tile coding

SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
Structural knowledge transfer by spatial abstraction for reinforcement learning agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

Proceedings of the fourth ACM international conference on Web search and data mining
Human-inspired computational fairness

Autonomous Agents and Multi-Agent Systems
Learning the behavior model of a robot

Autonomous Robots
Expert-driven genetic algorithms for simulating evaluation functions

Genetic Programming and Evolvable Machines
Stochastic control via direct comparison

Discrete Event Dynamic Systems
Evaluating Q-learning policies for multi-objective foraging task in a multi-agent environment

ICIRA'10 Proceedings of the Third international conference on Intelligent robotics and applications - Volume Part II
Free-energy based reinforcement learning for vision-based navigation with high-dimensional sensory inputs

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Reinforcement learning by KFM probabilistic associative memory based on weights distribution and area neuron increase and decrease

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
An information-spectrum approach to analysis of return maximization in reinforcement learning

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Adaptive decision making in ant colony system by reinforcement learning

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Multivariate decision tree function approximation for reinforcement learning

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Creating long gait animation sequences through Reinforcement Learning

Proceedings of the 2011 conference on Neural Nets WIRN10: Proceedings of the 20th Italian Workshop on Neural Nets
Improved AP association management using machine learning

ACM SIGMOBILE Mobile Computing and Communications Review
Self-organizing networks in next generation radio access networks: Application to fractional power control

Computer Networks: The International Journal of Computer and Telecommunications Networking
Internal-time temporal difference model for neural value-based decision making

Neural Computation
Modeling basal ganglia for understanding parkinsonian reaching movements

Neural Computation
A reinforcement learning framework for answering complex questions

Proceedings of the 16th international conference on Intelligent user interfaces
Continuous state/action reinforcement learning: A growing self-organizing map approach

Neurocomputing
Foresighted tree configuration games in resource constrained distributed stream mining sensors

Ad Hoc Networks
Learning dialogue strategies from older and younger simulated users

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Adaptive referring expression generation in spoken dialogue systems: evaluation with real users

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Gaussian processes for fast policy optimisation of POMDP-based dialogue managers

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Two coupled neural-networks-based solution of the Hamilton-Jacobi-Bellman equation

Applied Soft Computing
A novel multi-agent reinforcement learning approach for job scheduling in Grid computing

Future Generation Computer Systems
Particle swarm optimization in exploratory data analysis

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
Learning visual representations for perception-action systems

International Journal of Robotics Research
Solving non-stationary bandit problems by random sampling from sibling Kalman filters

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III
Temporal vagueness, coordination and communication

ViC'09 Proceedings of the 2009 international conference on Vagueness in communication
Planning with noisy probabilistic relational rules

Journal of Artificial Intelligence Research
The inverse classification problem

Journal of Computer Science and Technology
Swarm reinforcement learning method based on an actor-critic method

SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
A parameterless biologically inspired control algorithm robust to nonlinearities, dead-times and low-pass filtering effects

SIMPAR'10 Proceedings of the Second international conference on Simulation, modeling, and programming for autonomous robots
Reduct based Q-learning: an introduction

Proceedings of the 2011 International Conference on Communication, Computing & Security
Studying the emergence of money by means of swarm multi-agent simulation

IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Dynamic reward shaping: training a robot by voice

IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
State representation with perceptual constancy based on active motion

ICSR'10 Proceedings of the Second international conference on Social robotics
Selection of actions for an autonomous social robot

ICSR'10 Proceedings of the Second international conference on Social robotics
A Markovian process modeling for Pickomino

CG'10 Proceedings of the 7th international conference on Computers and games
Enhancements for multi-player Monte-Carlo tree search

CG'10 Proceedings of the 7th international conference on Computers and games
Teacher feedback to scaffold and refine demonstrated motion primitives on a mobile robot

Robotics and Autonomous Systems
Empowerment for continuous agent-environment systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Combining Constraint Programming and Local Search for Job-Shop Scheduling

INFORMS Journal on Computing
A CMOS current-mode dynamic programming circuit

IEEE Transactions on Circuits and Systems Part I: Regular Papers - Special section on 2009 IEEE system-on-chip conference
A Generalized Path Integral Control Approach to Reinforcement Learning

The Journal of Machine Learning Research
Hessian matrix distribution for Bayesian policy gradient reinforcement learning

Information Sciences: an International Journal
Froms: A failure tolerant and mobility enabled multicast routing paradigm with reinforcement learning for WSNs

Ad Hoc Networks
Multi-level cognitive machine-learning based concept for human-like "artificial" walking: Application to autonomous stroll of humanoid robots

Neurocomputing
Robust high performance reinforcement learning through weighted k-nearest neighbors

Neurocomputing
Reinforcement Learning Enhanced Iterative Power Allocation in Stochastic Cognitive Wireless Mesh Networks

Wireless Personal Communications: An International Journal
Cognitive Radio with Reinforcement Learning Applied to Multicast Downlink Transmission with Power Adjustment

Wireless Personal Communications: An International Journal
A bionic model of adaptive searching behavior

Journal of Computer and Systems Sciences International
Representing trust in cognitive social simulations

SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
Introduction to special issue on machine learning for adaptivity in spoken dialogue systems

ACM Transactions on Speech and Language Processing (TSLP)
Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager

ACM Transactions on Speech and Language Processing (TSLP)
Spatially-aware dialogue control using hierarchical reinforcement learning

ACM Transactions on Speech and Language Processing (TSLP)
Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs

ACM Transactions on Speech and Language Processing (TSLP)
Comparing user simulations for dialogue strategy learning

ACM Transactions on Speech and Language Processing (TSLP)
Modeling spoken decision support dialogue and optimization of its dialogue strategy

ACM Transactions on Speech and Language Processing (TSLP)
Self-organizing state aggregation for architecture design of Q-learning

Information Sciences: an International Journal
A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems

Neural Processing Letters
Microassembly path planning using reinforcement learning for improving positioning accuracy of a 1 cm3 omni-directional mobile microrobot

Applied Intelligence
Nonverbal acoustic communication in human-computer interaction

Artificial Intelligence Review
Empirically evaluating the application of reinforcement learning to the induction of effective and adaptive pedagogical strategies

User Modeling and User-Adapted Interaction
Efficient program generation by evolving graph structures with multi-start nodes

Applied Soft Computing
Integration of reinforcement learning and optimal decision-making theories of the basal ganglia

Neural Computation
Learning and using domain-specific heuristics in ASP solvers

AI Communications - Answer Set Programming
Darwinian embodied evolution of the learning ability for survival

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems
Behavioural analysis in network formation using agent-based simulation systems

International Journal of Knowledge Engineering and Soft Data Paradigms
Sampled fictitious play for approximate dynamic programming

Computers and Operations Research
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds

ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
Intelligent "health restoration system": reinforcement learning feedback to diagnosis and treatment planning

TELE-INFO'06 Proceedings of the 5th WSEAS international conference on Telecommunications and informatics
Adaptive navigation for autonomous robots

Robotics and Autonomous Systems
Reinforcement learning for joint radio resource management in LTE-UMTS scenarios

Computer Networks: The International Journal of Computer and Telecommunications Networking
The implementation of Q-learning for problems in continuous state and action space using SOM-based fuzzy systems

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Design of a multi agent adaptive critic based neuro-fuzzy controller for multi-objective nonlinear systems

ICOSSSE'05 Proceedings of the 4th WSEAS/IASME international conference on System science and simulation in engineering
Learning powerful kicks on the aibo ERS-7: the quest for a striker

RoboCup 2010
LearnPNP: a tool for learning agent behaviors

RoboCup 2010
A nonlinear reinforcement scheme for stochastic learning automata

MMACTEE'06 Proceedings of the 8th WSEAS international conference on Mathematical methods and computational techniques in electrical engineering
Using a coevolution mechanism with a Dyna architecture for parameter adaptation in XCS classifier systems

CIMMACS'05 Proceedings of the 4th WSEAS international conference on Computational intelligence, man-machine systems and cybernetics
Knowledge of opposite actions for reinforcement learning

Applied Soft Computing
An educational tool for artificial neural networks

Computers and Electrical Engineering
AutoBlackTest: a tool for automatic black-box testing

Proceedings of the 33rd International Conference on Software Engineering
Short term memories and forcing the re-use of knowledge for generalization

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Stochastic processes for return maximization in reinforcement learning

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
An off-policy natural policy gradient method for a partial observable Markov decision process

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Back-propagation as reinforcement in prediction tasks

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Evolving optimal feature set by interactive reinforcement learning for image retrieval

ISNN'05 Proceedings of the Second international conference on Advances in neural networks - Volume Part II
A Multi-agent-based voltage control in power systems using distributed reinforcement learning

Simulation
Generic reinforcement schemes and their optimization

ECC'11 Proceedings of the 5th European conference on European computing conference
Self-adaptive provisioning of virtualized resources in cloud computing

Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Decentralized MDPs with sparse interactions

Artificial Intelligence
Reinforcement learning for model building and variance-penalized control

Winter Simulation Conference
Using genetic algorithms to limit the optimism in time warp

Winter Simulation Conference
A simulation-based approximate dynamic programming approach for the control of the Intel Mini-Fab benchmark model

Winter Simulation Conference
Balancing exploration and exploitation in learning to rank online

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Chaotic exploration generator for evolutionary reinforcement learning agents in nondeterministic environments

ICANNGA'11 Proceedings of the 10th international conference on Adaptive and natural computing algorithms - Volume Part II
Fault oblivious high performance computing with dynamic task replication and substitution

Computer Science - Research and Development
Smart data structures: an online machine learning approach to multicore data structures

Proceedings of the 8th ACM international conference on Autonomic computing
Decision making in autonomic computing systems: comparison of approaches and techniques

Proceedings of the 8th ACM international conference on Autonomic computing
Using reinforcement learning for controlling an elastic web application hosting platform

Proceedings of the 8th ACM international conference on Autonomic computing
Towards a real-world scenario for investigating organic computing principles in heterogeneous societies of robots

Proceedings of the 2011 workshop on Organic computing
A framework of intentional characters for simulation of social behavior

Proceedings of the 2010 Summer Computer Simulation Conference
Automatic abstraction and fault tolerance in cortical microachitectures

Proceedings of the 38th annual international symposium on Computer architecture
FQL-RED: an adaptive scalable schema for active queue management

International Journal of Network Management
Use of infeasible individuals in probabilistic model building genetic network programming

Proceedings of the 13th annual conference on Genetic and evolutionary computation
On the relationships between synaptic plasticity and generative systems

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Enhanced rule extraction and classification mechanism of genetic network programming for stock trading signal generation

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Policy learning in resource-constrained optimization

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Evolution of reward functions for reinforcement learning

Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Evolution for modeling: a genetic programming framework for sesam

Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Learning classifier systems

Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Learning to win by reading manuals in a Monte-Carlo framework

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Hierarchical reinforcement learning and hidden Markov models for task-oriented natural language generation

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Optimization of heuristic search using recursive algorithm selection and reinforcement learning

Annals of Mathematics and Artificial Intelligence
A dynamic programming strategy to balance exploration and exploitation in the bandit problem

Annals of Mathematics and Artificial Intelligence
Adaptive co-construction of state and action spaces in reinforcement learning

Artificial Life and Robotics
Self-adaptive provisioning of virtualized resources in cloud computing

ACM SIGMETRICS Performance Evaluation Review - Performance evaluation review
Planning with incomplete information

MoChArt'10 Proceedings of the 6th international conference on Model checking and artificial intelligence
Training neural networks to play backgammon variants using reinforcement learning

EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Learning chasing behaviours of non-player characters in games using SARSA

EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Adaptive kernel-width selection for kernel-based least-squares policy iteration algorithm

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Concurrent modular Q-learning with local rewards on linked multi-component robotic systems

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Study of a multi-robot collaborative task through reinforcement learning

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Reinforcement learning techniques for the control of wastewater treatment plants

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Towards a collaborative ranking mechanism for efficient and personalized internet search service provisioning

Journal of Computational Methods in Sciences and Engineering - Intelligent Systems and Knowledge Management (Part II)
Semi-automatic end-user tools for construction of virtual avatar behaviors

Proceedings of the 16th International Conference on 3D Web Technology
Selecting Simulation Algorithm Portfolios by Genetic Algorithms

PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
A Multi-State Q-Learning Approach for the Dynamic Load Balancing of Time Warp

PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Dynamic game difficulty balancing for backgammon

Proceedings of the 49th Annual Southeast Regional Conference
Non-deterministic policies in Markovian decision processes

Journal of Artificial Intelligence Research
A Monte-Carlo AIXI approximation

Journal of Artificial Intelligence Research
Learning in minority games with multiple resources

ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part II
A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes

The Journal of Machine Learning Research
Generalized TD Learning

The Journal of Machine Learning Research
Exploiting Best-Match Equations for Efficient Reinforcement Learning

The Journal of Machine Learning Research
On-line classification of data streams with missing values based on reinforcement learning

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Using artificial intelligence techniques for strategy generation in the commons game

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part I
Empirical study of Q-learning based elemental hose transport control

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Towards concurrent Q-learning on linked multi-component robotic systems

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
What kinds of human negotiation skill can be acquired by changing negotiation order of bargaining agents?

HCII'11 Proceedings of the 1st international conference on Human interface and the management of information: interacting with information - Volume Part II
Modelling coordination of learning systems: a reservoir systems approach to dopamine modulated pavlovian conditioning

ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part I
Machine learning and agents

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Voting in multi-agent system for improvement of partial observations

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
An agent-based approach to the dynamic price problem

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Deriving a near-optimal power management policy using model-free reinforcement learning and Bayesian classification

Proceedings of the 48th Design Automation Conference
Dynamic thermal management for multimedia applications using machine learning

Proceedings of the 48th Design Automation Conference
Experimental evaluation of automatic hint generation for a logic tutor

AIED'11 Proceedings of the 15th international conference on Artificial intelligence in education
Learning culture-specific dialogue models from non culture-specific data

UAHCI'11 Proceedings of the 6th international conference on Universal access in human-computer interaction: users diversity - Volume Part II
Multiagent reactive plan application learning in dynamic environments

Proceedings of the 15th WSEAS international conference on Computers
A distributed reinforcement learning approach for solving optimization problems

CIT'11 Proceedings of the 5th WSEAS international conference on Communications and information technology
Theoretical considerations of potential-based reward shaping for multi-agent systems

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Evolving subjective utilities: Prisoner's Dilemma game examples

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Information systems in modeling interactive computations on granules

Theoretical Computer Science
Empirical evaluation of ad hoc teamwork in the pursuit domain

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Integrating reinforcement learning with human demonstrations of varying ability

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Metric learning for reinforcement learning agents

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Argumentation-based reasoning in agents with varying degrees of trust

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Sequential constant size compressors for reinforcement learning

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Comparing humans and AI agents

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Compression and intelligence: social environments and communication

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Reinforcement learning and the Bayesian control rule

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
AGI and neuroscience: open sourcing the brain

AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Investigation in transfer learning: better way to apply transfer learning between agents

MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Preference-based policy learning

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Datum-wise classification: a sequential approach to sparsity

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Lagrange dual decomposition for finite horizon Markov decision processes

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Reinforcement learning through global stochastic search in N-MDPs

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Sparse Kernel-SARSA(λ) with an eligibility trace

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Efficient planning in R-max

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Solving delayed coordination problems in MAS

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Agent-based resource allocation in dynamically formed CubeSat constellations

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Agent sensing with stateful resources

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Robot Head Motion Control with an Emphasis on Realism of Neck---Eye Coordination during Object Tracking

Journal of Intelligent and Robotic Systems
Instance-based reinforcement learning technique with a meta-learning mechanism for robust multi-robot systems

TAROS'11 Proceedings of the 12th Annual conference on Towards autonomous robotic systems
Real-world reinforcement learning for autonomous humanoid robot charging in a home environment

TAROS'11 Proceedings of the 12th Annual conference on Towards autonomous robotic systems
Heliza: talking dirty to the attackers

Journal in Computer Virology
On the Curse of Dimensionality in Supervised Learning of Smooth Regression Functions

Neural Processing Letters
Personalized pricing recommender system: multi-stage epsilon-greedy approach

Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems
Economic learning for thermal-aware power budgeting in many-core architectures

CODES+ISSS '11 Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
A framework for Resource-Aware Data Accumulation in sparse wireless sensor networks

Computer Communications
Study of SOM-based intelligent multi-controller for real-time scheduling

Applied Soft Computing
Ensemble methods for reinforcement learning with function approximation

MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Policy gradient reinforcement learning with environmental dynamics and action-values in policies

KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part I
A two-armed bandit collective for examplar based mining of frequent itemsets with applications to intrusion detection

ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part I
Agent-based system with learning capabilities for transport problems

ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Evolving equilibrium policies for a multiagent reinforcement learning problem with state attractors

ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Modeling agents and agent systems

Transactions on computational collective intelligence V
Tactical agent personality

International Journal of Computer Games Technology
The Effect of Robust Decisions on the Cost of Uncertainty in Military Airlift Operations

ACM Transactions on Modeling and Computer Simulation (TOMACS)
On-line regression algorithms for learning mechanical models of robots: A survey

Robotics and Autonomous Systems
Strategic points to minimize time cost for decision making under asynchronous time constraints

WISS'10 Proceedings of the 2010 international conference on Web information systems engineering
Reinforcement learning for context aware segmentation

MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III
Principled methods for biasing reinforcement learning agents

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Balancing exploration and exploitation ratio in reinforcement learning

Proceedings of the 2011 Military Modeling & Simulation Symposium
Events, neural systems and time series

ServiceWave'10 Proceedings of the 2010 international conference on Towards a service-based internet
Learning to act optimally in partially observable Markov decision processes using hybrid probabilistic logic programs

SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
Deviations of stochastic bandit regret

ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
Universal knowledge-seeking agents

ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
On the power of global reward signals in reinforcement learning

MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Learning complex concepts using crowdsourcing: a Bayesian approach

ADT'11 Proceedings of the Second international conference on Algorithmic decision theory
Value-difference based exploration: adaptive control between epsilon-greedy and softmax

KI'11 Proceedings of the 34th Annual German conference on Advances in artificial intelligence
A reinforcement learning based method for optimizing the process of decision making in fire brigade agents

EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
Market-based dynamic task allocation using heuristically accelerated reinforcement learning

EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
Policy invariance under reward transformations for general-sum stochastic games

Journal of Artificial Intelligence Research
A Zeroth-Level Classifier System for Real Time Strategy Games

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Emotion-based intrinsic motivation for reinforcement learning agents

ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
A probabilistic method for inferring preferences from clicks

Proceedings of the 20th ACM international conference on Information and knowledge management
A self-adaptive routing paradigm for wireless mesh networks based on reinforcement learning

Proceedings of the 14th ACM international conference on Modeling, analysis and simulation of wireless and mobile systems
Self-teaching adaptive dynamic programming for Gomoku

Neurocomputing
A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Neurocomputing
Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach

Neurocomputing
Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning

Computers and Operations Research
On the complexity of policy iteration

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Approximate planning for factored POMDPs using belief state simplification

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Solving POMDPs by searching the space of finite policies

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning finite-state controllers for partially observable environments

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Qualitative MDPs and POMDPs: an order-of-magnitude approximation

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
PEGASUS: a policy search method for large MDPs and POMDPs

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
Learning to cooperate via policy search

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
The optimal reward baseline for gradient-based reinforcement learning

UAI'01 Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence
Fractionated software for networked cyber-physical systems: research directions and long-term vision

Formal modeling
Evaluating a reinforcement learning algorithm with a general intelligence test

CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
"if you can't be with the one you love, love the one you're with": How individual habituation of agent interactions improves global utility

Artificial Life
Reward-weighted regression with sample reuse for direct policy search in reinforcement learning

Neural Computation
Human dorsal striatum encodes prediction errors during observational learning of instrumental actions

Journal of Cognitive Neuroscience
Vigor in the face of fluctuating rates of reward: An experimental examination

Journal of Cognitive Neuroscience
Convergence Rates of Efficient Global Optimization Algorithms

The Journal of Machine Learning Research
Robust Approximate Bilinear Programming for Value Function Approximation

The Journal of Machine Learning Research
The application of learning algorithms in the development of natural interaction

Procedings of the Second Conference on Creativity and Innovation in Design
Quantum reinforcement learning

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
Learning in BDI multi-agent systems

CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
The apriori stochastic dependency detection (ASDD) algorithm for learning stochastic logic rules

CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
Behavior recognition and opponent modeling for adaptive table soccer playing

KI'05 Proceedings of the 28th annual German conference on Advances in Artificial Intelligence
Adaptive Scheduling on Power-Aware Managed Data-Centers Using Machine Learning

GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Handling camera movement constraints in reinforcement learning based active object recognition

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Cooperative behavior of agents based on potential field

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
A multi-agent fuzzy-reinforcement learning method for continuous domains

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
An adaptive approach for the exploration-exploitation dilemma for learning agents

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
General discounting versus average reward

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Is there an elegant universal theory of prediction?

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Asymptotic learnability of reinforcement problems with arbitrary dependence

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Probabilistic generalization of simple grammars and its application to reinforcement learning

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
BRA: An Algorithm for Simulating Bounded Rational Agents

Computational Economics
Intrinsically motivated intelligent sensed environments

EG-ICE'06 Proceedings of the 13th international conference on Intelligent Computing in Engineering and Architecture
A novel self-organizing neural fuzzy network for automatic generation of fuzzy inference systems

ISNN'05 Proceedings of the Second international conference on Advances in Neural Networks - Volume Part I
Applying neural network to reinforcement learning in continuous spaces

ISNN'05 Proceedings of the Second international conference on Advances in Neural Networks - Volume Part I
Task-Driven discretization of the joint space of visual percepts and continuous actions

ECML'06 Proceedings of the 17th European conference on Machine Learning
Patching approximate solutions in reinforcement learning

ECML'06 Proceedings of the 17th European conference on Machine Learning
Skill acquisition via transfer learning and advice taking

ECML'06 Proceedings of the 17th European conference on Machine Learning
Reinforcement learning for MDPs with constraints

ECML'06 Proceedings of the 17th European conference on Machine Learning
Efficient non-linear control through neuroevolution

ECML'06 Proceedings of the 17th European conference on Machine Learning
Scaling model-based average-reward reinforcement learning for product delivery

ECML'06 Proceedings of the 17th European conference on Machine Learning
Improvement of systems management policies using hybrid reinforcement learning

ECML'06 Proceedings of the 17th European conference on Machine Learning
A sparse kernel-based least-squares temporal difference algorithm for reinforcement learning

ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part I
Unique state and automatical action abstracting based on logical MDPs with negation

ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II
Analyzing fault monitoring policy for hierarchical network with MMDP environment

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Using meta-level control with reinforcement learning to improve the performance of the agents

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Testing probabilistic equivalence through reinforcement learning

FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Cognitive agents for sense and respond logistics

DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Opponent learning for multi-agent system simulation

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Adaptive learning in complex trade networks

SEAL'06 Proceedings of the 6th international conference on Simulated Evolution And Learning
Context adaptive self-configuration system

ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part III
Performance bounds for mobile cellular networks with handover prediction

MMNS'05 Proceedings of the 8th international conference on Management of Multimedia Networks and Services
Learning to segment document images

PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
Ensemble pruning using reinforcement learning

SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Monte Carlo matrix inversion policy evaluation

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Natural inspiration for artificial adaptivity: some neurocomputing experiences in robotics

UC'05 Proceedings of the 4th international conference on Unconventional Computation
A tutoring system for commercial games

ICEC'05 Proceedings of the 4th international conference on Entertainment Computing
An RLS-based natural actor-critic algorithm for locomotion of a two-linked robot arm

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
An autonomous mobile robot based on quantum algorithm

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Global versus local constructive function approximation for on-line reinforcement learning

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Structural abstraction experiments in reinforcement learning

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Adaptive utility-based scheduling in resource-constrained systems

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Learning near-optimal policies with bellman-residual minimization based fitted policy iteration and a single sample path

COLT'06 Proceedings of the 19th annual conference on Learning Theory
Toward guidelines for modeling learning agents in multiagent-based simulation: implications from Q-learning and sarsa agents

MABS'04 Proceedings of the 2004 international conference on Multi-Agent and Multi-Agent-Based Simulation
An architecture for multi-agent based self-adaptive system in mobile environment

IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Effect of synthetic emotions on agents’ learning speed and their survivability

ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
The quantitative law of effect is a robust emergent property of an evolutionary algorithm for reinforcement learning

ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Valency for adaptive homeostatic agents: relating evolution and learning

ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Fast reinforcement learning of dialogue policies using stable function approximation

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Investigation of evolving populations of adaptive agents

ICANN'05 Proceedings of the 15th international conference on Artificial Neural Networks: biological Inspirations - Volume Part I
Model based learning of sigma points in unscented Kalman filtering

Neurocomputing
Reinforcement learning based sensing policy optimization for energy efficient cognitive radio networks

Neurocomputing
An analytic research on secondary-spectrum trading mechanisms based on technical and market changes

Computer Networks: The International Journal of Computer and Telecommunications Networking
URL: A unified reinforcement learning approach for autonomic cloud management

Journal of Parallel and Distributed Computing
Robotic grasping and manipulation through human visuomotor learning

Robotics and Autonomous Systems
A hybrid learning strategy for discovery of policies of action

IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
An analysis of the different components of the anthocnet routing algorithm

ANTS'06 Proceedings of the 5th international conference on Ant Colony Optimization and Swarm Intelligence
Machine learning for spoken dialogue management: an experiment with speech-based database querying

AIMSA'06 Proceedings of the 12th international conference on Artificial Intelligence: methodology, Systems, and Applications
Mining paths of complex crowd scenes

ISVC'05 Proceedings of the First international conference on Advances in Visual Computing
AlchemistJ: a framework for self-adaptive software

EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
An intelligent adaptation system based on a self-growing engine

EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Aspects of optimal viewpoint selection and viewpoint fusion

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part II
Grey reinforcement learning for incomplete information processing

TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Self-organizing neural architecture for reinforcement learning

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
On the efficient implementation biologic reinforcement learning using eligibility traces

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Q-Learning with FCMAC in multi-agent cooperation

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Reinforcement learning-based tuning algorithm applied to fuzzy identification

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
On the selection of a transversal to solve nonlinear systems with interval arithmetic

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Discovery of stable peers in a self-organising peer-to-peer gradient topology

DAIS'06 Proceedings of the 6th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems
Teamwork formation for keepaway in robotics soccer (reinforcement learning approach)

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Multiagent reinforcement learning for a planetary exploration multirobot system

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Expectancy, ambiguity, and behavioral flexibility: Separable and complementary roles of the orbital frontal cortex and amygdala in processing reward expectancies

Journal of Cognitive Neuroscience
Keepaway soccer: from machine learning testbed to benchmark

RoboCup 2005
Learning to approach a moving ball with a simulated two-wheeled robot

RoboCup 2005
Selecting actions for resource-bounded information extraction using reinforcement learning

Proceedings of the fifth ACM international conference on Web search and data mining
A multiagent approach to managing air traffic flow

Autonomous Agents and Multi-Agent Systems
Toward autonomous robotic containment booms: visual servoing for robust inter-vehicle docking of surface vehicles

Intelligent Service Robotics
A hybrid cognitive/reactive intelligent agent autonomous path planning technique in a networked-distributed unstructured environment for reinforcement learning

The Journal of Supercomputing
Automated synthesis of action selection policies for unmanned vehicles operating in adverse environments

Autonomous Robots
Sequentially optimal repeated coalition formation under uncertainty

Autonomous Agents and Multi-Agent Systems
Neuroevolution with manifold learning for playing Mario

International Journal of Bio-Inspired Computation
Optimal tuning of continual online exploration in reinforcement learning

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Feature extraction for decision-theoretic planning in partially observable environments

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Reinforcement learning with echo state networks

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Reward function and initial values: better choices for accelerated goal-directed reinforcement learning

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Nearly optimal exploration-exploitation decision thresholds

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
A neural network module with pretuning for search and reproduction of input-output mapping

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
On the relation of slow feature analysis and laplacian eigenmaps

Neural Computation
The equilibrium of agent mind: the balance between agent theories and practice

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Quantitative µ-calculus analysis of power management in wireless networks

ICTAC'06 Proceedings of the Third international conference on Theoretical Aspects of Computing
Bounded rational search for on-the-fly model checking of LTL properties

FSEN'09 Proceedings of the Third IPM international conference on Fundamentals of Software Engineering
Multiple overlapping tiles for contextual monte carlo tree search

EvoApplicatons'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part I
An adaptive mobile system using mobile grid computing in wireless network

ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Rough sets and higher order vagueness

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part I
Behavioral pattern identification through rough set modelling

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
Towards finite-sample convergence of direct reinforcement learning

ECML'05 Proceedings of the 16th European conference on Machine Learning
Natural actor-critic

ECML'05 Proceedings of the 16th European conference on Machine Learning
Neural fitted q iteration – first experiences with a data efficient neural reinforcement learning method

ECML'05 Proceedings of the 16th European conference on Machine Learning
Using advice to transfer knowledge acquired in one reinforcement learning task to another

ECML'05 Proceedings of the 16th European conference on Machine Learning
The investigation of the agent in the artificial market

AIS'04 Proceedings of the 13th international conference on AI, Simulation, and Planning in High Autonomy Systems
Optimising natural language generation decision making for situated dialogue

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
"The day after the day after tomorrow?": a machine learning approach to adaptive temporal expression generation: training and evaluation with real users

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Reinforcement learning utilizes proxemics: An avatar learns to manipulate the position of people in immersive virtual reality

ACM Transactions on Applied Perception (TAP)
Automatic optimization of web recommendations using feedback and ontology graphs

ICWE'05 Proceedings of the 5th international conference on Web Engineering
The novel feature selection method based on emotion recognition system

ICIC'06 Proceedings of the 2006 international conference on Computational Intelligence and Bioinformatics - Volume Part III
Multiobjective water pinch analysis of the cuernavaca city water distribution network

EMO'05 Proceedings of the Third international conference on Evolutionary Multi-Criterion Optimization
Learning action sequences through imitation in behavior based architectures

ARCS'05 Proceedings of the 18th international conference on Architecture of Computing Systems conference on Systems Aspects in Organic and Pervasive Computing
Adaptive modeling: an approach and a method for implementing adaptive agents

MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
Agent based decision support system using reinforcement learning under emergency circumstances

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
A virtual reality platform for modeling cognitive development

Biomimetic Neural Learning for Intelligent Robots
Reinforcement learning using a grid based function approximator

Biomimetic Neural Learning for Intelligent Robots
Spatial representation and navigation in a bio-inspired robot

Biomimetic Neural Learning for Intelligent Robots
Autonomous vehicle steering based on evaluative feedback by reinforcement learning

MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Cost integration in multi-step viewpoint selection for object recognition

MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Abstract policy evaluation for reactive agents

SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Function approximation via tile coding: automating parameter choice

SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Feature-Discovering approximate value iteration methods

SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Multiagent association rules mining in cooperative learning systems

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
CBR for state value function approximation in reinforcement learning

ICCBR'05 Proceedings of the 6th international conference on Case-Based Reasoning Research and Development
Evolving small-board Go players using coevolutionary temporal difference learning with archives

International Journal of Applied Mathematics and Computer Science
Optimal motion planning by reinforcement learning in autonomous mobile vehicles

Robotica
K-Shortest paths q-routing: a new QoS routing algorithm in telecommunication networks

ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
Reduced-State SARSA featuring extended channel reassignment for dynamic channel allocation in mobile cellular networks

ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
A reinforcement learning approach for qos based routing packets in integrated service web based systems

AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
A genetic approach to data dimensionality reduction using a special initial population

IWINAC'05 Proceedings of the First international work-conference on the Interplay Between Natural and Artificial Computation conference on Artificial Intelligence and Knowledge Engineering Applications: a bioinspired approach - Volume Part II
Reinforcement learning based on multi-agent in robocup

ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Evolving agent societies through imitation controlled by artificial emotions

ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Nonlinear prediction by reinforcement learning

ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Enhanced therapeutic interactivity using social robot Zeno

Proceedings of the 4th International Conference on PErvasive Technologies Related to Assistive Environments
A combined reactive and reinforcement learning controller for an autonomous tracked vehicle

Robotics and Autonomous Systems
Duty cycle learning algorithm (DCLA) for IEEE 802.15.4 beacon-enabled wireless sensor networks

Ad Hoc Networks
The design and implementation of SAMIR

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Cognitive hybrid reasoning intelligent agent system

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Learning plans with patterns of actions in bounded-rational agents

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Finding hidden hierarchy in reinforcement learning

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
A neurobiologically motivated model for self-organized learning

MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Hybrid fuzzy/expert system to control grasping with deformation detection

MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Adaptive neuro-fuzzy-expert controller of a robotic gripper

MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Enhancing the automatic generation of hints with expert seeding

ITS'10 Proceedings of the 10th international conference on Intelligent Tutoring Systems - Volume Part II
A dynamic allocation method of basis functions in reinforcement learning

AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Error bounds in reinforcement learning policy evaluation

AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Stabilising hebbian learning with a third factor in a food retrieval task

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
An adaptive robot motivational system

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Incremental skill acquisition for self-motivated learning animats

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
A model of reaching that integrates reinforcement learning and population encoding of postures

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Combining self-organizing maps with mixtures of experts: application to an actor-critic model of reinforcement learning in the basal ganglia

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Experimental study on task teaching to real rats through interaction with a robotic rat

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Simbad: an autonomous robot simulation package for education and research

SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Self-organizing relays in LTE networks: queuing analysis and algorithms

Proceedings of the 7th International Conference on Network and Services Management
Reinforcement learning by chaotic exploration generator in target capturing task

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Automatic extraction system of a kidney region based on the q-learning

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Time does not always buy quality in co-evolutionary learning

SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Inducing effective pedagogical strategies using learning context features

UMAP'10 Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
MoCoA: customisable middleware for context-aware mobile applications

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
Reconciling strategic and tactical decision making in agent-oriented simulation of vehicles in urban traffic

Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques
Hierarchical neuro-fuzzy models based on reinforcement learning for intelligent agents

IWANN'05 Proceedings of the 8th international conference on Artificial Neural Networks: computational Intelligence and Bioinspired Systems
Learning teleoreactive logic programs from problem solving

ILP'05 Proceedings of the 15th international conference on Inductive Logic Programming
Learning multi-modal control programs

HSCC'05 Proceedings of the 8th international conference on Hybrid Systems: computation and control
Do micro-level tutorial decisions matter: applying reinforcement learning to induce pedagogical tutorial tactics

ITS'10 Proceedings of the 10th international conference on Intelligent Tutoring Systems - Volume Part I
Adaptive scalable video streaming in wireless networks

Proceedings of the 3rd Multimedia Systems Conference
Data mining techniques for robocup soccer agents

AIS-ADM 2005 Proceedings of the 2005 international conference on Autonomous Intelligent Systems: agents and Data Mining
Modeling the brain's operating system

BVAI'05 Proceedings of the First international conference on Brain, Vision, and Artificial Intelligence
A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment

Artificial Life and Robotics
Robot learning from demonstration by constructing skill trees

International Journal of Robotics Research
Incremental learning of full body motion primitives and their sequencing through human motion observation

International Journal of Robotics Research
A review of long-term memory in natural and synthetic systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Generating inspiration for agent design by reinforcement learning

Information and Software Technology
Dynamic cooperator selection in cognitive radio networks

Ad Hoc Networks
Learning to negotiate optimally in non-stationary environments

CIA'06 Proceedings of the 10th international conference on Cooperative Information Agents
Learning-Based spectrum selection in cognitive radio ad hoc networks

WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Multi-agent case-based reasoning for cooperative reinforcement learners

ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
A cooperation online reinforcement learning approach in ant-q

ICONIP'06 Proceedings of the 13 international conference on Neural Information Processing - Volume Part I
The interactive feature selection method development for an ANN based emotion recognition system

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Intelligent pairing assistant for air operation centers

Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
Learning automata-based approach to learn dialogue policies in large state space

International Journal of Intelligent Information and Database Systems
Rough ethology: towards a biologically-inspired study of collective behavior in intelligent systems with approximation spaces

Transactions on Rough Sets III
Emergent consensus in decentralised systems using collaborative reinforcement learning

Self-star Properties in Complex Information Systems
On the organisation of agent experience: scaling up social cognition

Socionics
A multi-agent approach to controlling a smart environment

Designing Smart Homes
Rough sets and vague concept approximation: from sample approximation to adaptive learning

Transactions on Rough Sets V
Efficient behavior learning by utilizing estimated state value of self and teammates

RoboCup 2009
An algorithm that recognizes and reproduces distinct types of humanoid motion based on periodically-constrained nonlinear PCA

RoboCup 2004
Chasing a Moving Target: Exploitation and Exploration in Dynamic Environments

Management Science
Adaptive stock trading with dynamic asset allocation using reinforcement learning

Information Sciences: an International Journal
Dynamic alternation of primate response properties during trial-and-error knowledge updating

Robotics and Autonomous Systems
Adaptive fraud detection using benford's law

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Partial local friendq multiagent learning: application to team automobile coordination problem

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Trace equivalence characterization through reinforcement learning

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
The K best-paths approach to approximate dynamic programming with application to portfolio optimization

AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Adaptive critic neural networks for identification of wheeled mobile robot

ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Efficient ant reinforcement learning using replacing eligibility traces

ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
A distributed learning control system for elevator groups

ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Online testing with reinforcement learning

FATES'06/RV'06 Proceedings of the First combined international conference on Formal Approaches to Software Testing and Runtime Verification
A time-frame based trust model for p2p systems

ICISC'06 Proceedings of the 9th international conference on Information Security and Cryptology
Abstraction and generalization in reinforcement learning: a summary and framework

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Replicator dynamics for multi-agent learning: an orthogonal approach

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Recursive adaptation of stepsize parameter for non-stationary environments

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Multiagent reinforcement learning model for the emergence of common property and transhumance in sub-saharan africa

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Coordinating learning agents for multiple resource job scheduling

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Effectiveness of considering state similarity for reinforcement learning

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Reducing the memory footprint of temporal difference learning over finitely many states by using case-based generalization

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Imitating inscrutable enemies: learning from stochastic policy observation, retrieval and reuse

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
A general introspective reasoning approach to web search for case adaptation

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Natural language interfaces to ontologies: combining syntactic analysis and ontology-based lookup through the user interaction

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part I
Efficient deep web crawling using reinforcement learning

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
A reinforcement learning approach for the flexible job shop scheduling problem

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Learning heuristic policies – a reinforcement learning problem

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Teaching a robot to perform task through imitation and on-line feedback

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Adaption of stepsize parameter using newton's method

PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
Stochastic abstract policies for knowledge transfer in robotic navigation tasks

MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Three automated stock-trading agents: a comparative study

AAMAS'04 Proceedings of the 6th AAMAS international conference on Agent-Mediated Electronic Commerce: theories for and Engineering of Distributed Mechanisms and Systems
A general multi-agent modelling framework for the transit assignment problem – a learning-based approach

IICS'04 Proceedings of the 4th international conference on Innovative Internet Community Systems
Experimentation system for efficient job performing in veterinary medicine area

ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part IV
*-MINIMAX performance in backgammon

CG'04 Proceedings of the 4th international conference on Computers and Games
Reinforcement distribution in continuous state action space fuzzy Q–learning: a novel approach

WILF'05 Proceedings of the 6th international conference on Fuzzy Logic and Applications
An afterstates reinforcement learning approach to optimize admission control in mobile cellular networks

EURO-NGI'05 Proceedings of the Second international conference on Wireless Systems and Network Architectures in Next Generation Internet
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning automata as a basis for multi agent reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Dealing with errors in a cooperative multi-agent learning system

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
An adaptive approach for the exploration-exploitation dilemma and its application to economic systems

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Sift and sort: climbing the semantic pyramid

ESOA'05 Proceedings of the Third international conference on Engineering Self-Organising Systems
Actor-Critic algorithm based on incremental least-squares temporal difference with eligibility trace

ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
A multi-agent reinforcement learning with weighted experience sharing

ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
Adaptive and non-adaptive distribution functions for DSA

PRIMA'10 Proceedings of the 13th international conference on Principles and Practice of Multi-Agent Systems
Learning form experience: a bayesian network based reinforcement learning approach

ICICA'11 Proceedings of the Second international conference on Information Computing and Applications
Adaptive multi-robot team reconfiguration using a policy-reuse reinforcement learning approach

AAMAS'11 Proceedings of the 10th international conference on Advanced Agent Technology
Exploration strategies for learning in multi-agent foraging

SEMCCO'11 Proceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part II
Admission control policies for a multi-class QoS-aware service oriented architecture

ACM SIGMETRICS Performance Evaluation Review
Statistical mechanics of reward-modulated learning in decision-making networks

Neural Computation
Tactile Guidance for Policy Adaptation

Foundations and Trends in Robotics
A new class of ε-optimal learning automata

ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing
Homeokinetic reinforcement learning

PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Co-learning segmentation in marketplaces

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Basis function discovery using spectral clustering and bisimulation metrics

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Solving sparse delayed coordination problems in multi-agent reinforcement learning

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Adaptive information presentation for spoken dialogue systems: evaluation with human subjects

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
Combining hierarchical reinforcement learning and Bayesian networks for natural language generation in situated dialogue

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
TATM: a trust mechanism for social traders in double auctions

AI'11 Proceedings of the 24th international conference on Advances in Artificial Intelligence
A novel crawling algorithm for web pages

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Effective search for Pittsburgh learning classifier systems via estimation of distribution algorithms

Information Sciences: an International Journal
Coverage rewarded: Test input generation via adaptation-based programming

ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
Application-aware dynamic spectrum access

Wireless Networks
Long-term information collection with energy harvesting wireless sensors: a multi-armed bandit based approach

Autonomous Agents and Multi-Agent Systems
Improving behavior of computer game bots using fictitious play

International Journal of Automation and Computing
Stochastic enforced hill-climbing

Journal of Artificial Intelligence Research
A reinforcement learning framework for spiking networks with dynamic synapses

Computational Intelligence and Neuroscience
ORACLE: Mobility control in wireless sensor and actor networks

Computer Communications
TIRAMOLA: elastic nosql provisioning through a cloud management platform

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Agile strategic information systems based on axiomatic agent architecture

iUBICOM'11 Proceedings of the 6th international conference on Ubiquitous and Collaborative Computing
Multifaceted web services: an approach to secure and scalable grid scheduling

EuroWeb'02 Proceedings of the 2002 international conference on EuroWeb
Enabling opportunistic and dynamic spectrum access through learning techniques

Wireless Communications & Mobile Computing
Reinforcement Programming

Computational Intelligence
Reinforcement learning as heuristic for action-rule preferences

ProMAS'10 Proceedings of the 8th international conference on Programming Multi-Agent Systems
Probabilistic argumentation frameworks

TAFA'11 Proceedings of the First international conference on Theory and Applications of Formal Argumentation
Self-Organizing reinforcement learning model

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Evaluation of the improved penalty avoiding rational policy making algorithm in real world environment

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Centralized and distributed task allocation in multi-robot teams via a stochastic clustering auction

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems

Knowledge-Based Systems
Multi-agent framework for real-time processing of large and dynamic search spaces

Proceedings of the 27th Annual ACM Symposium on Applied Computing
An integrated approach for healthcare planning over multi-dimensional data using long-term prediction

HIS'12 Proceedings of the First international conference on Health Information Science
Adaptive optimal control without weight transport

Neural Computation
The successor representation and temporal context

Neural Computation
Tracking the evolution of cooperation in complex networked populations

EvoBIO'12 Proceedings of the 10th European conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics
Monte-Carlo swarm policy search

SIDE'12 Proceedings of the 2012 international conference on Swarm and Evolutionary Computation
Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method

Neurocomputing
A competitive strategy for function approximation in Q-learning

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Using cases as heuristics in reinforcement learning: a transfer learning application

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Q-error as a selection mechanism in modular reinforcement-learning systems

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Sample efficient on-line learning of optimal dialogue policies with kalman temporal differences

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Risk-sensitive policies for sustainable renewable resource allocation

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Non-linear Monte-Carlo search in civilization II

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Integrated learning for goal-driven autonomy

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Integrating learning into a BDI Agent for environments with changing dynamics

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Regret minimization in multiplayer extensive games

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

Robotics and Autonomous Systems
Integrating particle swarm optimization with reinforcement learning in noisy problems

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Hierarchical task decomposition through symbiosis in reinforcement learning

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Sample aware embedded feature selection for reinforcement learning

Proceedings of the 14th annual conference on Genetic and evolutionary computation
Two-cornered learning classifier systems for pattern generation and classification

Proceedings of the 14th annual conference on Genetic and evolutionary computation
CMA-TWEANN: efficient optimization of neural networks via self-adaptation and seamless augmentation

Proceedings of the 14th annual conference on Genetic and evolutionary computation
An evaluation of pedagogical tutorial tactics for a natural language tutoring system: a reinforcement learning approach

International Journal of Artificial Intelligence in Education - Special issue on Best of ITS 2010
Enhancing the automatic generation of hints with expert seeding

International Journal of Artificial Intelligence in Education - Special issue on Best of ITS 2010
Robustness of optimal channel reservation using handover prediction in multiservice wireless networks

Wireless Networks
Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Multiple levels of spatial organization: World Graphs and spatial difference learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Bisimulation Metrics for Continuous Markov Decision Processes

SIAM Journal on Computing
The Knowledge Gradient Algorithm for a General Class of Online Learning Problems

Operations Research
Automatic discovery of ranking formulas for playing with multi-armed bandits

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Goal-Directed online learning of predictive models

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Gradient based algorithms with loss functions and kernels for improved on-policy control

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Active learning of MDP models

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Feature reinforcement learning in practice

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reinforcement learning with a bilinear q function

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
ℓ1-Penalized projected bellman residual

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Regularized least squares temporal difference learning with nested ℓ2 and ℓ1 penalization

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Recursive least-squares learning with eligibility traces

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Value function approximation through sparse bayesian modeling

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Automatic construction of temporally extended actions for MDPs using bisimulation metrics

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transferring evolved reservoir features in reinforcement learning tasks

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transfer learning via multiple inter-task mappings

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Multi-Task reinforcement learning: shaping and feature selection

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Batch, off-policy and model-free apprenticeship learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Introduction of fixed mode states into online profit sharing and its application to waist trajectory generation of biped robot

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
MapReduce for parallel reinforcement learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Compound reinforcement learning: theory and an application to finance

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Proposal and evaluation of the active course classification support system with exploitation-oriented learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Fuzzy epoch-incremental reinforcement learning algorithm

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part I
DCOPs and bandits: exploration and exploitation in decentralised coordination

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
V-MAX: tempered optimism for better PAC reinforcement learning

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Reinforcement learning transfer via sparse coding

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Decentralized Bayesian reinforcement learning for online agent collaboration

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Reinforcement learning from simultaneous human and MDP reward

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Transfer in reinforcement learning via shared features

The Journal of Machine Learning Research
Integrating a partial model into model free reinforcement learning

The Journal of Machine Learning Research
Optimistic Bayesian sampling in contextual-bandit problems

The Journal of Machine Learning Research
Memory formation, consolidation, and forgetting in learning agents

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Dynamic channel selection with reinforcement learning for cognitive WLAN over fiber

International Journal of Communication Systems
Tax Collections Optimization for New York State

Interfaces
Strategy-Based learning through communication with humans

KES-AMSTA'12 Proceedings of the 6th KES international conference on Agent and Multi-Agent Systems: technologies and applications
Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes

AI Communications
Community of scientist optimization: An autonomy oriented approach to distributed optimization

AI Communications - 18th RCRA International Workshop on “Experimental evaluation of algorithms for solving problems with combinatorial explosion”
Selecting vision operators and fixing their optimal parameters values using reinforcement learning

ICISP'12 Proceedings of the 5th international conference on Image and Signal Processing
Decentralised reinforcement learning for energy-efficient scheduling in wireless sensor networks

International Journal of Communication Networks and Distributed Systems
Beyond reward: the problem of knowledge and data

ILP'11 Proceedings of the 21st international conference on Inductive Logic Programming
Multiagent learning through neuroevolution

WCCI'12 Proceedings of the 2012 World Congress conference on Advances in Computational Intelligence
A novel feature sparsification method for kernel-based approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A rapid sparsification method for kernel machines in approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A modular hierarchical reinforcement learning algorithm

ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach

Fundamenta Informaticae - Swarm Intelligence
Single-player Monte-Carlo tree search for SameGame

Knowledge-Based Systems
A New Architecture for Learning Classifier Systems to Solve POMDP Problems

Fundamenta Informaticae
Optimal radio channel recommendations with explicit and implicit feedback

Proceedings of the sixth ACM conference on Recommender systems
Rough Set Approach to Behavioral Pattern Identification

Fundamenta Informaticae - New Frontiers in Scientific Discovery - Commemorating the Life and Work of Zdzislaw Pawlak
A fuzzy reinforcement learning approach for pre-congestion notification based admission control

AIMS'12 Proceedings of the 6th IFIP WG 6.6 international autonomous infrastructure, management, and security conference on Dependable Networks and Services
Distributed self-organized collaboration of autonomous IDS sensors

AIMS'12 Proceedings of the 6th IFIP WG 6.6 international autonomous infrastructure, management, and security conference on Dependable Networks and Services
An online kernel-based clustering approach for value function approximation

SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
An adaptive dialogue system with online dialogue policy learning

SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
Unstable path routing in urban-scale WSN

ACM SIGBED Review - Special Issue on the 3rd International Workshop on Networks of Cooperating Objects (CONET 2012)
Levels of realism for cooperative multi-agent reinforcement learning

ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part I
Reinforcement Learning with Approximation Spaces

Fundamenta Informaticae
Behavioral Pattern Identification Through Rough Set Modeling

Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Calculi of Approximation Spaces

Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Interactive information systems: Toward perception based computing

Theoretical Computer Science
Rough Sets and Vague Concepts

Fundamenta Informaticae - Contagious Creativity - In Honor of the 80th Birthday of Professor Solomon Marcus
$-Calculus of Bounded Rational Agents: Flexible Optimization as Search under Bounded Resources in Interactive Systems

Fundamenta Informaticae
Faster program adaptation through reward attribution inference

Proceedings of the 11th International Conference on Generative Programming and Component Engineering
A diversity dilemma in evolutionary markets

Proceedings of the 13th International Conference on Electronic Commerce
Market niching in multi-attribute computational resource allocation systems

Proceedings of the 13th International Conference on Electronic Commerce
A comparative study of reinforcement learning techniques on dialogue management

EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
A Flexible and Adaptive Hyper-heuristic Approach for (Dynamic) Capacitated Vehicle Routing Problems

Fundamenta Informaticae - Emergent Computing
Towards a Multiple-Lookahead-Levels agent reinforcement-learning technique and its implementation in integrated circuits

The Journal of Supercomputing
A multi-agent reinforcement learning approach to robot soccer

Artificial Intelligence Review
A cognitive WSN framework for highway safety based on weighted cognitive maps and Q-learning

Proceedings of the second ACM international symposium on Design and analysis of intelligent vehicular networks and applications
Managing Femto to Macro Interference without X2 Interface Support through POMDP

Mobile Networks and Applications
Learning and reasoning with action-related places for robust mobile manipulation

Journal of Artificial Intelligence Research
Learning to win by reading manuals in a monte-carlo framework

Journal of Artificial Intelligence Research
Real-world reinforcement learning for autonomous humanoid robot docking

Robotics and Autonomous Systems
Adaptive classification on brain-computer interfaces using reinforcement signals

Neural Computation
The basal ganglia optimize decision making over general perceptual hypotheses

Neural Computation
Learning high-level planning from text

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Generative goal-driven user simulation for dialog management

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Optimising incremental dialogue decisions using information density for interactive systems

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Framework of automatic text summarization using reinforcement learning

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
More for your money: exploiting performance heterogeneity in public clouds

Proceedings of the Third ACM Symposium on Cloud Computing
Constructing test collections by inferring document relevance via extracted relevant information

Proceedings of the 21st ACM international conference on Information and knowledge management
Function optimisation by learning automata

Information Sciences: an International Journal
Mobile robot navigation: neural Q-learning

International Journal of Computer Applications in Technology
SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology, and Policy

INFORMS Journal on Computing
Estimating interleaved comparison outcomes from historical click data

Proceedings of the 21st ACM international conference on Information and knowledge management
Improving the performance of the reinforcement learning model for answering complex questions

Proceedings of the 21st ACM international conference on Information and knowledge management
Analysis of solutions to the time-optimal planning and execution problem

Intelligent Service Robotics
Thinking Inside the Box: Controlling and Using an Oracle AI

Minds and Machines
Simultaneous policy update algorithms for learning the solution of linear continuous-time H∞ state feedback control

Information Sciences: an International Journal
Machine learning in agent-based stochastic simulation: Inferential theory and evaluation in transportation logistics

Computers & Mathematics with Applications
Multi-agent learning and control system using ants colony for packet scheduling in routers

APNOMS'07 Proceedings of the 10th Asia-Pacific conference on Network Operations and Management Symposium: managing next generation networks and services
Multi-armed bandit formulation of the task partitioning problem in swarm robotics

ANTS'12 Proceedings of the 8th international conference on Swarm Intelligence
Improving scheduling performance using a q-learning-based leasing policy for clouds

Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Autonomous shaping via coevolutionary selection of training experience

PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Distributed learning of best response behaviors in concurrent iterated many-object negotiations

MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Evolutionary dynamics of ant colony optimization

MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Q-Tree: automatic construction of hierarchical state representation for reinforcement learning

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
Adaptive planning for markov decision processes with uncertain transition models via incremental feature dependency discovery

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
APRIL: active preference learning-based reinforcement learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Autonomous data-driven decision-making in smart electricity markets

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Bayesian nonparametric inverse reinforcement learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Learning policies for battery usage optimization in electric vehicles

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Policy iteration based on a learned transition model

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Active learning of inverse models with intrinsically motivated goal exploration in robots

Robotics and Autonomous Systems
Adaptive reservoir computing through evolution and learning

Neurocomputing
Adaptive exploration using stochastic neurons

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
Low complexity proto-value function learning from sensory observations with incremental slow feature analysis

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
Integration of static and self-motion-based depth cues for efficient reaching and locomotor actions

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Making a reinforcement learning agent believe

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Biologically plausible multi-dimensional reinforcement learning in neural networks

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Understanding the role of serotonin in basal ganglia through a unified model

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
A computational model of motor areas based on bayesian networks and most probable explanations

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Learning-Based test programming for programmers

ISoLA'12 Proceedings of the 5th international conference on Leveraging Applications of Formal Methods, Verification and Validation: technologies for mastering change - Volume Part I
Gradient algorithms for exploration/exploitation trade-offs: global and local variants

ANNPR'12 Proceedings of the 5th INNS IAPR TC 3 GIRPR conference on Artificial Neural Networks in Pattern Recognition
Extracting key gene regulatory dynamics for the direct control of mechanical systems

PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part I
Multi-agent task division learning in hide-and-seek games

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Learning motion controllers with adaptive depth perception

EUROSCA'12 Proceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation
Learning motion controllers with adaptive depth perception

Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Video search and indexing with reinforcement agent for interactive multimedia services

ACM Transactions on Embedded Computing Systems (TECS) - Special issue on embedded systems for interactive multimedia services (ES-IMS)
Reinforcement learning approach to multi-stage decision making problems with changes in action sets

Artificial Life and Robotics
Developing reinforcement learning for adaptive co-construction of continuous high-dimensional state and action spaces

Artificial Life and Robotics
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
A distributed Q-learning approach for variable attention to multiple critics

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Sparse gradient-based direct policy search

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
Learn to swing up and balance a real pole based on raw visual input data

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Robot dancing: adapting robot dance to human preferences

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Upper confidence tree-based consistent reactive planning application to minesweeper

LION'12 Proceedings of the 6th international conference on Learning and Intelligent Optimization
Evaluation of a family of reinforcement learning cross-domain optimization heuristics

LION'12 Proceedings of the 6th international conference on Learning and Intelligent Optimization
Reinforcement learning transfer using a sparse coded inter-task mapping

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Local coordination in online distributed constraint optimization problems

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Multi-agent learning and the reinforcement gradient

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Recognizing internal states of other agents to anticipate and coordinate interactions

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Observer effect from stateful resources in agent sensing

Autonomous Agents and Multi-Agent Systems
Applying a framework for healthcare incentives simulation

Proceedings of the Winter Simulation Conference
An adaptive simulator for ML-rules

Proceedings of the Winter Simulation Conference
Model-based adaptive spatial sampling for occurrence map construction

Statistics and Computing
A learning strategy for software testing optimization based on dynamic programming

Proceedings of the Fourth Asia-Pacific Symposium on Internetware
Adaptive value function approximation for continuous-state stochastic dynamic programming

Computers and Operations Research
Scheduling fighter aircraft maintenance with reinforcement learning

Proceedings of the Winter Simulation Conference
Stochastic policy search for variance-penalized semi-Markov control

Proceedings of the Winter Simulation Conference
A sampled fictitious play based learning algorithm for infinite horizon Markov decision processes

Proceedings of the Winter Simulation Conference
Organizational Learning as Credit Assignment: A Model and Two Experiments

Organization Science
A spiking neural model for stable reinforcement of synapses based on multiple distal rewards

Neural Computation
Learning classifier system with average reward reinforcement learning

Knowledge-Based Systems
A "Society of Mind" Cognitive Architecture Based on the Principles of Artificial Economics

International Journal of Artificial Life Research
Reusing historical interaction data for faster online learning to rank for IR

Proceedings of the sixth ACM international conference on Web search and data mining
Reinforcement Learning with Reward Shaping and Mixed Resolution Function Approximation

International Journal of Agent Technologies and Systems
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management

International Journal of Agent Technologies and Systems
Simulation Analysis for Choice of Binary Lotteries

Computational Economics
Asymptotic non-learnability of universal agents with computable horizon functions

Theoretical Computer Science
Two-step gradient-based reinforcement learning for underwater robotics behavior learning

Robotics and Autonomous Systems
Simulating Cooperative Behaviors in Dynamic Networks

International Journal of Agent Technologies and Systems
Adaptive Kansei Search Method Using User's Subjective Criterion Deviation

International Journal of Computer Vision and Image Processing
Game designers training first person shooter bots

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Exploration / exploitation trade-off in mobile context-aware recommender systems

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
On-Line model-based continuous state reinforcement learning using background knowledge

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
XCS with adaptive action mapping

SEAL'12 Proceedings of the 9th international conference on Simulated Evolution and Learning
Exploiting user feedback for adapting mobile interaction obtrusiveness

UCAmI'12 Proceedings of the 6th international conference on Ubiquitous Computing and Ambient Intelligence
Modular value iteration through regional decomposition

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Avoiding unintended AI behaviors

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Decision support for safe AI design

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
On measuring social intelligence: experiments on competition and cooperation

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Space-Time embedded intelligence

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Memory issues of intelligent agents

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Deconstructing reinforcement learning in sigma

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
On ensemble techniques for AIXI approximation

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Motivation management in AGI systems

AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Optimizing spectrum trading in cognitive mesh network using machine learning

Journal of Electrical and Computer Engineering - Special issue on Resource Allocation in Communications and Computing
A survey of point-based POMDP solvers

Autonomous Agents and Multi-Agent Systems
TEXPLORE: real-time sample-efficient reinforcement learning for robots

Machine Learning
Safe exploration of state and action spaces in reinforcement learning

Journal of Artificial Intelligence Research
From dynamic movement primitives to associative skill memories

Robotics and Autonomous Systems
Sourcing strategies in supply risk management: An approximate dynamic programming approach

Computers and Operations Research
Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks

Wireless Personal Communications: An International Journal
Human-robot cross-training: computational formulation, modeling and evaluation of a human team training strategy

Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
Security aspects in the cognition cycle of distributed cognitive radio networks: a survey from a multi-agent perspective

International Journal of Ad Hoc and Ubiquitous Computing
Learning non-myopically from human-generated reward

Proceedings of the 2013 international conference on Intelligent user interfaces
A hierarchical representation policy iteration algorithm for reinforcement learning

IScIDE'12 Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
Applying reinforcement learning for web pages ranking algorithms

Applied Soft Computing
Transferring task models in Reinforcement Learning agents

Neurocomputing
A state-dependent time evolving multi-constraint routing algorithm

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Policy sharing between multiple mobile robots using decision trees

Information Sciences: an International Journal
Generating artificial neural networks for value function approximation in a domain requiring a shifting strategy

EvoApplications'13 Proceedings of the 16th European conference on Applications of Evolutionary Computation
Non-reciprocating Sharing Methods in Cooperative Q-Learning Environments

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
A Hybrid Cooperative Behavior Learning Method for a Rule-Based Shout-Ahead Architecture

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Knowledge-Based Exploration for Reinforcement Learning in Self-Organizing Neural Networks

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Abstraction in Model Based Partially Observable Reinforcement Learning Using Extended Sequence Trees

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Routing in Distributed Cognitive Radio Networks: A Survey

Wireless Personal Communications: An International Journal
MQ-Routing: Mobility-, GPS- and energy-aware routing protocol in MANETs for disaster relief scenarios

Ad Hoc Networks
Affective touch gesture recognition for a furry zoomorphic machine

Proceedings of the 7th International Conference on Tangible, Embedded and Embodied Interaction
Analysis of strategy in robot soccer game

Neurocomputing
Neuroevolution results in emergence of short-term memory in multi-goal environment

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Selection strategy for XCS with adaptive action mapping

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Hybrid POMDP based evolutionary adaptive framework for efficient visual tracking algorithms

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Solving the distal reward problem with rare correlations

Neural Computation
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm

Neurocomputing
Using reinforcement learning and artificial evolution for the detection of group identities in complex adaptive artificial societies

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Extended rule-based genetic network programming

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Dynamic Pay-Per-Action Mechanisms and Applications to Online Advertising

Operations Research
DCOB: Action space for reinforcement learning of high DoF robots

Autonomous Robots
Learning with configurable operators and RL-based heuristics

NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
Learning to crawl deep web

Information Systems
Neural representation of reward probability: Evidence from the illusion of control

Journal of Cognitive Neuroscience
Popularity-based relevance propagation

Journal of Web Engineering
Simulation, learning, and optimization techniques in Watson's game strategies

IBM Journal of Research and Development
Self-organized collaboration of distributed IDS sensors

DIMVA'12 Proceedings of the 9th international conference on Detection of Intrusions and Malware, and Vulnerability Assessment
Forward and backward feature selection in gradient-based MDP algorithms

MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Interaction-based group identity detection via reinforcement learning and artificial evolution

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Generation of tests for programming challenge tasks using multi-objective optimization

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Testing probabilistic equivalence through Reinforcement Learning

Information and Computation
Finding your way in the testing jungle: a learning approach to web security testing

Proceedings of the 2013 International Symposium on Software Testing and Analysis
Learning classifier systems: introducing the user-friendly textbook

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Efficient sample reuse in policy gradients with parameter-based exploration

Neural Computation
From occasional choices to inevitable musts: a computational model of nicotine addiction

Computational Intelligence and Neuroscience
Towards a deeper understanding of cooperative equilibrium: characterization and complexity

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Emergence of social norms through collective learning in networked agent societies

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using informative behavior to increase engagement in the tamer framework

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Smart exploration in reinforcement learning using absolute temporal difference errors

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Addressing the policy-bias of q-learning by repeating updates

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Teaching on a budget: agents advising agents in reinforcement learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Learning exploration strategies in model-based reinforcement learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
CLEAN rewards for improving multiagent coordination in the presence of exploration

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Learning in non-stationary MDPs as transfer learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Decentralized coordination via task decomposition and reward shaping

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using response probability to build system redundancy in multiagent systems

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
A generic adaptive simulation algorithm for component-based simulation systems

Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Supporting adaptation of decentralized software based on application scenarios

Journal of Systems and Software
A slotted CSMA based reinforcement learning approach for extending the lifetime of underwater acoustic wireless sensor networks

Computer Communications
Effective search for genetic-based machine learning systems via estimation of distribution algorithms and embedded feature reduction techniques

Neurocomputing
Distributed self-learning scheduling approach for wireless sensor network

Ad Hoc Networks
Online learning in a chemical perceptron

Artificial Life
On the complexity of trial and error

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
Learning resources in federated environments: a broken link checker based on URL similarity

International Journal of Metadata, Semantics and Ontologies
Building a social multi-agent system simulation management toolbox

Proceedings of the 6th Balkan Conference in Informatics
Corticostriatal contributions to musical expectancy perception

Journal of Cognitive Neuroscience
Hierarchical control by a higher center and the rhythm generator contributes to realize adaptive locomotion

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Autonomous task partitioning in robot foraging: an approach based on cost estimation

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Inverse reinforcement learning for interactive systems

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Shared control of a robot using EEG-based feedback signals

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Emotion-oriented agent in mental state transition learning network

International Journal of Computational Intelligence Studies
Architecture of a cyberphysical avatar

Proceedings of the ACM/IEEE 4th International Conference on Cyber-Physical Systems
Performance bounds for λ policy iteration and application to the game of Tetris

The Journal of Machine Learning Research
On Potential Cognitive Abilities in the Machine Kingdom

Minds and Machines
Finite-sample analysis of least-squares policy iteration

The Journal of Machine Learning Research
Dynamic policy programming

The Journal of Machine Learning Research
Linear fitted-Q iteration with multiple reward functions

The Journal of Machine Learning Research
Reinforcement learning for cooperative sensing gain in cognitive radio ad hoc networks

Wireless Networks
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model

Machine Learning
Using historical click data to increase interleaving sensitivity

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
On the moral responsibility of military robots

Ethics and Information Technology
Joint Admission Control and Channel Selection Based on Multi Response Learning Automata (MRLA) in Cognitive Radio Networks

Wireless Personal Communications: An International Journal
Multi-criteria expertness based cooperative Q-learning

Applied Intelligence
Learning policies for battery usage optimization in electric vehicles

Machine Learning
A reinforcement learning approach to autonomous decision-making in smart electricity markets

Machine Learning
Scenario Trees and Policy Selection for Multistage Stochastic Programming Using Machine Learning

INFORMS Journal on Computing
Modelling mental rotation in cognitive robots

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Mental imagery in the navigation domain: a computational model of sensory-motor simulation mechanisms

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A vision for a stochastic reasoner for autonomic cloud deployment

Proceedings of the Second Nordic Symposium on Cloud Computing & Internet Technologies
An English-Language Argumentation Interface for Explanation Generation with Markov Decision Processes in the Domain of Academic Advising

ACM Transactions on Interactive Intelligent Systems (TiiS)
A novel reinforcement learning architecture for continuous state and action spaces

Advances in Artificial Intelligence
Vicarious reinforcement and ex ante law enforcement: a study in norm-governed learning agents

Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law
Robust Regulation Adaptation in Multi-Agent Systems

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Spike-timing-dependent construction

Neural Computation
Probabilistic model-based imitation learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Review: Vulnerabilities in cognitive radio networks: A survey

Computer Communications
Survey An appraisal and design of a multi-agent system based cooperative wireless intrusion detection computational intelligence technique

Engineering Applications of Artificial Intelligence
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination

Artificial Intelligence
Evolutionary robotics approach to odor source localization

Neurocomputing
Reinforcement learning in robotics: A survey

International Journal of Robotics Research
A study of ex ante law enforcement in norm-governed learning agents

JSAI-isAI'12 Proceedings of the 2012 international conference on New Frontiers in Artificial Intelligence
Learning epistemic actions in model-free memory-free reinforcement learning: experiments with a neuro-robotic model

Living Machines'13 Proceedings of the Second international conference on Biomimetic and Biohybrid Systems
Reward shaping for statistical optimisation of dialogue management

SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
A model of emotional intelligent agent for cooperative goal exploration

ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories
AHPM as a proposal to improve interaction with air traffic controllers

HCI'13 Proceedings of the 15th international conference on Human-Computer Interaction: interaction modalities and techniques - Volume Part IV
Technical Section: Goal directed multi-finger manipulation: Control policies and analysis

Computers and Graphics
Toward nonlinear local reinforcement learning rules through neuroevolution

Neural Computation
Fidelity, Soundness, and Efficiency of Interleaved Comparison Methods

ACM Transactions on Information Systems (TOIS)
Models of gaze control for manipulation tasks

ACM Transactions on Applied Perception (TAP)
Strategic cognitive sequencing: a computational cognitive neuroscience approach

Computational Intelligence and Neuroscience - Special issue on Neurocognitive Models of Sense Making
Intelligent controllers for bi-objective dynamic scheduling on a single machine with sequence-dependent setups

Applied Soft Computing
An intelligent broker agent for energy trading: an MDP approach

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Efficiently solving joint activity based security games

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Monte-Carlo expectation maximization for decentralized POMDPs

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Sufficiency-based selection strategy for MCTS

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Prior-free exploration bonus for and beyond near bayes-optimal behavior

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Online expectation maximization for reinforcement learning in POMDPs

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Meta-interpretive learning of higher-order dyadic datalog: predicate invention revisited

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Employing batch reinforcement learning to control gene regulation without explicitly constructing gene regulatory networks

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Lifelong learning for acquiring the wisdom of the crowd

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Towards a second generation random walk planner: an experimental exploration

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Fault-tolerant planning under uncertainty

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
On stochastic optimal control and reinforcement learning by approximate inference (extended abstract)

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Hedonic value: enhancing adaptation for motivated agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Extending sensorimotor contingency theory: prediction, planning, and action generation

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Using reinforcement learning to find an optimal set of features

Computers & Mathematics with Applications
Behavior Selection Using Utility-Based Reinforcement Learning in Irregular Warfare Simulation Models

International Journal of Operations Research and Information Systems
Adaptivity on the robot brain architecture level using reinforcement learning

Robot Soccer World Cup XV
Camera modeling technique of 3D sensing based on tile coding for computer vision

BodyNets '13 Proceedings of the 8th International Conference on Body Area Networks
A brain-computer interface for high-level remote control of an autonomous, reinforcement-learning-based robotic system for reaching and grasping

Proceedings of the 19th international conference on Intelligent User Interfaces
Reduction of state space in reinforcement learning by sensor selection

Artificial Life and Robotics
Reinforcement learning models for scheduling in wireless networks

Frontiers of Computer Science: Selected Publications from Chinese Universities
Q-learning Reward Propagation Method for Reducing the Transmission Power of Sensor Nodes in Wireless Sensor Networks

Wireless Personal Communications: An International Journal
Monte-Carlo tree search for Bayesian reinforcement learning

Applied Intelligence
Learning via human feedback in continuous state and action spaces

Applied Intelligence
Towards a Self-Learning Agent: Using Ranking Functions as a Belief Representation in Reinforcement Learning

Neural Processing Letters
Reinforcement learning based routing in wireless mesh networks

Wireless Networks
Full-range adaptive cruise control based on supervised adaptive dynamic programming

Neurocomputing
Fast damage recovery in robotics with the T-resilience algorithm

International Journal of Robotics Research
Towards a real-time interface between a biomimetic model of sensorimotor cortex and a robotic arm

Pattern Recognition Letters
Scheduling a dynamic aircraft repair shop with limited repair resources

Journal of Artificial Intelligence Research
Distributed reasoning for multiagent simple temporal problems

Journal of Artificial Intelligence Research
Analysis of watson's strategies for playing Jeopardy!

Journal of Artificial Intelligence Research
The arcade learning environment: an evaluation platform for general agents

Journal of Artificial Intelligence Research
Learning by observation of agent software images

Journal of Artificial Intelligence Research
Bi-LCQ: A low-weight clustering-based Q-learning approach for NoCs

Microprocessors & Microsystems
Robustness of stochastic bandit policies

Theoretical Computer Science
Universal knowledge-seeking agents

Theoretical Computer Science
General time consistent discounting

Theoretical Computer Science
Construction of approximation spaces for reinforcement learning

The Journal of Machine Learning Research
Counterfactual reasoning and learning systems: the example of computational advertising

The Journal of Machine Learning Research
Frontal theta oscillatory activity is a common mechanism for the computation of unexpected outcomes and learning rate

Journal of Cognitive Neuroscience
How we learn to make decisions: Rapid propagation of reinforcement learning prediction errors in humans

Journal of Cognitive Neuroscience
A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation

Artificial Life and Robotics
Efficient bidding strategies for Cliff-Edge problems

Autonomous Agents and Multi-Agent Systems
Multiagent learning in the presence of memory-bounded agents

Autonomous Agents and Multi-Agent Systems
Dopamine ramps are a consequence of reward prediction errors

Neural Computation
Learning potential functions and their representations for multi-task reinforcement learning

Autonomous Agents and Multi-Agent Systems
Optimal learning for sequential sampling with non-parametric beliefs

Journal of Global Optimization
Embodied imitation-enhanced reinforcement learning in multi-agent systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A cognitive framework for WSN based on weighted cognitive maps and Q-learning

Ad Hoc Networks
Self-healing in transparent optical packet switching mesh networks: A reinforcement learning perspective

Computer Networks: The International Journal of Computer and Telecommunications Networking
Hierarchical control of traffic signals using Q-learning with tile coding

Applied Intelligence
Reinforcement Learning for Multiple Access Control in Wireless Sensor Networks: Review, Model, and Open Issues

Wireless Personal Communications: An International Journal
Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning

Journal of Intelligent and Robotic Systems
Simulation Analysis for Network Formulation

Computational Economics
Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous Teams

Journal of Intelligent and Robotic Systems
Multiagent meta-level control for radar coordination

Web Intelligence and Agent Systems
MineralMiner: An active sensing simulation environment

Multiagent and Grid Systems
Analysis of emission right prices in greenhouse gas emission trading via agent-based model

Multiagent and Grid Systems
A survey of multi-objective sequential decision-making

Journal of Artificial Intelligence Research
Scalable and efficient bayes-adaptive reinforcement learning based on monte-carlo tree search

Journal of Artificial Intelligence Research
Exact and parallel metaheuristic algorithms for the single processor total weighted completion time scheduling problem with the sum-of-processing-time based models

Computers and Operations Research
A tour of machine learning: An AI perspective

AI Communications - ECAI 2012 Turing and Anniversary Track
Artificial Intelligence: From programs to solvers

AI Communications - ECAI 2012 Turing and Anniversary Track
Interactive activity recognition and prompting to assist people with cognitive disabilities

Journal of Ambient Intelligence and Smart Environments - Home-based Health and Wellness Measurement and Monitoring
Adaptive function approximation in reinforcement learning with an interpolating growing neural gas

International Journal of Hybrid Intelligent Systems
A novel multi-agent system utilizing quantum-inspired evolution for demand side management in the future smart grid

Integrated Computer-Aided Engineering
Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection

Intelligent Data Analysis
Automatic skill acquisition in reinforcement learning using graph centrality measures

Intelligent Data Analysis
A comparison between a communication-based and a data mining-based learning approach for agents

Intelligent Decision Technologies
METAL: A framework for mixture-of-experts task and attention learning

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Similarity of learned helplessness in human being and fuzzy reinforcement learning algorithms

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Computational intelligence models for image processing and information reasoning
Active noise control system via multi-agent credit assignment

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Experimental Evaluation of Automatic Hint Generation for a Logic Tutor

International Journal of Artificial Intelligence in Education - Best of AIED 2011
A multi-agent control architecture for a robotic wheelchair

Applied Bionics and Biomechanics
Behaviour generation in humanoids by learning potential-based policies from constrained motion

Applied Bionics and Biomechanics
Multi-timescale nexting in a reinforcement learning robot

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
RLAM: A dynamic and efficient reinforcement learning-based adaptive mapping scheme in mobile WiMAX networks

Mobile Information Systems

Quantified Score

Hi-index	0.02

Visualization

Abstract

From the Publisher:In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.