An optimal-control application of two paradigms of on-line learning
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Markov decision processes in large state spaces
COLT '95 Proceedings of the eighth annual conference on Computational learning theory
Learning curve bounds for a Markov decision process with undiscounted rewards
COLT '96 Proceedings of the ninth annual conference on Computational learning theory
Machine Learning - Special issue on inductive transfer
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Tree based discretization for continuous state space reinforcement learning
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot
Machine Learning - Special issue on learning in autonomous robots
An adaptive agent bidding strategy based on stochastic modeling
Proceedings of the third annual conference on Autonomous Agents
Machine Learning
Colearning in Differential Games
Machine Learning
Machine Learning
Efficient exploration for optimizing immediate reward
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
A Nonlinear Noise-Shaping Delta-Sigma Modulator with On-Chip Reinforcement Learning^{*}
Analog Integrated Circuits and Signal Processing - Special issue on Learning on Silicon
Multi-agent reinforcement learning for planning and conflict resolution in a dynamic domain
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Adaptivity in agent-based routing for data networks
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Distributed reinforcement learning for a traffic engineering application
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Learning sequences of actions in collectives of autonomous agents
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
An approach to the analysis and design of multiagent systems based on interaction frames
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
Designing agent collectives for systems with markovian dynamics
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Construction of a learning agent handling its rewards according to environmental situations
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Integrated learning for interactive synthetic characters
Proceedings of the 29th annual conference on Computer graphics and interactive techniques
Machine learning and inductive logic programming for multi-agent systems
Mutli-agents systems and applications
Learning Sequences of Compatible Actions Among Agents
Artificial Intelligence Review
Analog VLSI Stochastic Perturbative Learning Architectures
Analog Integrated Circuits and Signal Processing
Printed Circuit Board Design via Organizational-Learning Agents
Applied Intelligence
Reinforcement Learning in the Multi-Robot Domain
Autonomous Robots
Module-Based Reinforcement Learning: Experiments with a Real Robot
Autonomous Robots
Dynamics of a Classical Conditioning Model
Autonomous Robots
Reinforcement Learning Soccer Teams with Incomplete World Models
Autonomous Robots
Target Reaching by Using Visual Information and Q-learning Controllers
Autonomous Robots
Making Organizational Learning Operational: Implications from Learning Classifier Systems
Computational & Mathematical Organization Theory
Reinforced Genetic Programming
Genetic Programming and Evolvable Machines
An Integrated Approach of Learning, Planning, and Execution
Journal of Intelligent and Robotic Systems
Relational Reinforcement Learning
Machine Learning
Kernel-Based Reinforcement Learning
Machine Learning
Near-Optimal Reinforcement Learning in Polynomial Time
Machine Learning
Learning intelligent behavior in a non-stationary and partially observable environment
Artificial Intelligence Review
Control of exploitation-exploration meta-parameter in reinforcement learning
Neural Networks - Computational models of neuromodulation
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
Neural Processing Letters
The anticipatory classifier system and genetic generalization
Natural Computing: an international journal
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies
Autonomous Agents and Multi-Agent Systems
Pricing in Agent Economies Using Multi-Agent Q-Learning
Autonomous Agents and Multi-Agent Systems
Predicting the Expected Behavior of Agents that Learn About Agents: The CLRI Framework
Autonomous Agents and Multi-Agent Systems
A Framework for Learning in Search-Based Systems
IEEE Transactions on Knowledge and Data Engineering
Learning Optimal Robotic Tasks
IEEE Expert: Intelligent Systems and Their Applications
Optimal control using the transport equation: the Liouville machine
Adaptive Behavior
Machines that learn to play games
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Electronic Institutions as a Framework for Agents' Negotiation and Mutual Commitment
EPIA '01 Proceedings of the10th Portuguese Conference on Artificial Intelligence on Progress in Artificial Intelligence, Knowledge Extraction, Multi-agent Systems, Logic Programming and Constraint Solving
Module Based Reinforcement Learning: An Application to a Real Robot
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Robot Learning Using Gate-Level Evolvable Hardware
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Vision Based State Space Construction for Learning Mobile Robots in Multi-agent Environments
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
A Framework for Supporting Intelligent Fault and Performance Management for Communication Networks
MMNS '01 Proceedings of the 4th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
Minimizing Transmission Costs through Adaptive Marking in Differentiated Services Networks
MMNS '02 Proceedings of the 5th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
An Integrated On-Line Learning System for Evolving Programmable Logic Array Controllers
PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
A Reinforcement Learning with Condition Reduced Fuzz Rules
SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Minimax Fuzzy Q-Learning in Cooperative Multi-agent Systems
ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Modular-Fuzzy Cooperation Algorithm for Multi-agent Systems
ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
MUTANT: A Genetic Learning System
AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
Machine Learning and Inductive Logic Programming for Multi-agent Systems
EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Reinforcement Learning for Control of Traffic and Access Points in Intelligent Wireless ATM Networks
Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Sequential Strategy for Learning Multi-stage Multi-agent Collaborative Games
ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Market-Based Reinforcement Learning in Partially Observable Worlds
ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Learning Multi-agent Strategies in Multi-stage Collaborative Games
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Lempel-Ziv Coding in Reinforcement Learning
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
An Introduction to Learning Fuzzy Classifier Systems
Learning Classifier Systems, From Foundations to Applications
Fuzzy and Crisp Representations of Real-Valued Input for Learning Classifier Systems
Learning Classifier Systems, From Foundations to Applications
Probability-Enhanced Predictions in the Anticipatory Classifier System
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Rationality of Reward Sharing in Multi-agent Reinforcement Learning
PRIMA '99 Proceedings of the Second Pacific Rim International Workshop on Multi-Agents: Approaches to Intelligent Agents
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer
RoboCup 2001: Robot Soccer World Cup V
VQQL. Applying Vector Quantization to Reinforcement Learning
RoboCup-99: Robot Soccer World Cup III
Open Theoretical Questions in Reinforcement Learning
EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Application of Episodic Q-Learning to a Multi-agent Cooperative Task
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning
Sequence Learning - Paradigms, Algorithms, and Applications
Sequential Decision Making Based on Direct Search
Sequence Learning - Paradigms, Algorithms, and Applications
Communication and Interaction with Learning Agents in Virtual Soccer
VW '00 Proceedings of the Second International Conference on Virtual Worlds
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning
ATAL '00 Proceedings of the 7th International Workshop on Intelligent Agents VII. Agent Theories Architectures and Languages
Implicit Negotiation in Repeated Games
ATAL '01 Revised Papers from the 8th International Workshop on Intelligent Agents VIII
COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
PAC Bounds for Multi-armed Bandit and Markov Decision Processes
COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning
IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Sequential cost-sensitive decision making with reinforcement learning
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of the self-organising map to reinforcement learning
Neural Networks - New developments in self-organizing maps
The design of collectives of agents to control non-Markovian systems
Eighteenth national conference on Artificial intelligence
Exploring artificial intelligence in the new millennium
Social learning mechanisms compared in a simple environment
ICAL 2003 Proceedings of the eighth international conference on Artificial life
Towards a pareto-optimal solution in general-sum games
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A selection-mutation model for q-learning in multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Advice-exchange in heterogeneous groups of learning agents
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A Taxonomy for artificial embryogeny
Artificial Life
The Ant Colony Optimization paradigm for combinatorial optimization
Advances in evolutionary computing
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
Reinforcement learning based on local state feature learning and policy adjustment
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Introduction to multimedia and mobile agents
On the convergence of optimistic policy iteration
The Journal of Machine Learning Research
ε-mdps: learning in varying environments
The Journal of Machine Learning Research
Adaptive Radial Basis Decomposition by Learning Vector Quantization
Neural Processing Letters
Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
Probability in the Engineering and Informational Sciences
A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL
Probability in the Engineering and Informational Sciences
Call admission control in cellular networks: a reinforcement learning solution
International Journal of Network Management
An experimental evaluation of reinforcement learning for gain scheduling
Design and application of hybrid intelligent systems
Employing OLAP mining for multiagent reinforcement learning
Design and application of hybrid intelligent systems
The Journal of Machine Learning Research
Dynamic abstraction in reinforcement learning via clustering
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate
Web Intelligence and Agent Systems
Incremental heuristic search in AI
AI Magazine
Best-Response Multiagent Learning in Non-Stationary Environments
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Run the GAMUT: A Comprehensive Approach to Evaluating Game-Theoretic Algorithms
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Unifying Temporal and Structural Credit Assignment Problems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
An Architecture for Persistent Reactive Behavior
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning from Multiple Sources
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Planning, learning and coordination in multiagent decision processes
TARK '96 Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge
No Pervasive Computing without Intelligent Systems
BT Technology Journal
Building agents to serve customers
AI Magazine
Basic Ideas for Event-Based Optimization of Markov Systems
Discrete Event Dynamic Systems
Learning and Exploiting Relative Weaknesses of Opponent Agents
Autonomous Agents and Multi-Agent Systems
Coordinating Multiple Agents via Reinforcement Learning
Autonomous Agents and Multi-Agent Systems
IEEE Internet Computing
An Architecture for Behavior-Based Reinforcement Learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process
Proceedings of the 2005 ACM symposium on Applied computing
Neighboring crossover to improve GA-based Q-learning method for multi-legged robot control
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Optimal Control Using the Transport Equation: The Liouville Machine
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Q-learning of sequential attention for visual object recognition from informative local descriptors
ICML '05 Proceedings of the 22nd international conference on Machine learning
Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A middleware for autonomic QoS management based on learning
SEM '05 Proceedings of the 5th international workshop on Software engineering and middleware
Local Reinforcement and Recombination in Classifier Systems
Evolutionary Computation
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games
Autonomous Agents and Multi-Agent Systems
Adaptive dialogue systems - interaction with interact
SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms
Neural Computation
PAC model-free reinforcement learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Mobile Networks and Applications - Special issue: Recent advances in wireless networking
Small-scale peer-to-peer overlays
ACM SIGOPS Operating Systems Review
A hierarchical approach to efficient reinforcement learning in deterministic domains
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to commit in repeated games
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
A comprehensive review of nature inspired routing algorithms for fixed telecommunication networks
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Nature-inspired applications and systems
The Knowledge Engineering Review
A reinforcement learning approach to active camera foveation
Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
Combining expert advice in reactive environments
Journal of the ACM (JACM)
IEEE Transactions on Mobile Computing
Cooperative transportation system for humanoid robots using simulation-based learning
Applied Soft Computing
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
Cooperative Transportation by Humanoid Robots - Solving Piano Movers' Problem
International Journal of Hybrid Intelligent Systems
Using multi-agent systems for learning optimal policies for complex problems
ACM-SE 45 Proceedings of the 45th annual southeast regional conference
Proceedings of the 2006 international conference on Game research and development
If multi-agent learning is the answer, what is the question?
Artificial Intelligence
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
The Journal of Machine Learning Research
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents
dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
The Multi-Agent Data Collection in HLA-Based Simulation System
Proceedings of the 21st International Workshop on Principles of Advanced and Distributed Simulation
Modeling embodied visual behaviors
ACM Transactions on Applied Perception (TAP)
On developmental mental architectures
Neurocomputing
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension
Evolutionary Computation
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of SONQL for real-time learning of robot behaviors
Robotics and Autonomous Systems
A reinforcement agent for threshold fusion
Applied Soft Computing
Adaptive stepsize selection for tracking in a regime-switching environment
Automatica (Journal of IFAC)
Reinforcement learning in multi-agent environment and ant colony for packet scheduling in routers
Proceedings of the 5th ACM international workshop on Mobility management and wireless access
Application of reinforcement learning to the game of Othello
Computers and Operations Research
Modeling motivation for adaptive nonplayer characters in dynamic computer game worlds
Computers in Entertainment (CIE) - Theoretical and Practical Computer Applications in Entertainment
Classifier fitness based on accuracy
Evolutionary Computation
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Batch reinforcement learning in a complex domain
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed path planning for mobile robots using a swarm of interacting reinforcement learners
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Learning the meaning of action commands based on "no news is good news" criterion
Proceedings of the 2007 workshop on Multimodal interfaces in semantic interaction
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Artificial Intelligence
Learning how to combine sensory-motor functions into a robust behavior
Artificial Intelligence
Teachable robots: Understanding human teaching behavior to build more effective robot learners
Artificial Intelligence
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning
International Journal of Robotics Research
Wireless Personal Communications: An International Journal
On the convergence of stochastic iterative dynamic programming algorithms
Neural Computation
Biologically-inspired adaptive learning control strategies: A rough set approach
International Journal of Hybrid Intelligent Systems
Knowledge propagation in a distributed omnidirectional vision system
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Marco Somalvico Memorial Issue
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
International Journal of Knowledge-based and Intelligent Engineering Systems
Recursive least squares and quadratic prediction in continuous multistep problems
Proceedings of the 10th annual conference companion on Genetic and evolutionary computation
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective
The Journal of Machine Learning Research
Accelerated Neural Evolution through Cooperatively Coevolved Synapses
The Journal of Machine Learning Research
Norm emergence under constrained interactions in diverse societies
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Expediting RL by using graphical structures
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Adaptive Kanerva-based function approximation for multi-agent systems
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Reinforcement learning for problems with symmetrical restricted states
Robotics and Autonomous Systems
Adaptiveness in Agent Communication: Application and Adaptation of Conversation Patterns
Agent Communication II
Reinforcement Learning in Fine Time Discretization
ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
State Space Partition for Reinforcement Learning Based on Fuzzy Min-Max Neural Network
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
A Novel Method of Constructing ANN
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Competition and Coordination in Stochastic Games
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Anticipatory Behavior in Adaptive Learning Systems
Combining the Best of the Two Worlds: Inheritance Versus Experience
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs
ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Making a Robot Learn to Play Soccer Using Reward and Punishment
KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Perception and Developmental Learning of Affordances in Autonomous Robots
KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Model-Based Reinforcement Learning in a Complex Domain
RoboCup 2007: Robot Soccer World Cup XI
A Design of Reward Function Based on Knowledge in Multi-agent Learning
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Route Optimization Using Q-Learning for On-Demand Bus Systems
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Rational Bidding Using Reinforcement Learning
GECON '08 Proceedings of the 5th international workshop on Grid Economics and Business Models
State-Dependent Exploration for Policy Gradient Methods
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation
MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Spatial Abstraction: Aspectualization, Coarsening, and Conceptual Classification
Proceedings of the international conference on Spatial Cognition VI: Learning, Reasoning, and Talking about Space
Robot Navigation Based on Fuzzy RL Algorithm
ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
Applying Reinforcement Learning to Multi-robot Team Coordination
HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Meta-case-based reasoning: self-improvement through self-understanding
Journal of Experimental & Theoretical Artificial Intelligence
An online multi-agent co-operative learning algorithm in POMDPs
Journal of Experimental & Theoretical Artificial Intelligence
Service diffusion in the market considering consumers' subjective value
CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Experimental analysis of eligibility traces strategies in temporal difference learning
International Journal of Knowledge Engineering and Soft Data Paradigms
Reinforcement Learning on a Futures Market Simulator
KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Semantic Relatedness Measure Using Object Properties in an Ontology
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Designing Toys That Come Alive: Curious Robots for Creative Play
ICEC '08 Proceedings of the 7th International Conference on Entertainment Computing
Q-Strategy: A Bidding Strategy for Market-Based Allocation of Grid Services
OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
Reinforcement Learning for Decision Making in Sequential Visual Attention
Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Recent Advances in Reinforcement Learning
A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Learning-Rate Adjusting Q-Learning for Prisoner's Dilemma Games
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
International Journal of Intelligent Systems Technologies and Applications
ACE '08 Proceedings of the 2008 International Conference on Advances in Computer Entertainment Technology
Effects of chaotic exploration on reinforcement learning in target capturing task
International Journal of Knowledge-based and Intelligent Engineering Systems
Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
Autonomous Agents and Multi-Agent Systems
Measurement of Underlying Cooperation in Multiagent Reinforcement Learning
PRIMA '08 Proceedings of the 11th Pacific Rim International Conference on Multi-Agents: Intelligent Agents and Multi-Agent Systems
A learning classifier system for mazes with aliasing clones
Natural Computing: an international journal
Imitation guided learning in learning classifier systems
Natural Computing: an international journal
Negotiation Model Supporting Co-Allocation for Grid Scheduling
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Integrated cognitive architectures: a survey
Artificial Intelligence Review
Reinforcement distribution in fuzzy Q-learning
Fuzzy Sets and Systems
Modeling reinforcement learning algorithms for performance analysis
Proceedings of the International Conference on Advances in Computing, Communication and Control
Boosting the performance of computing systems through adaptive configuration tuning
Proceedings of the 2009 ACM symposium on Applied Computing
Basal Ganglia Models for Autonomous Behavior Learning
Creating Brain-Like Intelligence
An Optimal Approximate Dynamic Programming Algorithm for the Lagged Asset Acquisition Problem
Mathematics of Operations Research
An autonomic architecture for optimizing QoE in multimedia access networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
A policy-based framework for autonomic reconfiguration management in heterogeneous networks
Proceedings of the 7th International Conference on Mobile and Ubiquitous Multimedia
Static strategy and dynamic adjustment: An effective method for Grid task scheduling
Future Generation Computer Systems
Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning
Proceedings of the 18th ACM international symposium on High performance distributed computing
A new marketing strategy map for direct marketing
Knowledge-Based Systems
Using distributed w-learning for multi-policy optimization in decentralized autonomic systems
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Generalized model learning for reinforcement learning in factored domains
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An empirical analysis of value function-based and policy search reinforcement learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multiagent learning in large anonymous games
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Comparing trust mechanisms for monitoring aggregator nodes in sensor networks
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Opponent Modeling in Adversarial Environments through Learning Ingenuity
Proceedings of the 2005 conference on Self-Organization and Autonomic Informatics (I)
Fast Learning in an Actor-Critic Architecture with Reward and Punishment
Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Reinforcement Learning with Classifier Selection for Focused Crawling
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games
KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Reinforcement learning for robot soccer
Autonomous Robots
EDA-RL: estimation of distribution algorithms for reinforcement learning problems
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Learning in the time-dependent minority game
Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
TEMMAS: The Electricity Market Multi-Agent Simulator
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
The kNN-TD Reinforcement Learning Algorithm
IWINAC '09 Proceedings of the 3rd International Work-Conference on The Interplay Between Natural and Artificial Computation: Part I: Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira's Scientific Legacy
Multi-agent Reinforcement Learning in Network Management
AIMS '09 Proceedings of the 3rd International Conference on Autonomous Infrastructure, Management and Security: Scalability of Networks and Services
A q-learning based adaptive bidding strategy in combinatorial auctions
Proceedings of the 11th International Conference on Electronic Commerce
QUICR-learning for multi-agent coordination
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Hard constrained semi-Markov decision processes
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Harnessing migrations in a market-based grid OS
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
Markov Decision Processes with Arbitrary Reward Processes
Mathematics of Operations Research
Prediction of solar conditions by emotional learning
Intelligent Data Analysis
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games
Web Intelligence and Agent Systems
Agent-based simulation of electricity markets: a survey of tools
Artificial Intelligence Review
Machine learning in digital games: a survey
Artificial Intelligence Review
A DR algorithm based on artificial potential field method
Multimedia Tools and Applications
Parallel Algorithms for Solving Markov Decision Process
ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Potential-based shaping in model-based reinforcement learning
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Hierarchical reinforcement learning with the MAXQ value function decomposition
Journal of Artificial Intelligence Research
Accelerating reinforcement learning by composing solutions of automatically identified subtasks
Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods
Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox
Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
Risk-sensitive reinforcement learning applied to control under constraints
Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems
Journal of Artificial Intelligence Research
Behavior bounding: an efficient method for high-level behavior comparison
Journal of Artificial Intelligence Research
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
On partially controlled multi-agent systems
Journal of Artificial Intelligence Research
Dynamic non-Bayesian decision making
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Direct code access in self-organizing neural networks for reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Emergence of norms through social learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Qualitative map learning based on co-visibility of objects
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Topology and Memory Effect on Convention Emergence
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Multi-agent based modeling of liver detoxification
SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques
Robotics and Autonomous Systems
Robot weightlifting by direct policy search
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning against opponents with bounded memory
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Learning to act using real-time dynamic programming
Artificial Intelligence
Effective learning in the presence of adaptive counterparts
Journal of Algorithms
Neuroevolution strategies for episodic reinforcement learning
Journal of Algorithms
Assured end-to-end QoS through adaptive marking in multi-domain differentiated services networks
Computer Communications
IEEE Journal on Selected Areas in Communications - Special issue on wireless and pervasive communications for healthcare
A reward field model generation in Q-learning by dynamic programming
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human
Ant colony optimization incorporated with fuzzy Q-learning for reinforcement fuzzy control
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
A Q-learning approach to derive optimal consumption and investment strategies
IEEE Transactions on Neural Networks
Interaction, observance or both? Study of the effects on convention emergence
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Route optimisation using evolutionary approaches for on-demand pickup problem
International Journal of Advanced Intelligence Paradigms
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine
Interaction, observance or both? Study of the effects on convention emergence
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Hybridization of cognitive models using evolutionary strategies
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
A Model of Neuronal Specialization Using Hebbian Policy-Gradient with "Slow" Noise
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
A Swarm-Based Learning Method Inspired by Social Insects
ICIC '07 Proceedings of the 3rd International Conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence
ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
A Q-learning model-independent flow controller for high-speed networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
Nash Q-learning multi-agent flow control for high-speed networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
Transfer of knowledge for a climbing virtual human: a reinforcement learning approach
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Associating domain-dependent knowledge and Monte Carlo approaches within a Go program
Information Sciences: an International Journal
Hybrid Q-learning algorithm about cooperation in MAS
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
An adaptive inventory control for a supply chain
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Fuzzy Q-learning in a nondeterministic environment: developing an intelligent Ms. Pac-Man agent
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Real-valued Q-learning in multi-agent cooperation
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Implementation of fuzzy Q-learning based on modular fuzzy model and parallel structured learning
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
The improvement of Q-learning applied to imperfect information game
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
A new mobile robot navigation method using fuzzy logic and a modified Q-learning algorithm
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Online reinforcement learning for dynamic multimedia systems
IEEE Transactions on Image Processing
Online adaptive policies for ensemble classifiers
Neurocomputing
Encoding robotic sensor states for Q-learning using the self-organizing map
Journal of Computing Sciences in Colleges
Reinforcement Learning in Finite MDPs: PAC Analysis
The Journal of Machine Learning Research
Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Constructing an autonomous agent with an interdependent heuristics
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Q-learning with linear function approximation
COLT'07 Proceedings of the 20th annual conference on Learning theory
Context aware life pattern prediction using fuzzy-state Q-learning
ICOST'07 Proceedings of the 5th international conference on Smart homes and health telematics
Reinforcement learning scheme for grouping and anti-predator behavior
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Virtual markets: Q-learning sellers with simple state representation
AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Online learning of task-driven object-based visual attention control
Image and Vision Computing
Skill combination for reinforcement learning
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
A novel ANN model based on quantum computational MAS theory
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Two-layer networked learning control of a nonlinear HVAC system
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
A k-NN based perception scheme for reinforcement learning
EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
Reinforcement learning of predictive features in affordance perception
Proceedings of the 2006 international conference on Towards affordance-based robot control
Approximate Dynamic Programming for Ambulance Redeployment
INFORMS Journal on Computing
A state-cluster based Q-learning
ICNC'09 Proceedings of the 5th international conference on Natural computation
Study on traffic signal control based on Q-learning
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 3
Cooperative learning using advice exchange
Adaptive agents and multi-agent systems
Multiagent learning for open systems: a study in opponent classification
Adaptive agents and multi-agent systems
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks
Information Sciences: an International Journal
Applying reinforcement learning to scheduling strategies in an actual grid environment
International Journal of High Performance Systems Architecture
CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
A cat-like robot real-time learning to run
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Joint path and wavelength selection using Q-learning in optical burst switching networks
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Application of reinforcement learning to autonomous heading control for bionic underwater robots
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
A novel method for strategy acquisition and its application to a double-auction market game
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
IEEE Journal on Selected Areas in Communications
Autonomous Agents and Multi-Agent Systems
Evolutionary mechanism design: a review
Autonomous Agents and Multi-Agent Systems
Autonomous Agents and Multi-Agent Systems
Multi-task evolutionary shaping without pre-specified representations
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
CCNC'10 Proceedings of the 7th IEEE conference on Consumer communications and networking conference
High-level reinforcement learning in strategy games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Frequency adjusted multi-agent Q-learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Planning against fictitious players in repeated normal form games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Extending adaptive fuzzy behavior hierarchies to multiple levels of composite behaviors
Robotics and Autonomous Systems
Modeling Behavior Cycles as a Value System for Developmental Robots
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Model-free control based on reinforcement learning for a wastewater treatment problem
Applied Soft Computing
Learning to follow navigational directions
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Computing optimal policies for partially observable decision processes using compact representations
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
From cognition to docition: The teaching radio paradigm for distributed & autonomous deployments
Computer Communications
Proceedings of the 3rd ACM workshop on Artificial intelligence and security
Docitive networks: an emerging paradigm for dynamic spectrum management
IEEE Wireless Communications
A Human-Robot Collaborative Reinforcement Learning Algorithm
Journal of Intelligent and Robotic Systems
Rule acquisition for cognitive agents by using estimation of distribution algorithms
International Journal of Knowledge Engineering and Soft Data Paradigms
Multi-goal Q-learning of cooperative teams
Expert Systems with Applications: An International Journal
Multi-policy optimization in self-organizing systems
SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Why and how hippocampal transition cells can be used in reinforcement learning
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Reinforcement learning scheme for grouping and characterization of multi-agent network
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Effects of social network topology and options on norm emergence
COIN'09 Proceedings of the 5th international conference on Coordination, organizations, institutions, and norms in agent systems
Evaluation of techniques for a learning-driven modeling methodology in multiagent simulation
MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evolutionary dynamics of regret minimization
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Towards constraint optimal control of greenhouse climate
LSMS/ICSEE'10 Proceedings of the 2010 international conference on Life system modeling and simulation and intelligent computing, and 2010 international conference on Intelligent computing for sustainable energy and environment: Part III
Exploring continuous action spaces with diffusion trees for reinforcement learning
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging
Journal of Intelligent and Robotic Systems
Auto-exploratory average reward reinforcement learning
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Reinforcement learning based resource allocation in business process management
Data & Knowledge Engineering
Self-learning fuzzy logic controllers for pursuit-evasion differential games
Robotics and Autonomous Systems
To adapt or not to adapt: consequences of adapting driver and traffic light agents
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Continuous-state reinforcement learning with fuzzy approximation
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Bifurcation analysis of reinforcement learning agents in the Selten's horse game
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Solving multi-stage games with hierarchical learning automata that bootstrap
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Web-based multi-agent system architecture in a dynamic environment
International Journal of Knowledge-based and Intelligent Engineering Systems
Adaptation-based programming in java
Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
A consideration of human immunity-based reinforcement learning with continuous states
Artificial Life and Robotics
An information-spectrum approach to analysis of return maximization in reinforcement learning
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Predicting and compensating for lexicon access errors
Proceedings of the 16th international conference on Intelligent user interfaces
Swarm reinforcement learning method based on an actor-critic method
SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Extended Q-learning algorithm for path-planning of a mobile robot
SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Reduct based Q-learning: an introduction
Proceedings of the 2011 International Conference on Communication, Computing & Security
ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part III
State representation with perceptual constancy based on active motion
ICSR'10 Proceedings of the Second international conference on Social robotics
Information Collection on a Graph
Operations Research
Self-organizing state aggregation for architecture design of Q-learning
Information Sciences: an International Journal
Path selection in disaster response management based on Q-learning
International Journal of Automation and Computing
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Sampled fictitious play for approximate dynamic programming
Computers and Operations Research
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds
ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
Supporting smart interactions with predictive analytics
The smart internet
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Knowledge of opposite actions for reinforcement learning
Applied Soft Computing
Supporting smart interactions with predictive analytics
The smart internet
Software agent with reinforcement learning approach for medical image segmentation
Journal of Computer Science and Technology
Ambulance redeployment: an approximate dynamic programming approach
Winter Simulation Conference
Coordination control of greenhouse environmental factors
International Journal of Automation and Computing
Using reinforcement learning for controlling an elastic web application hosting platform
Proceedings of the 8th ACM international conference on Autonomic computing
Proceedings of the 2011 workshop on Organic computing
A dynamic route change mechanism for mobile ad hoc networks
International Journal of Communication Networks and Distributed Systems
Learning chasing behaviours of non-player characters in games using SARSA
EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
A Multi-State Q-Learning Approach for the Dynamic Load Balancing of Time Warp
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Learning to manage combined energy supply systems
Proceedings of the 17th IEEE/ACM international symposium on Low-power electronics and design
A Monte-Carlo AIXI approximation
Journal of Artificial Intelligence Research
A probabilistic approach for maintaining trust based on evidence
Journal of Artificial Intelligence Research
Multiagent learning in large anonymous games
Journal of Artificial Intelligence Research
Learning in minority games with multiple resources
ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part II
HCII'11 Proceedings of the 1st international conference on Human interface and the management of information: interacting with information - Volume Part II
Voting in multi-agent system for improvement of partial observations
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
A distributed reinforcement learning approach for solving optimization problems
CIT'11 Proceedings of the 5th WSEAS international conference on Communications and information technology
Evolving subjective utilities: Prisoner's Dilemma game examples
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Comparing humans and AI agents
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
How to playwell in non-zero sum games: some lessons from generalized traveler's dilemma
AMT'11 Proceedings of the 7th international conference on Active media technology
Preference-based policy iteration: leveraging preference learning for reinforcement learning
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Empirical and theoretical support for lenient learning
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
The evolution of rules for conflicts resolution in self-organizing teams
Expert Systems with Applications: An International Journal
SD-Q: selective discount Q learning based on new results of intertemporal choice theory
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Principled methods for biasing reinforcement learning agents
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Social welfare for automatic innovation
MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
An artificial market for efficient allocation of road transport networks
MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Value-difference based exploration: adaptive control between epsilon-greedy and softmax
KI'11 Proceedings of the 34th Annual German conference on Advances in artificial intelligence
Obstacle avoidance of redundant manipulators using neural networks based reinforcement learning
Robotics and Computer-Integrated Manufacturing
Overcoming Omniscience in Axelrod's Model
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
A self-adaptive routing paradigm for wireless mesh networks based on reinforcement learning
Proceedings of the 14th ACM international conference on Modeling, analysis and simulation of wireless and mobile systems
Correlated action effects in decision theoretic regression
UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Evaluating a reinforcement learning algorithm with a general intelligence test
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Quantum reinforcement learning
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
An adaptive approach for the exploration-exploitation dilemma for learning agents
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
Teachable characters: user studies, design principles, and learning performance
IVA'06 Proceedings of the 6th international conference on Intelligent Virtual Agents
Efficient non-linear control through neuroevolution
ECML'06 Proceedings of the 17th European conference on Machine Learning
Using meta-level control with reinforcement learning to improve the performance of the agents
FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Testing probabilistic equivalence through reinforcement learning
FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Experience cooperative sharing in cross-layer cognitive radio for real-time multimedia communication
Proceedings of the 4th International Conference on Cognitive Radio and Advanced Spectrum Management
Opponent learning for multi-agent system simulation
RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Ensemble pruning using reinforcement learning
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
MABS'04 Proceedings of the 2004 international conference on Multi-Agent and Multi-Agent-Based Simulation
A general framework for analyzing the optimal call admission control in DS-CDMA cellular network
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part II
A machine learning approach to intraday trading on foreign exchange markets
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Institutionalization through reciprocal habitualization and typification
WRAC'05 Proceedings of the Second international conference on Radical Agent Concepts: innovative Concepts for Autonomic and Agent-Based Systems
Learning-based ship design optimization approach
Computer-Aided Design
Kernel-Based reinforcement learning
ICIC'06 Proceedings of the 2006 international conference on Intelligent Computing - Volume Part I
A hybrid learning strategy for discovery of policies of action
IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
AlchemistJ: a framework for self-adaptive software
EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Grey reinforcement learning for incomplete information processing
TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Self-organizing neural architecture for reinforcement learning
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Q learning based on self-organizing fuzzy radial basis function network
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Selecting actions for resource-bounded information extraction using reinforcement learning
Proceedings of the fifth ACM international conference on Web search and data mining
Optimal tuning of continual online exploration in reinforcement learning
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
An adaptive strategy for energy-efficient data collection in sparse wireless sensor networks
EWSN'10 Proceedings of the 7th European conference on Wireless Sensor Networks
Self-organized and evolvable cognitive architecture for intelligent agents and multi-agent systems
EvoApplicatons'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part I
Machine learning of plan robustness knowledge about instances
ECML'05 Proceedings of the 16th European conference on Machine Learning
Communication, diversity and learning: cornerstones of swarm behavior
SAB'04 Proceedings of the 2004 international conference on Swarm Robotics
S2A: secure smart household appliances
Proceedings of the second ACM conference on Data and Application Security and Privacy
SCIA'05 Proceedings of the 14th Scandinavian conference on Image Analysis
Multiobjective water pinch analysis of the cuernavaca city water distribution network
EMO'05 Proceedings of the Third international conference on Evolutionary Multi-Criterion Optimization
Reinforcement learning using a grid based function approximator
Biomimetic Neural Learning for Intelligent Robots
Cost integration in multi-step viewpoint selection for object recognition
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Multiagent association rules mining in cooperative learning systems
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
K-Shortest paths q-routing: a new QoS routing algorithm in telecommunication networks
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
Reinforcement learning by chaotic exploration generator in target capturing task
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Generating inspiration for agent design by reinforcement learning
Information and Software Technology
Aggressive joint access and backhaul design for distributed-cognition 1gbps/km2 system architecture
WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Multi-agent case-based reasoning for cooperative reinforcement learners
ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
Emergence of flocking behavior based on reinforcement learning
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Dialog strategy acquisition and its evaluation for efficient learning of word meanings by agents
EELC'06 Proceedings of the Third international conference on Emergence and Evolution of Linguistic Communication: symbol Grounding and Beyond
Learning automata-based approach to learn dialogue policies in large state space
International Journal of Intelligent Information and Database Systems
Trace equivalence characterization through reinforcement learning
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Mobile p2p automatic content sharing by ontology-based and contextualized integrative negotiation
DEECS'06 Proceedings of the Second international conference on Data Engineering Issues in E-Commerce and Services
Improvement of air handling unit control performance using reinforcement learning
PKAW'06 Proceedings of the 9th Pacific Rim Knowledge Acquisition international conference on Advances in Knowledge Acquisition and Management
Efficient deep web crawling using reinforcement learning
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
A reinforcement learning approach for the flexible job shop scheduling problem
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Aspects of active norm learning and the effect of lying on norm emergence in agent societies
PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
Reinforcement distribution in continuous state action space fuzzy Q–learning: a novel approach
WILF'05 Proceedings of the 6th international conference on Fuzzy Logic and Applications
Learning pareto-optimal solutions in 2x2 conflict games
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Self-Organizing reinforcement learning model
ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Non-intrusive policy optimization for dependable and adaptive service-oriented systems
Proceedings of the 27th Annual ACM Symposium on Applied Computing
A time-constrained SLA negotiation strategy in competitive computational grids
Future Generation Computer Systems
Strategy learning for autonomous agents in smart grid markets
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
A model of attentional impairments in autism: first steps toward a computational theory
Cognitive Systems Research
Accelerating evolution via egalitarian social learning
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Value function approximation through sparse bayesian modeling
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Automatic construction of temporally extended actions for MDPs using bisimulation metrics
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Compound reinforcement learning: theory and an application to finance
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reputation-Aware learning for SLA negotiation
IFIP'12 Proceedings of the 2012 international conference on Networking
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
V-MAX: tempered optimism for better PAC reinforcement learning
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Adaptive agents on evolving networks
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Task allocation in mesh structure: 2side leapfrog algorithm and q-learning based algorithm
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IV
GRiDA: GReen Distributed Algorithm for energy-efficient IP backbone networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Selecting vision operators and fixing their optimal parameters values using reinforcement learning
ICISP'12 Proceedings of the 5th international conference on Image and Signal Processing
Multiagent learning through neuroevolution
WCCI'12 Proceedings of the 2012 World Congress conference on Advances in Computational Intelligence
A modular hierarchical reinforcement learning algorithm
ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Overhead-Controlled routing in WSNs with reinforcement learning
IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
Learning Classification Programs: The Genetic Algorithm Approach
Fundamenta Informaticae
The Journal of Supercomputing
Managing Femto to Macro Interference without X2 Interface Support through POMDP
Mobile Networks and Applications
Computers & Mathematics with Applications
Learning to achieve socially optimal solutions in general-sum games
PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Distributed learning of best response behaviors in concurrent iterated many-object negotiations
MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Evolutionary dynamics of ant colony optimization
MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Q-Tree: automatic construction of hierarchical state representation for reinforcement learning
ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
Planning interactive task for intelligent characters
Computer Animation and Virtual Worlds
Multi-agent task division learning in hide-and-seek games
AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Multi-agent learning and the reinforcement gradient
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Recognizing internal states of other agents to anticipate and coordinate interactions
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
How to design agent-based simulation models using agent learning
Proceedings of the Winter Simulation Conference
Learning classifier system with average reward reinforcement learning
Knowledge-Based Systems
Simulating plausible mechanisms for changing hepatic xenobiotic clearance patterns
Proceedings of the Winter Simulation Conference
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management
International Journal of Agent Technologies and Systems
Two-step gradient-based reinforcement learning for underwater robotics behavior learning
Robotics and Autonomous Systems
Exploiting user feedback for adapting mobile interaction obtrusiveness
UCAmI'12 Proceedings of the 6th international conference on Ubiquitous Computing and Ambient Intelligence
On measuring social intelligence: experiments on competition and cooperation
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
A social network-based trust-aware propagation model for P2P systems
Knowledge-Based Systems
Information Sciences: an International Journal
Robust convention emergence in social networks through self-reinforcing structures dissolution
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
On Finding and Learning Effective Strategies for Complex Non-zero-sum Repeated Games
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Behavior Abstraction Robustness in Agent Modeling
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Knowledge-Based Exploration for Reinforcement Learning in Self-Organizing Neural Networks
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Expert Systems with Applications: An International Journal
From model-based control to data-driven control: Survey, classification and perspective
Information Sciences: an International Journal
Learning with configurable operators and RL-based heuristics
NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
Information Systems
An investigation into the development of service-oriented robotic systems
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Testing probabilistic equivalence through Reinforcement Learning
Information and Computation
Emergence of social norms through collective learning in networked agent societies
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
iCO2: multi-user eco-driving training environment based on distributed constraint optimization
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Smart exploration in reinforcement learning using absolute temporal difference errors
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Object focused q-learning for autonomous agents
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
The Journal of Machine Learning Research
Wireless Personal Communications: An International Journal
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Backward Q-learning: The combination of Sarsa algorithm and Q-learning
Engineering Applications of Artificial Intelligence
Online learning of timeout policies for dynamic power management
ACM Transactions on Embedded Computing Systems (TECS)
Resource-bounded machines are motivated to be effective, efficient, and curious
AGI'13 Proceedings of the 6th international conference on Artificial General Intelligence
Toward nonlinear local reinforcement learning rules through neuroevolution
Neural Computation
The dynamics of reinforcement social learning in cooperative multiagent systems
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Wireless Personal Communications: An International Journal
Gait Pattern Based on CMAC Neural Network for Robotic Applications
Neural Processing Letters
Reinforcement learning based routing in wireless mesh networks
Wireless Networks
Adaptive learning algorithm of self-organizing teams
Expert Systems with Applications: An International Journal
Reinforcement learning algorithms with function approximation: Recent advances and applications
Information Sciences: an International Journal
The arcade learning environment: an evaluation platform for general agents
Journal of Artificial Intelligence Research
Construction of approximation spaces for reinforcement learning
The Journal of Machine Learning Research
Scheduling sensors for monitoring sentient spaces using an approximate POMDP policy
Pervasive and Mobile Computing
Journal of Cognitive Neuroscience
Hybrid motion graph for character motion synthesis
Journal of Visual Languages and Computing
A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation
Artificial Life and Robotics
Multiagent learning in the presence of memory-bounded agents
Autonomous Agents and Multi-Agent Systems
Learning potential functions and their representations for multi-task reinforcement learning
Autonomous Agents and Multi-Agent Systems
Embodied imitation-enhanced reinforcement learning in multi-agent systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Active tracking and pursuit under different levels of occlusion: a two-layer approach
Machine Vision and Applications
Computer Networks: The International Journal of Computer and Telecommunications Networking
Hierarchical control of traffic signals using Q-learning with tile coding
Applied Intelligence
Analysis of emission right prices in greenhouse gas emission trading via agent-based model
Multiagent and Grid Systems
Self-organized femtocells: a Fuzzy Q-Learning approach
Wireless Networks
Adaptive function approximation in reinforcement learning with an interpolating growing neural gas
International Journal of Hybrid Intelligent Systems
Hi-index | 0.01 |
\cal Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states.This paper presents and proves in detail a convergence theorem for \cal Q-learning based on that outlined in Watkins (1989). We show that \cal Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many \cal Q values can be changed each iteration, rather than just one.