Communications of the ACM
Empirical model-building and response surface
Empirical model-building and response surface
Theory of linear and integer programming
Theory of linear and integer programming
Stochastic optimal control: theory and application
Stochastic optimal control: theory and application
Dynamic programming: deterministic and stochastic models
Dynamic programming: deterministic and stochastic models
Stochastic systems: estimation, identification and adaptive control
Stochastic systems: estimation, identification and adaptive control
Parallel and distributed computation: numerical methods
Parallel and distributed computation: numerical methods
Learning automata: an introduction
Learning automata: an introduction
Proceedings of the seventh international conference (1990) on Machine learning
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
Generalization and scaling in reinforcement learning
Advances in neural information processing systems 2
A survey of algorithmic methods for partially observed Markov decision processes
Annals of Operations Research
Reinforcement learning in Markovian and non-Markovian environments
NIPS-3 Proceedings of the 1990 conference on Advances in neural information processing systems 3
Adaptation in natural and artificial systems
Adaptation in natural and artificial systems
Practical Issues in Temporal Difference Learning
Machine Learning
Technical Note: \cal Q-Learning
Machine Learning
The Convergence of TD(λ) for General λ
Machine Learning
Reinforcement learning and its application to control
Reinforcement learning and its application to control
Learning in embedded systems
The complexity of stochastic games
Information and Computation
Reinforcement learning for robots using neural networks
Reinforcement learning for robots using neural networks
Efficient learning and planning within the Dyna framework
Adaptive Behavior
Efficient reinforcement learning
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
TD-Gammon, a self-teaching backgammon program, achieves master-level play
Neural Computation
Associative Reinforcement Learning: Functions in k-DNF
Machine Learning
Associative Reinforcement Learning: A Generate and Test Algorithm
Machine Learning
TD(λ) Converges with Probability 1
Machine Learning
Memoryless policies: theoretical limitations and practical results
SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
A comparison of Q-learning and classifier systems
SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
Learning to solve Markovian decision processes
Learning to solve Markovian decision processes
Asynchronous Stochastic Approximation and Q-Learning
Machine Learning
Acting optimally in partially observable stochastic domains
AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Robot shaping: developing autonomous agents through learning
Artificial Intelligence
Learning to act using real-time dynamic programming
Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Temporal difference learning and TD-Gammon
Communications of the ACM
Adding temporary memory to ZCS
Adaptive Behavior
Continual learning in reinforcement environments
Continual learning in reinforcement environments
Feature-based methods for large scale dynamic programming
Machine Learning - Special issue on reinforcement learning
Reinforcement learning with replacing eligibility traces
Machine Learning - Special issue on reinforcement learning
Average reward reinforcement learning: foundations, algorithms, and empirical results
Machine Learning - Special issue on reinforcement learning
Predicting real-time planner performance by domain characterization
Predicting real-time planner performance by domain characterization
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Neural Network Perception for Mobile Robot Guidance
Neural Network Perception for Mobile Robot Guidance
Genetic Algorithms in Search, Optimization and Machine Learning
Genetic Algorithms in Search, Optimization and Machine Learning
Brains, Behavior and Robotics
Finite State Markovian Decision Processes
Finite State Markovian Decision Processes
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Advances in Neural Information Processing Systems 5, [NIPS Conference]
Dynamic Programming
Memory Approaches to Reinforcement Learning in Non-Markovian Domains
Memory Approaches to Reinforcement Learning in Non-Markovian Domains
Temporal credit assignment in reinforcement learning
Temporal credit assignment in reinforcement learning
Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning)
Reinforcement learning with selective perception and hidden state
Reinforcement learning with selective perception and hidden state
Classifier fitness based on accuracy
Evolutionary Computation
On the convergence of stochastic iterative dynamic programming algorithms
Neural Computation
A reinforcement learning approach to job-shop scheduling
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
On the complexity of solving Markov decision problems
UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Rapid, safe, and incremental learning of navigation strategies
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
PAC adaptive control of linear systems
COLT '97 Proceedings of the tenth annual conference on Computational learning theory
Learning agents for uncertain environments (extended abstract)
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Learning situation-dependent costs: improving planning from probabilistic robot execution
AGENTS '98 Proceedings of the second international conference on Autonomous agents
A history-based approach for adaptive robot behavior in dynamic environments
AGENTS '98 Proceedings of the second international conference on Autonomous agents
Iterated phantom induction: a little knowledge can go a long way
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Tree based discretization for continuous state space reinforcement learning
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot
Machine Learning - Special issue on learning in autonomous robots
Machine Learning - Special issue on learning in autonomous robots
Learning from History for Behavior-Based Mobile Robots in Non-Stationary Conditions
Machine Learning - Special issue on learning in autonomous robots
PETEEI: a PET with evolving emotional intelligence
Proceedings of the third annual conference on Autonomous Agents
Team-partitioned, opaque-transition reinforcement learning
Proceedings of the third annual conference on Autonomous Agents
Adaptivity and learning in intelligent real-time systems
Proceedings of the third annual conference on Autonomous Agents
Proceedings of the third annual conference on Autonomous Agents
Nomadic radio: scaleable and contextual notification for wearable audio messaging
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Conjectural Equilibrium in Multiagent Learning
Machine Learning
Machine Learning
Efficient exploration for optimizing immediate reward
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Proceedings of the 27th annual conference on Computer graphics and interactive techniques
Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web
Machine Learning - Special issue on information retrieval
Nomadic radio: speech and audio interaction for contextual messaging in nomadic environments
ACM Transactions on Computer-Human Interaction (TOCHI) - Special issue on human-computer interaction with mobile systems
Using background knowledge to speed reinforcement learning in physical agents
Proceedings of the fifth international conference on Autonomous agents
Reinforcement learning for fuzzy agents: application to a pighouse environment control
New learning paradigms in soft computing
Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network
Neural Processing Letters
Multiagent learning using a variable learning rate
Artificial Intelligence
Designing agent collectives for systems with markovian dynamics
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Controlled animation of video sprites
Proceedings of the 2002 ACM SIGGRAPH/Eurographics symposium on Computer animation
Machine learning and inductive logic programming for multi-agent systems
Mutli-agents systems and applications
Relational reinforcement learning
Mutli-agents systems and applications
Dynamic non-Bayesian decision making in multi-agent systems
Annals of Mathematics and Artificial Intelligence
Efficient and inefficient ant coverage methods
Annals of Mathematics and Artificial Intelligence
Ant colony optimization and stochastic gradient descent
Artificial Life
A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making
Applied Intelligence
Module-Based Reinforcement Learning: Experiments with a Real Robot
Autonomous Robots
Dynamics of a Classical Conditioning Model
Autonomous Robots
Robot Awareness in Cooperative Mobile Robot Learning
Autonomous Robots
Multiagent Systems: A Survey from a Machine Learning Perspective
Autonomous Robots
Acquiring Mobile Robot Behaviors by Learning Trajectory Velocities
Autonomous Robots
Automated Software Engineering
Automating the Construction of Internet Portals with Machine Learning
Information Retrieval
The Effect of Evolution in Artificial Life Learning Behavior
Journal of Intelligent and Robotic Systems
Relational Reinforcement Learning
Machine Learning
Robot learning driven by emotions
Adaptive Behavior
Learning intelligent behavior in a non-stationary and partially observable environment
Artificial Intelligence Review
Actor-critic models of the basal ganglia: new anatomical and computational perspectives
Neural Networks - Computational models of neuromodulation
Neuromodulation of decision and response selection
Neural Networks - Computational models of neuromodulation
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
Neural Processing Letters
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies
Autonomous Agents and Multi-Agent Systems
FLAME—Fuzzy Logic Adaptive Model of Emotions
Autonomous Agents and Multi-Agent Systems
A Topic-Specific Web Robot Model Based on Restless Bandits
IEEE Internet Computing
Sequence Learning: From Recognition and Prediction to Sequential Decision Making
IEEE Intelligent Systems
Optimal control using the transport equation: the Liouville machine
Adaptive Behavior
Evolving neural networks through augmenting topologies
Evolutionary Computation
An Information-Theoretic Approach for the Quantification of Relevance
ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Distributing a Mind on the Internet: The World-Wide-Mind
ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Problem Decomposition for Behavioural Cloning
ECML '00 Proceedings of the 11th European Conference on Machine Learning
ECML '00 Proceedings of the 11th European Conference on Machine Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Learning While Exploring: Bridging the Gaps in the Eligibility Traces
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Analysis and Design of Robot's Behavior: Towards a Methodology
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Learning a Navigation Task in Changing Environments by Multi-task Reinforcement Learning
EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Extraction of Local Structural Features in Images by Using a Multi-scale Relevance Function
MLDM '99 Proceedings of the First International Workshop on Machine Learning and Data Mining in Pattern Recognition
L-VIBRA: Learning the VIBRA Architecture
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Least-Squares Methods in Reinforcement Learning for Control
SETN '02 Proceedings of the Second Hellenic Conference on AI: Methods and Applications of Artificial Intelligence
Learning to Balance Upright Posture: What can be Learnt Using Adaptive NN Models?
WIRN VIETRI 2002 Proceedings of the 13th Italian Workshop on Neural Nets-Revised Papers
Relational Reinforcement Learning
EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Machine Learning and Inductive Logic Programming for Multi-agent Systems
EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Reinforcement Learning for Cooperating and Communicating Reactive Agents in Electrical Power Grids
Balancing Reactivity and Social Deliberation in Multi-Agent Systems, From RoboCup to Real-World Applications (selected papers from the ECAI 2000 Workshop and additional contributions)
Parameterized Logic Programs where Computing Meets Learning
FLOPS '01 Proceedings of the 5th International Symposium on Functional and Logic Programming
Sequential Instance-Based Learning for Planning in the Context of an Imperfect Information Game
ICCBR '01 Proceedings of the 4th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Intraday FX Trading: An Evolutionary Reinforcement Learning Approach
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
An Introduction to Learning Fuzzy Classifier Systems
Learning Classifier Systems, From Foundations to Applications
A Roadmap to the Last Decade of Learning Classifier System Research
Learning Classifier Systems, From Foundations to Applications
Two Dimensional Evaluation Reinforcement Learning
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Comparing the Learning Processes of Cognitive Distance Learning and Search Based Agent
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Market Performance of Adaptive Trading Agents in Synchronous Double Auctions
PRIMA 2001 Proceedings of the 4th Pacific Rim International Workshop on Multi-Agents, Intelligent Agents: Specification, Modeling, and Applications
Game Theory and Artificial Intelligence
Selected papers from the UKMAS Workshop on Foundations and Applications of Multi-Agent Systems
Andhill-98: A RoboCup Team which Reinforces Positioning with Observation
RoboCup-98: Robot Soccer World Cup II
Team-Partitioned, Opaque-Transition Reinforced Learning
RoboCup-98: Robot Soccer World Cup II
From a Concurrent Architecture to a Concurrent Autonomous Agents Architecture
RoboCup-99: Robot Soccer World Cup III
RoboCup 2000: Robot Soccer World Cup IV
On the Relationship between Learning Capability and the Boltzmann-Formula
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Preliminary Results
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Introduction to Sequence Learning
Sequence Learning - Paradigms, Algorithms, and Applications
Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making
Sequence Learning - Paradigms, Algorithms, and Applications
Automatic Segmentation of Sequences through Hierarchical Reinforcement Learning
Sequence Learning - Paradigms, Algorithms, and Applications
Simulating Competing Alife Organisms by Constructive Compound Neural Networks
AI '00 Proceedings of the 13th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Abstraction Methods for Game Theoretic Poker
CG '00 Revised Papers from the Second International Conference on Computers and Games
Logic, Knowledge Representation, and Bayesian Decision Theory
CL '00 Proceedings of the First International Conference on Computational Logic
Faster Near-Optimal Reinforcement Learning: Adding Adaptiveness to the E3 Algorithm
ALT '99 Proceedings of the 10th International Conference on Algorithmic Learning Theory
Feedforward Neural Networks in Reinforcement Learning Applied to High-Dimensional Motor Control
ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
Learning Hierarchical Skills from Observation
DS '02 Proceedings of the 5th International Conference on Discovery Science
Mining Documents for Complex Semantic Relations by the Use of Context Classification
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
To Collect or Not to Collect? Machine Learning for Memory Management
Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
Bounds on Sample Size for Policy Evaluation in Markov Environments
COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Self-Optimizing and Pareto-Optimal Policies in General Environments Based on Bayes-Mixtures
COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
Continual Robot Learning with Constructive Neural Networks
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning
IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Imitation in animals and artifacts
Constructing complex minds through multiple authors
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
A context-based architecture for general problem solving
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Sequential cost-sensitive decision making with reinforcement learning
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of the self-organising map to reinforcement learning
Neural Networks - New developments in self-organizing maps
State abstraction for programmable reinforcement learning agents
Eighteenth national conference on Artificial intelligence
Reinforcement learning of coordination in cooperative multi-agent systems
Eighteenth national conference on Artificial intelligence
The design of collectives of agents to control non-Markovian systems
Eighteenth national conference on Artificial intelligence
Dispersion games: general definitions and some specific learning results
Eighteenth national conference on Artificial intelligence
Soccer strategies that live in the B2B world of negotiation and decision-making
Decision Support Systems
Biologically inspired robot behavior engineering
A general learning approach to visually guided 3D-positioning and pose control of robot arms
Biologically inspired robot behavior engineering
TPCG '03 Proceedings of the Theory and Practice of Computer Graphics 2003
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
The empirical Bayes envelope and regret minimization in competitive Markov decision processes
Mathematics of Operations Research
Using Reinforcement Learning for Similarity Assessment in Case-Based Systems
IEEE Intelligent Systems
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
A reinforcement learning adaptive fuzzy controller for robots
Fuzzy Sets and Systems - Theme: Modeling and control
Autonomous mental development in high dimensional context and action spaces
Neural Networks - 2003 Special issue: Advances in neural networks research IJCNN'03
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
The Journal of Machine Learning Research
Machine Learning for Computer Graphics: A Manifesto and Tutorial
PG '03 Proceedings of the 11th Pacific Conference on Computer Graphics and Applications
Mining Plans for Customer-Class Transformation
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Computer Networks: The International Journal of Computer and Telecommunications Networking
Speedup learning for repair-based search by identifying redundant steps
The Journal of Machine Learning Research
Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
Modeling the adaptive visual system: a survey of principled approaches
Neural Networks - Special issue: Neuroinformatics
A Tabu-Search Hyperheuristic for Timetabling and Rostering
Journal of Heuristics
Fault prognostics using dynamic wavelet neural networks
Artificial Intelligence for Engineering Design, Analysis and Manufacturing
Optimal Ordered Problem Solver
Machine Learning
Learning obstacle avoidance with an operant behavior model
Artificial Life
Stable repeated strategies for information exchange between two autonomous agents
Artificial Intelligence
A Reinforcement Learning Framework for Parameter Control in Computer Vision Applications
CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
A Geometric Approach to Multi-Criterion Reinforcement Learning
The Journal of Machine Learning Research
Self-organized load balancing in proxy servers: algorithms and performance
Journal of Intelligent Information Systems - Special issue on web intelligence
IBM Systems Journal
Transfer of Experience Between Reinforcement Learning Environments with Progressive Difficulty
Artificial Intelligence Review
Utile distinction hidden Markov models
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate
Web Intelligence and Agent Systems
Cross channel optimized marketing by reinforcement learning
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Integrating Guidance into Relational Reinforcement Learning
Machine Learning
Incremental heuristic search in AI
AI Magazine
Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Precomputing avatar behavior from human motion data
SCA '04 Proceedings of the 2004 ACM SIGGRAPH/Eurographics symposium on Computer animation
Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Proceedings of the 35th conference on Winter simulation: driving innovation
Exploitation vs. exploration: choosing a supplier in an environment of incomplete information
Decision Support Systems
Efficient learning equilibrium
Artificial Intelligence
Asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
Guiding queries to information sources with InfoBeacons
Proceedings of the 5th ACM/IFIP/USENIX international conference on Middleware
Learning and Exploiting Relative Weaknesses of Opponent Agents
Autonomous Agents and Multi-Agent Systems
Applying Active Space Principles to Active Classrooms
PERCOMW '05 Proceedings of the Third IEEE International Conference on Pervasive Computing and Communications Workshops
Graphical user interface of an interactive system for schemes design, used in distance learning
CompSysTech '04 Proceedings of the 5th international conference on Computer systems and technologies
Coordinating Multiple Agents via Reinforcement Learning
Autonomous Agents and Multi-Agent Systems
Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection
Genetic Programming and Evolvable Machines
Using Optimal Foraging Models to Evaluate Learned Robotic Foraging Behavior
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Fast multi-level adaptation for interactive autonomous characters
ACM Transactions on Graphics (TOG)
System for foreign exchange trading using genetic algorithms and reinforcement learning
International Journal of Systems Science
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process
Proceedings of the 2005 ACM symposium on Applied computing
Evolving Soccer Keepaway Players Through Task Decomposition
Machine Learning
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Cooperative Multi-Agent Learning: The State of the Art
Autonomous Agents and Multi-Agent Systems
Optimal Control Using the Transport Equation: The Liouville Machine
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Learning Users' Interests by Quality Classification in Market-Based Recommender Systems
IEEE Transactions on Knowledge and Data Engineering
Teaching virtual characters how to use body language
Lecture Notes in Computer Science
Automatic pan-tilt-zoom calibration in the presence of hybrid sensor networks
Proceedings of the third ACM international workshop on Video surveillance & sensor networks
Intelligent exploration method for XCS
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Learning strategies for story comprehension: a reinforcement learning approach
ICML '05 Proceedings of the 22nd international conference on Machine learning
Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A Reinforcement Learning Algorithm in Cooperative Multi-Robot Domains
Journal of Intelligent and Robotic Systems
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games
Autonomous Agents and Multi-Agent Systems
Experiments in socially guided machine learning: understanding how humans teach
Proceedings of the 1st ACM SIGCHI/SIGART conference on Human-robot interaction
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms
Neural Computation
A Reinforcement Learning Approach to Online Clustering
Neural Computation
Reinforcement Learning in Continuous Time and Space
Neural Computation
DIAGAL: An Agent Communication Language Based on Dialogue Games and Sustained by Social Commitments
Autonomous Agents and Multi-Agent Systems
Precomputing avatar behavior from human motion data
Graphical Models - Special issue on SCA 2004
Relational temporal difference learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Dealing with non-stationary environments using context detection
ICML '06 Proceedings of the 23rd international conference on Machine learning
Learning hierarchical task networks by observation
ICML '06 Proceedings of the 23rd international conference on Machine learning
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Improving reinforcement learning with context detection
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
The Knowledge Engineering Review
A reinforcement learning approach to active camera foveation
Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
I-SEE: an intelligent search agent for electronic commerce
International Journal of Electronic Commerce
Approximate Reasoning in MAS: Rough Set Approach
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Proceedings of the 38th conference on Winter simulation
Neural-based downlink scheduling algorithm for broadband wireless networks
Computer Communications
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
A reinforcement learning approach to dynamic resource allocation
Engineering Applications of Artificial Intelligence
Dimensions of complexity of intelligent agents
PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
First-Order Logical Neural Networks
International Journal of Hybrid Intelligent Systems - Recent developments in Hybrid Intelligent Systems
Performance analysis of the AntNet algorithm
Computer Networks: The International Journal of Computer and Telecommunications Networking
The theory and experiments of designing cooperative intelligent systems
Decision Support Systems
Scientific Programming - Distributed Computing and Applications
DEA: An Architecture for Goal Planning and Classification
Neural Computation
Adaptive Behavior in Autonomous Agents
Presence: Teleoperators and Virtual Environments
If multi-agent learning is the answer, what is the question?
Artificial Intelligence
Approximate Reasoning in MAS: Rough Set Approach
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Modeling embodied visual behaviors
ACM Transactions on Applied Perception (TAP)
On developmental mental architectures
Neurocomputing
Reinforcement learning by reward-weighted regression for operational space control
Proceedings of the 24th international conference on Machine learning
Learning and Cooperation in Sequential Games
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Responsive characters from motion fragments
ACM SIGGRAPH 2007 papers
Near-optimal character animation with continuous control
ACM SIGGRAPH 2007 papers
Part III: dynamic texture synthesis
ACM SIGGRAPH 2007 courses
Learning to trade with insider information
Proceedings of the ninth international conference on Electronic commerce
Data acquisition and cost-effective predictive modeling: targeting offers for electronic commerce
Proceedings of the ninth international conference on Electronic commerce
Shaping multi-agent systems with gradient reinforcement learning
Autonomous Agents and Multi-Agent Systems
Metric embedding of view-graphs
Autonomous Robots
Application of reinforcement learning in robot soccer
Engineering Applications of Artificial Intelligence
A reinforcement agent for threshold fusion
Applied Soft Computing
A layered approach to learning coordination knowledge in multiagent environments
Applied Intelligence
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of reinforcement learning to the game of Othello
Computers and Operations Research
IEEE Transactions on Parallel and Distributed Systems
Learning reinforcement strategies for a changing workforce
WBED'07 Proceedings of the sixth conference on IASTED International Conference Web-Based Education - Volume 2
Dynamically learning sources of trust information: experience vs. reputation
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Survey paper: Optimal experimental design and some related control problems
Automatica (Journal of IFAC)
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Artificial Intelligence
Learning to Control in Operational Space
International Journal of Robotics Research
Wireless Personal Communications: An International Journal
Optimizing time warp simulation with reinforcement learning techniques
Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come
Accelerating autonomous learning by using heuristic selection of actions
Journal of Heuristics
Dynamic learning of action patterns for object acquisition
International Journal of Intelligent Systems Technologies and Applications
A cooperative learning approach to Mixed Performance Controller design: a behavioural viewpoint
International Journal of Intelligent Systems Technologies and Applications
Artificial Intelligence techniques: An introduction to their use for modelling environmental systems
Mathematics and Computers in Simulation
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
International Journal of Knowledge-based and Intelligent Engineering Systems
Investigation of Q-learning in the context of a virtual learning environment
Informatics in education
A node discovery service for partially mobile sensor networks
Proceedings of the 2nd international workshop on Middleware for sensor networks
Water reservoir control under economic, social and environmental constraints
Automatica (Journal of IFAC)
Service oriented architecture for financial customer relationship management
Proceedings of the second international conference on Distributed event-based systems
Accelerating neuroevolutionary methods using a Kalman filter
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Cooperative content distribution and traffic engineering
Proceedings of the 3rd international workshop on Economics of networked systems
Data Mining and Knowledge Discovery
Reinforcement learning for problems with symmetrical restricted states
Robotics and Autonomous Systems
Ensemble clustering with voting active clusters
Pattern Recognition Letters
Adaptive hybrid control for noise rejection
NN'08 Proceedings of the 9th WSEAS International Conference on Neural Networks
Optimization of Handover Parameters for Traffic Sharing in GERAN
Wireless Personal Communications: An International Journal
Incremental Learning of Planning Operators in Stochastic Domains
SOFSEM '07 Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science
RoboCup 2006: Robot Soccer World Cup X
Fuzzy Q-Map Algorithm for Reinforcement Learning
Computational Intelligence and Security
Reinforcement Learning Reward Functions for Unsupervised Learning
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
A Kernel-Based Reinforcement Learning Approach to Dynamic Behavior Modeling of Intrusion Detection
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
A Novel Method of Constructing ANN
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Towards the Automatic Learning of Reflex Modulation for Mobile Robot Navigation
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Multi-agent Learning Dynamics: A Survey
CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
Flexible Control Mechanism for Multi-DOF Robotic Arm Based on Biological Fluctuation
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Efficient Node Discovery in Mobile Wireless Sensor Networks
DCOSS '08 Proceedings of the 4th IEEE international conference on Distributed Computing in Sensor Systems
Epoch-Incremental Queue-Dyna Algorithm
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Rational Bidding Using Reinforcement Learning
GECON '08 Proceedings of the 5th international workshop on Grid Economics and Business Models
State-Dependent Exploration for Policy Gradient Methods
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Some Progress of Supervised Learning
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
KI '08 Proceedings of the 31st annual German conference on Advances in Artificial Intelligence
A Logical Framework to Reinforcement Learning Using Hybrid Probabilistic Logic Programs
SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Engineering Applications of Artificial Intelligence
Towards adaptive programming: integrating reinforcement learning into a programming language
Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
A probabilistic model of integration
Decision Support Systems
DESIGN OF AN AEROSPACE LAUNCH VEHICLE AUTOPILOT BASED ON OPTIMIZED EMOTIONAL LEARNING ALGORITHM
Cybernetics and Systems
REINFORCEMENT LEARNING FOR POMDP USING STATE CLASSIFICATION
Applied Artificial Intelligence
Meta-case-based reasoning: self-improvement through self-understanding
Journal of Experimental & Theoretical Artificial Intelligence
Maintaining dynamic channel profiles on the web
Proceedings of the VLDB Endowment
Design and analysis of GA based neural/fuzzy optimum adaptive control
WSEAS Transactions on Systems and Control
Q-Strategy: A Bidding Strategy for Market-Based Allocation of Grid Services
OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
An Information-Theoretic Class of Stochastic Decision Processes
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Effects of chaotic exploration on reinforcement learning in target capturing task
International Journal of Knowledge-based and Intelligent Engineering Systems
Route optimization with Q-learning
ACS'08 Proceedings of the 8th conference on Applied computer scince
Improving the Exploration Strategy in Bandit Algorithms
Learning and Intelligent Optimization
Action-Based Environment Modeling for Maintaining Trust
Trust in Agent Societies
Simulation of sequential data: An enhanced reinforcement learning approach
Expert Systems with Applications: An International Journal
Sequential optimal design of neurophysiology experiments
Neural Computation
How people talk when teaching a robot
Proceedings of the 4th ACM/IEEE international conference on Human robot interaction
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Simulation-optimization using a reinforcement learning approach
Proceedings of the 40th Conference on Winter Simulation
Dialogue games that agents play within a society
Artificial Intelligence
Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing
A reinforcement learning framework for utility-based scheduling in resource-constrained systems
Future Generation Computer Systems
A policy-based framework for autonomic reconfiguration management in heterogeneous networks
Proceedings of the 7th International Conference on Mobile and Ubiquitous Multimedia
SO-antnet for improving load sharing in MANET
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
Designing autonomous layered video coders
Image Communication
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multiagent learning in large anonymous games
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings
Similarity-Based Clustering
Reordering Sparsification of Kernel Machines in Approximate Policy Iteration
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Opponent Modeling in Adversarial Environments through Learning Ingenuity
Proceedings of the 2005 conference on Self-Organization and Autonomic Informatics (I)
Cognitive learning with automatic goal acquisition
Proceedings of the 2006 conference on STAIRS 2006: Proceedings of the Third Starting AI Researchers' Symposium
Cognitive Architectures: Where do we go from here?
Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Transfer Learning and Intelligence: an Argument and Approach
Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Learning by Automatic Option Discovery from Conditionally Terminating Sequences
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Partially Observable Markov Decision Process Approximations for Adaptive Sensing
Discrete Event Dynamic Systems
Neuroevolutionary reinforcement learning for generalized helicopter control
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
A cooperative and self-adaptive metaheuristic for the facility location problem
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
From Continuous Behaviour to Discrete Knowledge
IWANN '03 Proceedings of the 7th International Work-Conference on Artificial and Natural Neural Networks: Part II: Artificial Neural Nets Problem Solving Methods
Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning*
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Anticipatory Behavior in Adaptive Learning Systems
Toward Rough-Granular Computing
RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Robotic Target Tracking with Approximation Space-Based Feedback During Reinforcement Learning
RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
An Efficient and Adaptive Mechanism for Parallel Simulation Replication
PADS '09 Proceedings of the 2009 ACM/IEEE/SCS 23rd Workshop on Principles of Advanced and Distributed Simulation
A q-learning based adaptive bidding strategy in combinatorial auctions
Proceedings of the 11th International Conference on Electronic Commerce
Hand grip pattern recognition for mobile user interfaces
IAAI'06 Proceedings of the 18th conference on Innovative applications of artificial intelligence - Volume 2
QUICR-learning for multi-agent coordination
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
RL-CD: dealing with non-stationarity in reinforcement learning
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Parallel Algorithms for Solving Markov Decision Process
ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
Optimal efficient learning equilibrium: imperfect monitoring in symmetric games
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Adaptive treatment of epilepsy via batch-mode reinforcement learning
IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
Unknown rewards in finite-horizon domains
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Evaluation of a hierarchical reinforcement learning spoken dialogue system
Computer Speech and Language
Ants and reinforcement learning: a case study in routing in dynamic networks
IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Optimizing dialogue management with reinforcement learning: experiments with the NJFun system
Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods
Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox
Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation
Journal of Artificial Intelligence Research
PHA*: finding the shortest path with A* in an unknown physical environment
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
Closed-loop learning of visual control policies
Journal of Artificial Intelligence Research
A formal framework for speedup learning from problems and solutions
Journal of Artificial Intelligence Research
Dynamic non-Bayesian decision making
Journal of Artificial Intelligence Research
AntNet: distributed stigmergetic control for communications networks
Journal of Artificial Intelligence Research
A machine learning approach to building domain-specific search engines
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Bounding the suboptimality of reusing subproblems
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Heuristic selection of actions in multiagent reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
State similarity based approach for improving performance in RL
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Analogical learning in a turn-based strategy game
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning to count by think aloud imitation
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Direct code access in self-organizing neural networks for reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning strategies for open-domain natural language question answering
ACLstudent '05 Proceedings of the ACL Student Research Workshop
Adapting Reinforcement Learning for Trust: Effective Modeling in Dynamic Environments
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Real-time planning for parameterized human motion
Proceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques
Robotics and Autonomous Systems
Reinforcement learning in distributed domains: beyond team games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Exploiting multiple secondary reinforcers in policy gradient reinforcement learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning strategies for open-domain natural language question answering
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Utility-based on-line exploration for repeated navigation in an embedded graph
Artificial Intelligence
Reinforcement learning versus model predictive control: a comparison on a power system problem
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A Q-learning approach to derive optimal consumption and investment strategies
IEEE Transactions on Neural Networks
IEEE Transactions on Neural Networks
Knowledge-based recurrent neural networks in Reinforcement Learning
ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
Speeding up reinforcement learning using recurrent neural networks in non-Markovian environments
ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
An agent structure for evaluating micro-level MAS performance
PerMIS '07 Proceedings of the 2007 Workshop on Performance Metrics for Intelligent Systems
An RL-based scheduling algorithm for video traffic in high-rate wireless personal area networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Agent Architectures for Compliance
ESAW '09 Proceedings of the 10th International Workshop on Engineering Societies in the Agents World X
Reinforcement Learning Based Web Service Compositions for Mobile Business
WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning approach to dynamic resource allocation
A reinforcement learning approach to dynamic resource allocation
Two-step recommendation based personalization for future services
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Nash Q-learning multi-agent flow control for high-speed networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
Coordinated multiple ramps metering based on neuro-fuzzy adaptive dynamic programming
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive state space partitioning for reinforcement learning
Engineering Applications of Artificial Intelligence
An enhanced reinforcement routing protocol for inter-vehicular unicast application
EuroIMSA '08 Proceedings of the IASTED International Conference on Internet and Multimedia Systems and Applications
Probabilistic fuzzy logic system: a tool to process stochastic and imprecise information
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Introducing a round robin tournament into Blondie24
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Intercluster connection in cognitive wireless mesh networks based on intelligent network coding
EURASIP Journal on Advances in Signal Processing - Special issue on dynamic spectrum access for wireless networking
Online layered learning for cross-layer optimization of dynamic multimedia systems
MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A probabilistic fuzzy logic system: learning in the stochastic environment with incomplete dynamics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Real-valued Q-learning in multi-agent cooperation
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Cooperative multi-robot reinforcement learning: a framework in hybrid state space
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
A Recursive Classifier System for Partially Observable Environments
Fundamenta Informaticae
A framework for the design of a military operational supply network
CISDA'09 Proceedings of the Second IEEE international conference on Computational intelligence for security and defense applications
Online reinforcement learning for dynamic multimedia systems
IEEE Transactions on Image Processing
On agents and grids: Creating the fabric for a new generation of distributed intelligent systems
Web Semantics: Science, Services and Agents on the World Wide Web
Induction over Strategic Agents
Information Systems Research
Fuzzy decision tree function approximation in reinforcement learning
International Journal of Artificial Intelligence and Soft Computing
Transfer Learning for Reinforcement Learning Domains: A Survey
The Journal of Machine Learning Research
RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments
The Journal of Machine Learning Research
Convergence results for ant routing algorithms viastochastic approximation
Proceedings of the 13th ACM international conference on Hybrid systems: computation and control
Tournament selection: stable fitness pressure in XCS
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
A hybrid architecture combining reactive plan execution and reactive learning
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Knowledge discovery and emergent complexity in bioinformatics
KDECB'06 Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics
Reinforcement learning for online control of evolutionary algorithms
ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems
A theory of profit sharing in dynamic environment
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
A region selecting method which performs observation and action in the multi-resolution environment
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Constructing an autonomous agent with an interdependent heuristics
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
EvoWorkshops'03 Proceedings of the 2003 international conference on Applications of evolutionary computing
Learning in groups of traffic signals
Engineering Applications of Artificial Intelligence
Heuristic search based exploration in reinforcement learning
IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Reinforcement learning scheme for grouping and anti-predator behavior
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
MBEANN: mutation-based evolving artificial neural networks
ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Stochastic weights reinforcement learning for exploratory data analysis
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Clustering with reinforcement learning
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Independent factor reinforcement learning for portfolio management
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Kernel-based online NEAT for keepaway soccer
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Two-layer networked learning control of a nonlinear HVAC system
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Toward perception based computing: a rough-granular perspective
WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
DS'07 Proceedings of the 10th international conference on Discovery science
Market-based hierarchical resource management using machine learning
DSOM'07 Proceedings of the Distributed systems: operations and management 18th IFIP/IEEE international conference on Managing virtualization of networks and services
A novel design of hidden web crawler using reinforcement learning based agents
APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
A self-optimized job scheduler for heterogeneous server clusters
JSSPP'07 Proceedings of the 13th international conference on Job scheduling strategies for parallel processing
A new Q-learning with generalized approximation spaces
ICNC'09 Proceedings of the 5th international conference on Natural computation
A state-cluster based Q-learning
ICNC'09 Proceedings of the 5th international conference on Natural computation
Reinforcement learning approaches to coordination in cooperative multi-agent systems
Adaptive agents and multi-agent systems
ONDUX: on-demand unsupervised learning for information extraction
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
On decentralized self-adaptation: lessons from the trenches and challenges for the future
Proceedings of the 2010 ICSE Workshop on Software Engineering for Adaptive and Self-Managing Systems
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Cooperative communications with relay selection for QoS provisioning in wireless sensor networks
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
A policy-based sensor selection system with goal oriented singular value decomposition technique
POLICY'09 Proceedings of the 10th IEEE international conference on Policies for distributed systems and networks
Improving optimistic exploration in model-free reinforcement learning
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Probabilistic Policy Reuse for inter-task transfer learning
Robotics and Autonomous Systems
Joint path and wavelength selection using Q-learning in optical burst switching networks
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Monotonicity of constrained optimal transmission policies in correlated fading channels with ARQ
IEEE Transactions on Signal Processing
A general framework to detect unsafe system states from multisensor data stream
IEEE Transactions on Intelligent Transportation Systems
Optimizing debt collections using constrained reinforcement learning
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Using spatial hints to improve policy reuse in a reinforcement learning agent
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action selection and task sequence learning for hybrid dynamical cognitive agents
Robotics and Autonomous Systems
Learning hybridization strategies in evolutionary algorithms
Intelligent Data Analysis
MRL-CC: a novel cooperative communication protocol for QoS provisioning in wireless sensor networks
International Journal of Sensor Networks
Intelligent negotiation behaviour model for an open railway access market
Expert Systems with Applications: An International Journal
Autonomous decision making in layered and reconfigurable video coders
Asilomar'09 Proceedings of the 43rd Asilomar conference on Signals, systems and computers
The Dynamics of Multi-Agent Reinforcement Learning
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
On the scalability and dynamic load-balancing of optimistic gate level simulation
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Reinforcement learning with time
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
A robust and fast action selection mechanism for planning
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Evolution and sustainability of a wildlife monitoring sensor network
Proceedings of the 8th ACM Conference on Embedded Networked Sensor Systems
The neuronal replicator hypothesis
Neural Computation
A study of Q-learning considering negative rewards
Artificial Life and Robotics
An autonomic testing framework for IPv6 configuration protocols
AIMS'10 Proceedings of the Mechanisms for autonomous management of networks and services, and 4th international conference on Autonomous infrastructure, management and security
Time-based reward shaping in real-time strategy games
ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
AdQL - anomaly detection Q-learning in control multi-queue systems with QoS constraints
KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Why and how hippocampal transition cells can be used in reinforcement learning
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Reinforcement learning scheme for grouping and characterization of multi-agent network
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Extending context spaces theory by proactive adaptation
ruSMART/NEW2AN'10 Proceedings of the Third conference on Smart Spaces and next generation wired, and 10th international conference on Wireless networking
Generating adaptive route instructions using hierarchical reinforcement learning
SC'10 Proceedings of the 7th international conference on Spatial cognition
Evolving a single scalable controller for an octopus arm with a variable number of segments
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Part-based feature synthesis for human detection
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Evolutionary dynamics of regret minimization
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Emotion and reinforcement: affective facial expressions facilitate robot learning
ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
Self-learning fuzzy logic controllers for pursuit-evasion differential games
Robotics and Autonomous Systems
Generalized learning automata for multi-agent reinforcement learning
AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Solving multi-stage games with hierarchical learning automata that bootstrap
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Finding optimum route of electrical energy transmission line using multi-criteria with Q-learning
Expert Systems with Applications: An International Journal
Autonomous discovery of subgoals using acyclic state trajectories
ICICA'10 Proceedings of the First international conference on Information computing and applications
EURASIP Journal on Wireless Communications and Networking
Adaptation-based programming in java
Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
Structural knowledge transfer by spatial abstraction for reinforcement learning agents
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Supplier behavior modeling and winner determination using parallel MDP
Expert Systems with Applications: An International Journal
The inverse classification problem
Journal of Computer Science and Technology
Accelerating point-based POMDP algorithms via greedy strategies
SIMPAR'10 Proceedings of the Second international conference on Simulation, modeling, and programming for autonomous robots
Reduct based Q-learning: an introduction
Proceedings of the 2011 International Conference on Communication, Computing & Security
Extended spatial and temporal learning scale in reinforcement learning
CIMMACS '10 Proceedings of the 9th WSEAS international conference on computational intelligence, man-machine systems and cybernetics
A hybrid agent architecture integrating desire, intention and reinforcement learning
Expert Systems with Applications: An International Journal
Multiobjective reinforcement learning for traffic signal control using vehicular ad hoc network
EURASIP Journal on Advances in Signal Processing - Special title on vehicular ad hoc networks
Approximate dynamic programming for an inventory problem: Empirical comparison
Computers and Industrial Engineering
Wireless Personal Communications: An International Journal
Spatially-aware dialogue control using hierarchical reinforcement learning
ACM Transactions on Speech and Language Processing (TSLP)
Path selection in disaster response management based on Q-learning
International Journal of Automation and Computing
A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems
Neural Processing Letters
User Modeling and User-Adapted Interaction
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds
ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
TELE-INFO'06 Proceedings of the 5th WSEAS international conference on Telecommunications and informatics
Supporting smart interactions with predictive analytics
The smart internet
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Knowledge of opposite actions for reinforcement learning
Applied Soft Computing
Dual memory model for using pre-existing knowledge in reinforcement learning tasks
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Stochastic processes for return maximization in reinforcement learning
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Completely self-referential optimal reinforcement learners
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Supporting smart interactions with predictive analytics
The smart internet
Software agent with reinforcement learning approach for medical image segmentation
Journal of Computer Science and Technology
Data Collection in Wireless Sensor Networks with Mobile Elements: A Survey
ACM Transactions on Sensor Networks (TOSN)
A dynamic route change mechanism for mobile ad hoc networks
International Journal of Communication Networks and Distributed Systems
Optimization of heuristic search using recursive algorithm selection and reinforcement learning
Annals of Mathematics and Artificial Intelligence
Reinforcement learning techniques for the control of wastewater treatment plants
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Journal of Computational Methods in Sciences and Engineering - Intelligent Systems and Knowledge Management (Part II)
Multiagent learning in large anonymous games
Journal of Artificial Intelligence Research
Exploiting Best-Match Equations for Efficient Reinforcement Learning
The Journal of Machine Learning Research
Collaborative learning in uncertain environments
CARE@AI'09/CARE@IAT'10 Proceedings of the CARE@AI 2009 and CARE@IAT 2010 international conference on Collaborative agents - research and development
User to user QoE routing system
WWIC'11 Proceedings of the 9th IFIP TC 6 international conference on Wired/wireless internet communications
ECSQARU'11 Proceedings of the 11th European conference on Symbolic and quantitative approaches to reasoning with uncertainty
Transactions on computational science XII
MobiCom '11 Proceedings of the 17th annual international conference on Mobile computing and networking
Heliza: talking dirty to the attackers
Journal in Computer Virology
SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
On the power of global reward signals in reinforcement learning
MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
A composite self-organisation mechanism in an agent network
WISE'11 Proceedings of the 12th international conference on Web information system engineering
Expert Systems with Applications: An International Journal
Model based Bayesian exploration
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning finite-state controllers for partially observable environments
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning to cooperate via policy search
UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
The apriori stochastic dependency detection (ASDD) algorithm for learning stochastic logic rules
CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
A multi-agent fuzzy-reinforcement learning method for continuous domains
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
An adaptive approach for the exploration-exploitation dilemma for learning agents
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
General discounting versus average reward
ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Teachable characters: user studies, design principles, and learning performance
IVA'06 Proceedings of the 6th international conference on Intelligent Virtual Agents
Learning in one-shot strategic form games
ECML'06 Proceedings of the 17th European conference on Machine Learning
A sparse kernel-based least-squares temporal difference algorithm for reinforcement learning
ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part I
Unique state and automatical action abstracting based on logical MDPs with negation
ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II
Testing probabilistic equivalence through reinforcement learning
FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Network-Adaptive qos routing using local information
APNOMS'06 Proceedings of the 9th Asia-Pacific international conference on Network Operations and Management: management of Convergence Networks and Services
Cognitive agents for sense and respond logistics
DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Opponent learning for multi-agent system simulation
RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Efficient gradient estimation for motor control learning
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Policy-contingent abstraction for robust robot control
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
A machine learning approach to intraday trading on foreign exchange markets
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
ARKAQ-learning: autonomous state space segmentation and policy generation
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Institutionalization through reciprocal habitualization and typification
WRAC'05 Proceedings of the Second international conference on Radical Agent Concepts: innovative Concepts for Autonomic and Agent-Based Systems
Fast reinforcement learning of dialogue policies using stable function approximation
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
URL: A unified reinforcement learning approach for autonomic cloud management
Journal of Parallel and Distributed Computing
Analysis and improvement of policy gradient estimation
Neural Networks
A hybrid learning strategy for discovery of policies of action
IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
Grey reinforcement learning for incomplete information processing
TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Q-Learning with FCMAC in multi-agent cooperation
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Multiagent reinforcement learning for a planetary exploration multirobot system
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Multiagent model for grid computing
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Feature extraction for decision-theoretic planning in partially observable environments
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Teamwork and simulation in hybrid cognitive architecture
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Mapping web usage patterns to MDP model and mining with reinforcement learning
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Machine learning of plan robustness knowledge about instances
ECML'05 Proceedings of the 16th European conference on Machine Learning
Adaptive modeling: an approach and a method for implementing adaptive agents
MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
Agent based decision support system using reinforcement learning under emergency circumstances
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
Multiagent association rules mining in cooperative learning systems
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Evaluating the effectiveness of exploration and accumulated experience in automatic case elicitation
ICCBR'05 Proceedings of the 6th international conference on Case-Based Reasoning Research and Development
A reinforcement learning approach for host-based intrusion detection using sequences of system calls
ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Multi-Agent cooperative reinforcement learning in 3d virtual world
ICSI'10 Proceedings of the First international conference on Advances in Swarm Intelligence - Volume Part I
Reinforcement learning by chaotic exploration generator in target capturing task
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Heuristic rule induction for decision making in near-deterministic domains
SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Towards intelligent management of a student's time
SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Learning multi-modal control programs
HSCC'05 Proceedings of the 8th international conference on Hybrid Systems: computation and control
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Building Intelligent Interactive Tutors: Student-centered strategies for revolutionizing e-learning
Market-based recommender systems: learning users' interests by quality classification
AOIS'04 Proceedings of the 6th international conference on Agent-Oriented Information Systems II
Modeling the brain's operating system
BVAI'05 Proceedings of the First international conference on Brain, Vision, and Artificial Intelligence
Learning-Based spectrum selection in cognitive radio ad hoc networks
WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Emergence of flocking behavior based on reinforcement learning
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Emergent consensus in decentralised systems using collaborative reinforcement learning
Self-star Properties in Complex Information Systems
Rough sets and vague concept approximation: from sample approximation to adaptive learning
Transactions on Rough Sets V
Intelligent Social Media Indexing and Sharing Using an Adaptive Indexing Search Engine
ACM Transactions on Intelligent Systems and Technology (TIST)
Trace equivalence characterization through reinforcement learning
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Coordinating learning agents for multiple resource job scheduling
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
A user trust-based collaborative filtering recommendation algorithm
ICICS'09 Proceedings of the 11th international conference on Information and Communications Security
Effectiveness of considering state similarity for reinforcement learning
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Efficient deep web crawling using reinforcement learning
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
HYPERION: a recursive hyper-heuristic framework
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
An overview of cooperative and competitive multiagent learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Expert Systems with Applications: An International Journal
ICIRA'11 Proceedings of the 4th international conference on Intelligent Robotics and Applications - Volume Part I
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-agent reinforcement learning for simulating pedestrian navigation
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
On modeling the affective effect on learning
MIWAI'11 Proceedings of the 5th international conference on Multi-Disciplinary Trends in Artificial Intelligence
Computational Intelligence
WILDSENSING: Design and deployment of a sustainable sensor network for wildlife monitoring
ACM Transactions on Sensor Networks (TOSN)
Conflict resolution and learning probability matching in a neural cell-assembly architecture
Cognitive Systems Research
Psychological models of human and optimal performance in bandit problems
Cognitive Systems Research
Value-function reinforcement learning in Markov games
Cognitive Systems Research
When do differences matter? On-line feature extraction through cognitive economy
Cognitive Systems Research
Self-organization in an agent network: A mechanism and a potential application
Decision Support Systems
Hierarchical task decomposition through symbiosis in reinforcement learning
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Sample aware embedded feature selection for reinforcement learning
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Value function approximation through sparse bayesian modeling
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Dynamic channel selection with reinforcement learning for cognitive WLAN over fiber
International Journal of Communication Systems
A novel feature sparsification method for kernel-based approximate policy iteration
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A rapid sparsification method for kernel machines in approximate policy iteration
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A Recursive Classifier System for Partially Observable Environments
Fundamenta Informaticae
A New Architecture for Learning Classifier Systems to Solve POMDP Problems
Fundamenta Informaticae
An online kernel-based clustering approach for value function approximation
SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
Automatica (Journal of IFAC)
Interactive Character Animation Using Simulated Physics: A State-of-the-Art Review
Computer Graphics Forum
An improved choice function heuristic selection for cross domain heuristic search
PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Safe robot learning by energy limitation
ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
Learning policies for battery usage optimization in electric vehicles
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Planning interactive task for intelligent characters
Computer Animation and Virtual Worlds
Multi-agent task division learning in hide-and-seek games
AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Learning motion controllers with adaptive depth perception
EUROSCA'12 Proceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation
Learning motion controllers with adaptive depth perception
Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Building ubiquitous computing applications using the VERSAG adaptive agent framework
Journal of Systems and Software
Continuous strategy replicator dynamics for multi-agent Q-learning
Autonomous Agents and Multi-Agent Systems
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Observer effect from stateful resources in agent sensing
Autonomous Agents and Multi-Agent Systems
Adaptive value function approximation for continuous-state stochastic dynamic programming
Computers and Operations Research
Expert Systems with Applications: An International Journal
Scheduling fighter aircraft maintenance with reinforcement learning
Proceedings of the Winter Simulation Conference
Learning classifier system with average reward reinforcement learning
Knowledge-Based Systems
International Journal of Applied Metaheuristic Computing
Enhancing the Adaptation of BDI Agents Using Learning Techniques
International Journal of Agent Technologies and Systems
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management
International Journal of Agent Technologies and Systems
Simulating Cooperative Behaviors in Dynamic Networks
International Journal of Agent Technologies and Systems
Safe exploration of state and action spaces in reinforcement learning
Journal of Artificial Intelligence Research
Convergence results for ant routing algorithms via stochastic approximation
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A state-dependent time evolving multi-constraint routing algorithm
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Non-reciprocating Sharing Methods in Cooperative Q-Learning Environments
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Abstraction in Model Based Partially Observable Reinforcement Learning Using Extended Sequence Trees
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Expert Systems with Applications: An International Journal
GESwarm: grammatical evolution for the automatic synthesis of collective behaviors in swarm robotics
Proceedings of the 15th annual conference on Genetic and evolutionary computation
Neuroevolution results in emergence of short-term memory in multi-goal environment
Proceedings of the 15th annual conference on Genetic and evolutionary computation
Adding memory condition to learning classifier systems to solve partially observable environments
International Journal of Computer Applications in Technology
Information Systems
Distributed dynamic data driven prediction based on reinforcement learning approach
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Generation of tests for programming challenge tasks using multi-objective optimization
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Testing probabilistic equivalence through Reinforcement Learning
Information and Computation
Utilizing query change for session search
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
INFORM: a dynamic interest forwarding mechanism for information centric networking
Proceedings of the 3rd ACM SIGCOMM workshop on Information-centric networking
Machine learning for interactive systems and robots: a brief introduction
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Novel method for using Q-learning in small microcontrollers
Proceedings of the 51st ACM Southeast Conference
Real-time structured light coding for adaptive patterns
Journal of Real-Time Image Processing
Exploration in relational domains for model-based reinforcement learning
The Journal of Machine Learning Research
Towards minimizing the annotation cost of certified text classification
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Exploratory and interactive daily deals recommendation
Proceedings of the 7th ACM conference on Recommender systems
Assessing the appropriateness of using markov decision processes for RF spectrum management
Proceedings of the 16th ACM international conference on Modeling, analysis & simulation of wireless and mobile systems
SLEDGE: Sequential Labeling of Image Edges for Boundary Detection
International Journal of Computer Vision
A novel reinforcement learning architecture for continuous state and action spaces
Advances in Artificial Intelligence
Efficient batch processing of proximity queries by optimized probing
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Backward Q-learning: The combination of Sarsa algorithm and Q-learning
Engineering Applications of Artificial Intelligence
Reinforcement learning in robotics: A survey
International Journal of Robotics Research
Analysis of cross-price effects on markdown policies by using function approximation techniques
Knowledge-Based Systems
Efficient learning in linearly solvable MDP models
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Reinforcement learning models for scheduling in wireless networks
Frontiers of Computer Science: Selected Publications from Chinese Universities
The use of partially converged simulations in building surrogate models
Advances in Engineering Software
Reinforcement learning based routing in wireless mesh networks
Wireless Networks
A comparative evaluation of multi-objective exploration algorithms for high-level design
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Review: A survey on intelligent routing protocols in wireless sensor networks
Journal of Network and Computer Applications
Construction of approximation spaces for reinforcement learning
The Journal of Machine Learning Research
Hybrid motion graph for character motion synthesis
Journal of Visual Languages and Computing
Review: Cloud computing service composition: A systematic literature review
Expert Systems with Applications: An International Journal
Autonomous Robots
Active Rare Class Discovery and Classification Using Dirichlet Processes
International Journal of Computer Vision
Wireless Personal Communications: An International Journal
MineralMiner: An active sensing simulation environment
Multiagent and Grid Systems
A game theoretic approach to swarm robotics
Applied Bionics and Biomechanics
A multi-agent control architecture for a robotic wheelchair
Applied Bionics and Biomechanics
Hi-index | 0.02 |
This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word "reinforcement." The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.