Proceedings of the third annual conference on Autonomous Agents
Elevator Group Control Using Multiple Reinforcement Learning Agents
Machine Learning
Reinforcement learning and mistake bounded algorithms
COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
Convergence analysis of temporal-difference learning algorithms with linear function approximation
COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
Efficient exploration for optimizing immediate reward
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Learning state features from policies to bias exploration in reinforcement learning
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
A cerebellar model of timing and prediction in the control of reaching
Neural Computation
Toward a Model of Intelligence as an Economy of Agents
Machine Learning
A reinforcement learning agent for personalized information filtering
Proceedings of the 5th international conference on Intelligent user interfaces
Eddies: continuously adaptive query processing
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Congestion-dependent pricing of network services
IEEE/ACM Transactions on Networking (TON)
Automated strategy searches in an electronic goods market: learning and complex price schedules
Proceedings of the 1st ACM conference on Electronic commerce
Learning user's preferences by analyzing Web-browsing behaviors
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Adaptivity in agent-based routing for data networks
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Ant algorithms for discrete optimization
Artificial Life
Learning to Play Chess Using Temporal Differences
Machine Learning
Relevance and reinforcement in interactive browsing
Proceedings of the ninth international conference on Information and knowledge management
On verifying game designs and playing strategies using reinforcement learning
Proceedings of the 2001 ACM symposium on Applied computing
On-line analysis of the TCP acknowledgment delay problem
Journal of the ACM (JACM)
Hierarchical multi-agent reinforcement learning
Proceedings of the fifth international conference on Autonomous agents
An architecture for action selection in robotic soccer
Proceedings of the fifth international conference on Autonomous agents
A social reinforcement learning agent
Proceedings of the fifth international conference on Autonomous agents
A reinforcement learning model of selective visual attention
Proceedings of the fifth international conference on Autonomous agents
Pricing information bundles in a dynamic environment
Proceedings of the 3rd ACM conference on Electronic Commerce
ACM Computing Surveys (CSUR)
Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network
Neural Processing Letters
Programming backgammon using self-teaching neural nets
Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Adaptive dynamic scene analysis
Imaging and vision systems
Multiagent learning using a variable learning rate
Artificial Intelligence
Learning classifier systems: a complete introduction, review, and roadmap
Journal of Artificial Evolution and Applications
Robustness of reputation-based trust: boolean case
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Learning sequences of actions in collectives of autonomous agents
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Strategic sequential bidding in auctions using dynamic programming
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
Designing agent collectives for systems with markovian dynamics
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to weigh basic behaviors in scalable agents
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Integrated learning for interactive synthetic characters
Proceedings of the 29th annual conference on Computer graphics and interactive techniques
Agents' advanced features for negotiation and coordination
Mutli-agents systems and applications
Relational reinforcement learning
Mutli-agents systems and applications
Adaptive mirroring of system of systems architectures
WOSS '02 Proceedings of the first workshop on Self-healing systems
Learning Sequences of Compatible Actions Among Agents
Artificial Intelligence Review
The Brain-Like Sensorimotor Control System
Journal of Intelligent and Robotic Systems
Efficient and inefficient ant coverage methods
Annals of Mathematics and Artificial Intelligence
Ant colony optimization and stochastic gradient descent
Artificial Life
Planning and Control in Artificial Intelligence: A Unifying Perspective
Applied Intelligence
Rapid Concept Learning for Mobile Robots
Autonomous Robots
Dynamics of a Classical Conditioning Model
Autonomous Robots
Target Reaching by Using Visual Information and Q-learning Controllers
Autonomous Robots
Certain Principles of Biomorphic Robots
Autonomous Robots
Making Organizational Learning Operational: Implications from Learning Classifier Systems
Computational & Mathematical Organization Theory
Reinforced Genetic Programming
Genetic Programming and Evolvable Machines
Rollout Algorithms for Combinatorial Optimization
Journal of Heuristics
Finite-time Analysis of the Multiarmed Bandit Problem
Machine Learning
Machine Learning
Near-Optimal Reinforcement Learning in Polynomial Time
Machine Learning
Technical Update: Least-Squares Temporal Difference Learning
Machine Learning
Machine Learning
Structure in the Space of Value Functions
Machine Learning
Classifiers that approximate functions
Natural Computing: an international journal
Robot learning driven by emotions
Adaptive Behavior
A perspective view and survey of meta-learning
Artificial Intelligence Review
Learning intelligent behavior in a non-stationary and partially observable environment
Artificial Intelligence Review
Reinforcement Learning Rules in a Repeated Game
Computational Economics
Metalearning and neuromodulation
Neural Networks - Computational models of neuromodulation
TD Models of reward predictive responses in dopamine neurons
Neural Networks - Computational models of neuromodulation
Dopamine: generalization and bonuses
Neural Networks - Computational models of neuromodulation
Opponent interactions between serotonin and dopamine
Neural Networks - Computational models of neuromodulation
Control of exploitation-exploration meta-parameter in reinforcement learning
Neural Networks - Computational models of neuromodulation
Neuromodulation, theta rhythm and rat spatial navigation
Neural Networks - Computational models of neuromodulation
The anticipatory classifier system and genetic generalization
Natural Computing: an international journal
Neural computing increases robot adaptivity
Natural Computing: an international journal
Relative Loss Bounds for Temporal-Difference Learning
Machine Learning
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning
Discrete Event Dynamic Systems
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
Robots With Humanoid Features in Public Places: A Case Study
IEEE Intelligent Systems
Tracing Patterns and Attention: Humanoid Robot Cognition
IEEE Intelligent Systems
Jijo-2: An Office Robot that Communicates and Learns
IEEE Intelligent Systems
IEEE Intelligent Systems
Optimal control using the transport equation: the Liouville machine
Adaptive Behavior
Learning cost-sensitive active classifiers
Artificial Intelligence
Learning of plan execution policies for indoor navigation
AI Communications - Special issue on KI-2001
Multiple model-based reinforcement learning
Neural Computation
Designing guide-path networks for automated guided vehicle system by using the Q-learning technique
Computers and Industrial Engineering
Evolutionary Computation
A personalized and integrative comparison-shopping engine and its applications
Decision Support Systems - Special issue: Agents and e-commerce business models
Optimizing hypervideo navigation using a Markov decision process approach
Proceedings of the tenth ACM international conference on Multimedia
Emergent neural computational architectures based on neuroscience
Machines that learn to play games
Formalizing the Ant Algorithms in Terms of Reinforcement Learning
ECAL '99 Proceedings of the 5th European Conference on Advances in Artificial Life
An Information-Theoretic Approach for the Quantification of Relevance
ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Learning While Exploring: Bridging the Gaps in the Eligibility Traces
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Social Agents Playing a Periodical Policy
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
On-Line Support Vector Machine Regression
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Propagation of Q-values in Tabular TD(lambda)
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Characterizing Markov Decision Processes
ECML '02 Proceedings of the 13th European Conference on Machine Learning
A Multi-agent System for Electronic Commerce including Adaptive Strategic Behaviours
EPIA '99 Proceedings of the 9th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Learning a Navigation Task in Changing Environments by Multi-task Reinforcement Learning
EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Reinforcement Learning in Situated Agents: Theoretical and Practical Solutions
EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Selection of Behavior in Social Situations
Proceedings of the EvoWorkshops on Applications of Evolutionary Computing
From the Sea to the Sidewalk: The Evolution of Hexapod Walking Gaits by a Genetic Algorithm
ICES '00 Proceedings of the Third International Conference on Evolvable Systems: From Biology to Hardware
Solving Partially Observable Problems by Evolution and Learning of Finite State Machines
ICES '01 Proceedings of the 4th International Conference on Evolvable Systems: From Biology to Hardware
Enhancing Multi-Agent Based Simulation with Human-Like Decision Making Strategies
MABS '00 Proceedings of the Second International Workshop on Multi-Agent-Based Simulation-Revised and Additional Papers
A Framework for Supporting Intelligent Fault and Performance Management for Communication Networks
MMNS '01 Proceedings of the 4th IFIP/IEEE International Conference on Management of Multimedia Networks and Services: Management of Multimedia on the Internet
An Overview of MAXQ Hierarchical Reinforcement Learning
SARA '02 Proceedings of the 4th International Symposium on Abstraction, Reformulation, and Approximation
Learning Options in Reinforcement Learning
Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Model Minimization in Hierarchical Reinforcement Learning
Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Language as a Complex Adaptive System
PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
An Integrated On-Line Learning System for Evolving Programmable Logic Array Controllers
PPSN VI Proceedings of the 6th International Conference on Parallel Problem Solving from Nature
On Using Constructivism in Neural Classifier Systems
PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
TCS Learning Classifier System Controller on a Real Robot
PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
Using and Evaluating Adaptive Agents for Electronic Commerce Negotiation
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Application of Reinforcement Learning to Electrical Power System Closed-Loop Emergency Control
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Self-Similar Layered Hidden Markov Models
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Reinforcement Learning: Past, Present and Future
SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
Least-Squares Methods in Reinforcement Learning for Control
SETN '02 Proceedings of the Second Hellenic Conference on AI: Methods and Applications of Artificial Intelligence
Modelling Intelligent Behaviour: The Markov Decision Process Approach
IBERAMIA '98 Proceedings of the 6th Ibero-American Conference on AI: Progress in Artificial Intelligence
An Analysis of the Pheromone Q-Learning Algorithm
IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
Learning to Reach the Pareto Optimal Nash Equilibrium as a Team
AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Q-Learning in Continuous State and Action Spaces
AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
Neurofuzzy Learning of Mobile Robot Behaviours
AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
A Classification Scheme for Negotiation in Electronic Commerce
Agent Mediated Electronic Commerce, The European AgentLink Perspective.
Agent Mediated Electronic Commerce, The European AgentLink Perspective.
AI '01 Proceedings of the 14th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Relational Reinforcement Learning
EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Agents' Advanced Features for Negotiation and Coordination
EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
SVD Reduction in Continuos Environment Reinforcement Learning
Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Reinforcement Learning for Control of Traffic and Access Points in Intelligent Wireless ATM Networks
Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications
Reinforcement Learning for Biped Locomotion
ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Intraday FX Trading: An Evolutionary Reinforcement Learning Approach
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Lempel-Ziv Coding in Reinforcement Learning
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Coordinating Learning Agents via Utility Assignment
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Variance-Penalized Reinforcement Learning for Risk-Averse Asset Allocation
IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
Learning to Predict Variable-Delay Rewards and Its Role in Autonomous Developmental Robotics
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
What Is a Learning Classifier System?
Learning Classifier Systems, From Foundations to Applications
Strength or Accuracy? Fitness Calculation in Learning Classifier Systems
Learning Classifier Systems, From Foundations to Applications
State of XCS Classifier System Research
Learning Classifier Systems, From Foundations to Applications
An Introduction to Learning Fuzzy Classifier Systems
Learning Classifier Systems, From Foundations to Applications
The Fighter Aircraft LCS: A Case of Different LCS Goals and Techniques
Learning Classifier Systems, From Foundations to Applications
A Roadmap to the Last Decade of Learning Classifier System Research
Learning Classifier Systems, From Foundations to Applications
An Artificial Economy of Post Production Systems
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Learning Classifier Systems Meet Multiagent Environments
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
An Algorithmic Description of XCS
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Using Classifier Systems as Adaptive Expert Systems for Control
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
YACS: Combining Dynamic Programming with Generalization in Classifier Systems
IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
Biasing Exploration in an Anticipatory Learning Classifier System
IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
Two Views of Classifier Systems
IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Connectionist Models of Neurons, Learning Processes and Artificial Intelligence-Part I
Information Integration for Robot Learning Using Neural Fuzzy Systems
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Hybrid Framework for Neuro-Dynamic Programming Application to Water Supply Networks
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Game Theory and Artificial Intelligence
Selected papers from the UKMAS Workshop on Foundations and Applications of Multi-Agent Systems
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer
RoboCup 2001: Robot Soccer World Cup V
Evolutionary Behavior Selection with Activation/Termination Constraints
RoboCup 2001: Robot Soccer World Cup V
Multiple Reward Criterion for Cooperative Behavior Acquisition in a Muliagent Environment
RoboCup-99: Robot Soccer World Cup III
Learning to Behave by Environment Reinforcement
RoboCup-99: Robot Soccer World Cup III
Reinforcement Learning for 3 vs. 2 Keepaway
RoboCup 2000: Robot Soccer World Cup IV
Proceedings of the workshop on Deception, Fraud, and Trust in Agent Societies held during the Autonomous Agents Conference: Trust in Cyber-societies, Integrating the Human and Artificial Perspectives
Toward the Formal Foundation of Ant Programming
ANTS '02 Proceedings of the Third International Workshop on Ant Algorithms
An Improved Q-Learning Algorithm Using Synthetic Pheromones
CEEMAS '01 Revised Papers from the Second International Workshop of Central and Eastern Europe on Multi-Agent Systems: From Theory to Practice in Multi-Agent Systems
Reactive and Memory-Based Genetic Programming for Robot Control
Proceedings of the Second European Workshop on Genetic Programming
On the Relationship between Learning Capability and the Boltzmann-Formula
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Value Prediction in Engineering Applications
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Learning from Human Decision-Making Behaviors - An Application to RoboCup Software Agents
IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence
On the Asymptotic Behaviour of a Constant Stepsize Temporal-Difference Learning Algorithm
EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Open Theoretical Questions in Reinforcement Learning
EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
Application of Episodic Q-Learning to a Multi-agent Cooperative Task
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Preliminary Results
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Learning in Character: Building Autonomous Animated Characters That Learn What They Ought to Learn
ICVS '01 Proceedings of the International Conference on Virtual Storytelling: Using Virtual Reality Technologies for Storytelling
Introduction to Sequence Learning
Sequence Learning - Paradigms, Algorithms, and Applications
Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making
Sequence Learning - Paradigms, Algorithms, and Applications
Similarity between Fuzzy Multi-objective Control and Eligibility
AFSS '02 Proceedings of the 2002 AFSS International Conference on Fuzzy Systems. Calcutta: Advances in Soft Computing
Emergent Neural Computational Architectures Based on Neuroscience - Towards Neuroscience-Inspired Computing
Decision-Theoretic Control of Planetary Rovers
Revised Papers from the International Seminar on Advances in Plan-Based Control of Robotic Agents,
Adaptive Representation Methods for Reinforcement Learning
AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Learning as a Consequence of Selection
Selected Papers from the 5th European Conference on Artificial Evolution
Abstraction Methods for Game Theoretic Poker
CG '00 Revised Papers from the Second International Conference on Computers and Games
Learning Time Allocation Using Neural Networks
CG '00 Revised Papers from the Second International Conference on Computers and Games
Chess Neighborhoods, Function Combination, and Reinforcement Learning
CG '00 Revised Papers from the Second International Conference on Computers and Games
Logic, Knowledge Representation, and Bayesian Decision Theory
CL '00 Proceedings of the First International Conference on Computational Logic
MINERVA: A Tour-Guide Robot that Learns
KI '99 Proceedings of the 23rd Annual German Conference on Artificial Intelligence: Advances in Artificial Intelligence
Dynamic Pricing of Information Products Based on Reinforcement Learning: A Yield-Management Approach
KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Using Document Structures for Personal Ontologies and User Modeling
UM '01 Proceedings of the 8th International Conference on User Modeling 2001
Faster Near-Optimal Reinforcement Learning: Adding Adaptiveness to the E3 Algorithm
ALT '99 Proceedings of the 10th International Conference on Algorithmic Learning Theory
Feedforward Neural Networks in Reinforcement Learning Applied to High-Dimensional Motor Control
ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
DS '02 Proceedings of the 5th International Conference on Discovery Science
To Collect or Not to Collect? Machine Learning for Memory Management
Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
High-Level Student Modeling with Machine Learning
ITS '00 Proceedings of the 5th International Conference on Intelligent Tutoring Systems
A Comparison of Decision Making Criteria and Optimization Methods for Active Robotic Sensing
NMA '02 Revised Papers from the 5th International Conference on Numerical Methods and Applications
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning
ATAL '00 Proceedings of the 7th International Workshop on Intelligent Agents VII. Agent Theories Architectures and Languages
Autonomous Spacecraft Resource Management: A Multi-agent Approach
AI*IA '99 Proceedings of the 6th Congress of the Italian Association for Artificial Intelligence on Advances in Artificial Intelligence
A Platform for Electronic Commerce with Adaptive Agents
Agent-Mediated Electronic Commerce III, Current Issues in Agent-Based Electronic Commerce Systems (includes revised papers from AMEC 2000 Workshop)
An Adaptive, Maintable, Extensible Process Agent
DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Optimizing Average Reward Using Discounted Rewards
COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
Self-Optimizing and Pareto-Optimal Policies in General Environments Based on Bayes-Mixtures
COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
A Multi-agent Q-learning Framework for Optimizing Stock Trading Systems
DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Agents for Industry Process Management
DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Reinforcement Learning to Drive a Car by Pattern Matching
Proceedings of the 24th DAGM Symposium on Pattern Recognition
Some Effects of Individual Learning on the Evolution of Sensors
ECAL '01 Proceedings of the 6th European Conference on Advances in Artificial Life
Spatiotemporal Abstraction of Stochastic Sequential Processes
Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Anticipation-Based Control Architecture for a Mobile Robot
ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning
IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
Incorporating Perception-Based Information in Reinforcement Learning Using Computing with Words
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
LC-Learning: Phased Method for Average Reward Reinforcement Learning - Analysis of Optimal Criteria
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Using ILP to Improve Planning in Hierarchical Reinforcement Learning
ILP '00 Proceedings of the 10th International Conference on Inductive Logic Programming
Dynamic balance of a biped robot using fuzzy reinforcement learning agents
Fuzzy Sets and Systems - Special issue: Fuzzy set techniques for intelligent robotic systems
Learning fuzzy rules from iterative execution of games
Fuzzy Sets and Systems - Theme: Modeling and learning
A context-based architecture for general problem solving
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Timed delivery of reward signals in an autonomous robot
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Levels of dynamics and adaptive behavior in evolutionary neural controllers
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Memetic-neural scheduler of jobs in identical parallel machines
Second international workshop on Intelligent systems design and application
Nonlinear credit assignment for musical sequences
Second international workshop on Intelligent systems design and application
Sequential cost-sensitive decision making with reinforcement learning
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of the self-organising map to reinforcement learning
Neural Networks - New developments in self-organizing maps
Reinforcement learning for POMDPs based on action values and stochastic optimization
Eighteenth national conference on Artificial intelligence
The design of collectives of agents to control non-Markovian systems
Eighteenth national conference on Artificial intelligence
Handbook of data mining and knowledge discovery
Neural networks and the financial markets
Exploring artificial intelligence in the new millennium
A System for Building Intelligent Agents that Learn to Retrieve and Extract Information
User Modeling and User-Adapted Interaction
A Bi-Recursive Neural Network Architecture for the Prediction of Protein Coarse Contact Maps
CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Soccer strategies that live in the B2B world of negotiation and decision-making
Decision Support Systems
A non-computationally-intensive neurocontroller for autonomous mobile robot navigation
Biologically inspired robot behavior engineering
A bio-inspired robotic mechanism for autonomous locomotion in unconventional environments
Autonomous robotic systems
Integration of soft computing towards autonomous legged robots
Autonomous robotic systems
SOS++: finding smart behaviors using learning and evolution
ICAL 2003 Proceedings of the eighth international conference on Artificial life
Walverine: a Walrasian trading agent
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Adaptive policy gradient in multiagent learning
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
A selection-mutation model for q-learning in multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Introducing an agent of a certain persuasion
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
How to calm hyperactive agents
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
An introduction to reinforcement learning theory: value function methods
Advanced lectures on machine learning
Offline learning and the role of autogenous speech: new suggestions from birdsong research
Speech Communication - Special issue on the nature of speech perception (the psychophysics of speech perception III)
Recent Advances in Hierarchical Reinforcement Learning
Discrete Event Dynamic Systems
A reinforcement learning adaptive fuzzy controller for robots
Fuzzy Sets and Systems - Theme: Modeling and control
Reinforcement learning based on local state feature learning and policy adjustment
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Introduction to multimedia and mobile agents
Autonomous mental development in high dimensional context and action spaces
Neural Networks - 2003 Special issue: Advances in neural networks research IJCNN'03
On the convergence of optimistic policy iteration
The Journal of Machine Learning Research
ε-mdps: learning in varying environments
The Journal of Machine Learning Research
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
The Journal of Machine Learning Research
Using confidence bounds for exploitation-exploration trade-offs
The Journal of Machine Learning Research
Learning behavior-selection by emotions and cognition in a multi-goal robot task
The Journal of Machine Learning Research
Adaptive Radial Basis Decomposition by Learning Vector Quantization
Neural Processing Letters
Mining Plans for Customer-Class Transformation
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
To buy or not to buy: mining airfare data to minimize ticket purchase price
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Adding numbers to text classification
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Learning systems and their engineering: a project proposal
Practicing software engineering in the 21st century
Computer Networks: The International Journal of Computer and Telecommunications Networking
Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
Least-squares policy iteration
The Journal of Machine Learning Research
Interpretation by Implementation for Understanding a Multiagent Organization
Computational & Mathematical Organization Theory
Inter-module credit assignment in modular reinforcement learning
Neural Networks
Combining importance sampling and temporal difference control variates to simulate Markov Chains
ACM Transactions on Modeling and Computer Simulation (TOMACS)
The domestic robot—a friendly cognitive system takes care of your home
Ambient intelligence
Autonomous Learning Architecture for Environmental Mapping
Journal of Intelligent and Robotic Systems
Development and the Baldwin effect
Artificial Life
Learning obstacle avoidance with an operant behavior model
Artificial Life
A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL
Probability in the Engineering and Informational Sciences
Reinforcement learning with via-point representation
Neural Networks
An experimental evaluation of reinforcement learning for gain scheduling
Design and application of hybrid intelligent systems
Rated MCRDR: finding non-linear relationships between classifications in MCRDR
Design and application of hybrid intelligent systems
Employing OLAP mining for multiagent reinforcement learning
Design and application of hybrid intelligent systems
Policy gradient methods in multi-agent systems: pursuit problem
Design and application of hybrid intelligent systems
A Reinforcement Learning Framework for Parameter Control in Computer Vision Applications
CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
CRV '04 Proceedings of the 1st Canadian Conference on Computer and Robot Vision
The Journal of Machine Learning Research
A Geometric Approach to Multi-Criterion Reinforcement Learning
The Journal of Machine Learning Research
A generic architecture for adaptive agents based on reinforcement learning
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Bio-inspired systems (BIS)
Self-organized load balancing in proxy servers: algorithms and performance
Journal of Intelligent Information Systems - Special issue on web intelligence
Representing von Neumann–Morgenstern Games in the Situation Calculus
Annals of Mathematics and Artificial Intelligence
Transfer of Experience Between Reinforcement Learning Environments with Progressive Difficulty
Artificial Intelligence Review
Utile distinction hidden Markov models
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Using relative novelty to identify useful temporal abstractions in reinforcement learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
P3VI: a partitioned, prioritized, parallel value iterator
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning when and how to coordinate
Web Intelligence and Agent Systems
Reinforcement Learning with Factored States and Actions
The Journal of Machine Learning Research
Recommender Systems Research: A Connection-Centric Survey
Journal of Intelligent Information Systems
Cross channel optimized marketing by reinforcement learning
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Integrating Guidance into Relational Reinforcement Learning
Machine Learning
Best-Response Multiagent Learning in Non-Stationary Environments
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Unifying Temporal and Structural Credit Assignment Problems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Hierarchical Reinforcement Learning in Communication-Mediated Multiagent Coordination
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Multi-Agent Patrolling with Reinforcement Learning
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Organization-Based Coalition Formation
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Improving the Learning Rate by Inducing a Transition Model
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Text Adaptation for Mobile Digital Teletext
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Learning to play games in extensive form by valuation
TARK '01 Proceedings of the 8th conference on Theoretical aspects of rationality and knowledge
Precomputing avatar behavior from human motion data
SCA '04 Proceedings of the 2004 ACM SIGGRAPH/Eurographics symposium on Computer animation
Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Market-based recommendation: Agents that compete for consumer attention
ACM Transactions on Internet Technology (TOIT)
Affective Learning — A Manifesto
BT Technology Journal
Exploitation vs. exploration: choosing a supplier in an environment of incomplete information
Decision Support Systems
Learning diagnostic policies from examples by systematic search
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
Knowledge-Based Kernel Approximation
The Journal of Machine Learning Research
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
The Journal of Machine Learning Research
Coordinating Multiple Agents via Reinforcement Learning
Autonomous Agents and Multi-Agent Systems
Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection
Genetic Programming and Evolvable Machines
ICEC '04 Proceedings of the 6th international conference on Electronic commerce
A theory of epineuronal memory
Neural Networks
Using Optimal Foraging Models to Evaluate Learned Robotic Foraging Behavior
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An Architecture for Behavior-Based Reinforcement Learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Fast multi-level adaptation for interactive autonomous characters
ACM Transactions on Graphics (TOG)
Research challenges of autonomic computing
Proceedings of the 27th international conference on Software engineering
System for foreign exchange trading using genetic algorithms and reinforcement learning
International Journal of Systems Science
QoS Control Strategies for High-Quality Video Processing
Real-Time Systems
Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Walverine: a Walrasian trading agent
Decision Support Systems - Special issue: Decision theory and game theory in agent design
Agent learning in supplier selection models
Decision Support Systems - Special issue: Decision theory and game theory in agent design
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process
Proceedings of the 2005 ACM symposium on Applied computing
An adaptive pursuit strategy for allocating operator probabilities
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
XCS with computed prediction in multistep environments
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
An abstraction agorithm for genetics-based reinforcement learning
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
GAMM: genetic algorithms with meta-models for vision
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Self-managed decentralised systems using K-components and collaborative reinforcement learning
WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Online model-based adaptation for optimizing performance and dependability
WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Layering and heterogeneity as design principles for animated embedded agents
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent embedded agents
Computational intelligence for structured learning of a partner robot based on imitation
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent embedded agents
Using Agents and Simulation to Develop Adequate Thinking Styles
ICALT '05 Proceedings of the Fifth IEEE International Conference on Advanced Learning Technologies
Behavior transfer for value-function-based reinforcement learning
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Multi-agent reward analysis for learning in noisy domains
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Modeling task allocation using a decision theoretic model
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Automatic computer game balancing: a reinforcement learning approach
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Improving reinforcement learning function approximators via neuroevolution
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Integrating Relevance Feedback Techniques for Image Retrieval Using Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Novel runtime systems support for adaptive compositional modeling in PSEs
Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
E-commerce intelligent agent: personalization travel support agent using Q Learning
ICEC '05 Proceedings of the 7th international conference on Electronic commerce
Reinforcement learning for active model selection
UBDM '05 Proceedings of the 1st international workshop on Utility-based data mining
Contextual recommender problems [extended abstract]
UBDM '05 Proceedings of the 1st international workshop on Utility-based data mining
Cooperative Multi-Agent Learning: The State of the Art
Autonomous Agents and Multi-Agent Systems
Optimal Control Using the Transport Equation: The Liouville Machine
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Thesis: relational reinforcement learning
AI Communications
Teaching virtual characters how to use body language
Lecture Notes in Computer Science
Implementing Temporal-Difference Learning with the Scaled Conjugate Gradient Algorithm
Neural Processing Letters
Adaptive value function approximations in classifier systems
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Learning classifier system equivalent with reinforcement learning with function approximation
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Counter example for Q-bucket-brigade under prediction problem
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
An autonomous explore/exploit strategy
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Relating reinforcement learning performance to classification performance
ICML '05 Proceedings of the 22nd international conference on Machine learning
Proto-value functions: developmental reinforcement learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
High speed obstacle avoidance using monocular vision and reinforcement learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
Coarticulation: an approach for generating concurrent plans in Markov decision processes
ICML '05 Proceedings of the 22nd international conference on Machine learning
A theoretical analysis of Model-Based Interval Estimation
ICML '05 Proceedings of the 22nd international conference on Machine learning
Bayesian sparse sampling for on-line reward optimization
ICML '05 Proceedings of the 22nd international conference on Machine learning
The Development of Embodied Cognition: Six Lessons from Babies
Artificial Life
A middleware for autonomic QoS management based on learning
SEM '05 Proceedings of the 5th international workshop on Software engineering and middleware
Local Reinforcement and Recombination in Classifier Systems
Evolutionary Computation
Rule Fitness and Pathology in Learning Classifier Systems
Evolutionary Computation
Emergence of Cooperation: State of the Art
Artificial Life
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games
Autonomous Agents and Multi-Agent Systems
Hybrid least-squares methods for reinforcement learning
IEA/AIE'2003 Proceedings of the 16th international conference on Developments in applied artificial intelligence
Artificial Intelligence in Medicine
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents
Computer Networks: The International Journal of Computer and Telecommunications Networking
Context awarable self-configuration system for distributed resource management
IEA/AIE'2005 Proceedings of the 18th international conference on Innovations in Applied Artificial Intelligence
NJFun: a reinforcement learning spoken dialogue system
ANLP/NAACL-ConvSyst '00 Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems - Volume 3
Adaptive dialogue systems - interaction with interact
SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
Temporal Difference Model Reproduces Anticipatory Neural Activity
Neural Computation
Developing adaptive auction mechanisms
ACM SIGecom Exchanges
Minds and Machines - Machine learning as experimental philosophy of science
Evolution of Cooperative Problem Solving in an Artificial Economy
Neural Computation
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms
Neural Computation
Reinforcement Learning in Continuous Time and Space
Neural Computation
Finding optimal satisficing strategies for and-or trees
Artificial Intelligence
Pedagogical possibilities for the dice game pig
Journal of Computing Sciences in Colleges
Sequence-Learning Algorithm Based on Backward Chaining
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Combining metric and topological navigation of simulated robots
Acta Cybernetica
Efficient Discriminant Viewpoint Selection for Active Bayesian Recognition
International Journal of Computer Vision
Playing games in many possible worlds
EC '06 Proceedings of the 7th ACM conference on Electronic commerce
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle
IEEE Transactions on Dependable and Secure Computing
Adaptive game AI with dynamic scripting
Machine Learning
Universal parameter optimisation in games based on SPSA
Machine Learning
SHAGE: a framework for self-managed robot software
Proceedings of the 2006 international workshop on Self-adaptation and self-managing systems
Simulating sellers in online exchanges
Decision Support Systems
A short tutorial on reinforcement learning: review and applications
Intelligent information processing II
Precomputing avatar behavior from human motion data
Graphical Models - Special issue on SCA 2004
Agent-based buddy-finding methodology for knowledge sharing
Information and Management
Using inaccurate models in reinforcement learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Relational temporal difference learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Learning the structure of Factored Markov Decision Processes in reinforcement learning problems
ICML '06 Proceedings of the 23rd international conference on Machine learning
Qualitative reinforcement learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
PAC model-free reinforcement learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Combining gradient techniques for numerical multi-objective evolutionary optimization
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Standard and averaging reinforcement learning in XCS
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Classifier prediction based on tile coding
Proceedings of the 8th annual conference on Genetic and evolutionary computation
A Bayesian approach to learning classifier systems in uncertain environments
Proceedings of the 8th annual conference on Genetic and evolutionary computation
On-line evolutionary computation for reinforcement learning in stochastic domains
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Genetic algorithms for action set selection across domains: a demonstration
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Reward allotment in an event-driven hybrid learning classifier system for online soccer games
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Une description probabiliste de la communication parlée entre homme et machine
IHM 2004 Proceedings of the 16th conference on Association Francophone d'Interaction Homme-Machine
Adaptive mechanism design: a metalearning approach
ICEC '06 Proceedings of the 8th international conference on Electronic commerce: The new e-commerce: innovations for conquering current barriers, obstacles and limitations to conducting successful business on the internet
Division of labor in a group of robots inspired by ants' foraging behavior
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Design patterns from biology for distributed computing
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A Model of Prefrontal Cortical Mechanisms for Goal-directed Behavior
Journal of Cognitive Neuroscience
Learnable behavioural model for autonomous virtual agents: low-level learning
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
A hierarchical approach to efficient reinforcement learning in deterministic domains
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Efficient agents for cliff-edge environments with a large set of decision options
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning from induced changes in opponent (re)actions in multi-agent games
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Rule value reinforcement learning for cognitive agents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
On the relationship between MDPs and the BDI architecture
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Coordinating simple and unreliable agents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Haloperidol Impairs Learning and Error-related Negativity in Humans
Journal of Cognitive Neuroscience
Representation and timing in theories of the dopamine system
Neural Computation
QoS dynamic routing for wireless sensor networks
Proceedings of the 2nd ACM international workshop on Quality of service & security for wireless and mobile networks
Evolving classifiers on field programmable gate arrays: migrating XCS to FPGAs
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Nature-inspired applications and systems
Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context
International Journal of Robotics and Automation
Fuzzy and tile coding function approximation in agent coevolution
AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Kernel rewards regression: an information efficient batch policy iteration approach
AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Turning lights out with DQ-learning
AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Economy-like reward distribution for division of labor
AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
The self-organizing relationship (SOR) network employing fuzzy inference based heuristic evaluation
Neural Networks - 2006 Special issue: Advances in self-organizing maps--WSOM'05
Integrate and conquer: the next generation of intelligent avatars
Proceedings of the 2005 ACM SIGCHI International Conference on Advances in computer entertainment technology
Game design through self-play experiments
Proceedings of the 2005 ACM SIGCHI International Conference on Advances in computer entertainment technology
OMax brothers: a dynamic yopology of agents for improvization learning
Proceedings of the 1st ACM workshop on Audio and music computing multimedia
Neural Processing Letters
Building autonomic systems using collaborative reinforcement learning
The Knowledge Engineering Review
TAUPE: towards understanding program comprehension
CASCON '06 Proceedings of the 2006 conference of the Center for Advanced Studies on Collaborative research
Modeling energy constrained routing in selfish ad hoc networks
GameNets '06 Proceeding from the 2006 workshop on Game theory for communications and networks
The Role of Problem Classification in Online Meta-cognition
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Approximate Reasoning in MAS: Rough Set Approach
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Properties and mechanisms of self-organizing MANET and P2P systems
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Neural mechanism for stochastic behaviour during a competitive game
Neural Networks - 2006 Special issue: Neurobiology of decision making
Neural Networks - 2006 Special issue: Neurobiology of decision making
Effects of reward expectancy on sequential eye movements in monkeys
Neural Networks - 2006 Special issue: Neurobiology of decision making
Multi-agent learning model with bargaining
Proceedings of the 38th conference on Winter simulation
Proceedings of the 38th conference on Winter simulation
Learning what to talk about in descriptive games
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Applied Soft Computing
Neural-based downlink scheduling algorithm for broadband wireless networks
Computer Communications
Fundamenta Informaticae - Contagious Creativity - In Honor of the 80th Birthday of Professor Solomon Marcus
Creating significant learning experiences in introductory artificial intelligence
Proceedings of the 38th SIGCSE technical symposium on Computer science education
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
Binet-Cauchy Kernels on Dynamical Systems and its Application to the Analysis of Dynamic Scenes
International Journal of Computer Vision
Behavioral Pattern Identification Through Rough Set Modeling
Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Calculi of Approximation Spaces
Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Rough Set Approach to Behavioral Pattern Identification
Fundamenta Informaticae - New Frontiers in Scientific Discovery - Commemorating the Life and Work of Zdzislaw Pawlak
Dimensions of complexity of intelligent agents
PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
A hybrid system of abductive tactical decision making
International Journal of Hybrid Intelligent Systems
Performance analysis of the AntNet algorithm
Computer Networks: The International Journal of Computer and Telecommunications Networking
Using multi-agent systems for learning optimal policies for complex problems
ACM-SE 45 Proceedings of the 45th annual southeast regional conference
Proceedings of the 2006 international conference on Game research and development
Robust automatic target recognition using learning classifier systems
Information Fusion
Aggregation of web search engines based on users' preferences in WebFusion
Knowledge-Based Systems
A document retrieval support system with term relationship
Web Intelligence and Agent Systems
Gradient descent for symmetric and asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
Scientific Programming - Distributed Computing and Applications
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies
Mathematics of Operations Research
Synergies Between Intrinsic and Synaptic Plasticity Mechanisms
Neural Computation
Reinforcement Learning State Estimator
Neural Computation
If multi-agent learning is the answer, what is the question?
Artificial Intelligence
Policy Gradient in Continuous Time
The Journal of Machine Learning Research
Evolutionary Function Approximation for Reinforcement Learning
The Journal of Machine Learning Research
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
The Journal of Machine Learning Research
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents
dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
Causal Graph Based Decomposition of Factored MDPs
The Journal of Machine Learning Research
Point-Based Value Iteration for Continuous POMDPs
The Journal of Machine Learning Research
Approximate Reasoning in MAS: Rough Set Approach
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Decentralized, adaptive resource allocation for sensor networks
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Motivated reinforcement learning for adaptive characters in open-ended simulation games
Proceedings of the international conference on Advances in computer entertainment technology
IEEE Transactions on Pattern Analysis and Machine Intelligence
Exploring selfish reinforcement learning in repeated games with stochastic rewards
Autonomous Agents and Multi-Agent Systems
Learning to communicate in a decentralized environment
Autonomous Agents and Multi-Agent Systems
Local strategy learning in networked multi-agent team formation
Autonomous Agents and Multi-Agent Systems
Knowledge acquisition for adaptive game AI
Science of Computer Programming
Modeling embodied visual behaviors
ACM Transactions on Applied Perception (TAP)
On developmental mental architectures
Neurocomputing
Chaotic time series prediction for the game, Rock-Paper-Scissors
Applied Soft Computing
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Neural Computation
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension
Evolutionary Computation
Research Issues in Multiple Policy Optimization Using Collaborative Reinforcement Learning
SEAMS '07 Proceedings of the 2007 International Workshop on Software Engineering for Adaptive and Self-Managing Systems
Combining online and offline knowledge in UCT
Proceedings of the 24th international conference on Machine learning
Bayesian actor-critic algorithms
Proceedings of the 24th international conference on Machine learning
Constructing basis functions from directed graphs for value function approximation
Proceedings of the 24th international conference on Machine learning
Proceedings of the 24th international conference on Machine learning
Cross-domain transfer for reinforcement learning
Proceedings of the 24th international conference on Machine learning
Multi-task reinforcement learning: a hierarchical Bayesian approach
Proceedings of the 24th international conference on Machine learning
MILCS: a mutual information learning classifier system
Proceedings of the 9th annual conference companion on Genetic and evolutionary computation
Proceedings of the 9th annual conference companion on Genetic and evolutionary computation
Learning and Cooperation in Sequential Games
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Empirical Studies in Action Selection with Reinforcement Learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Emergence of Mirror Neurons in a Model of Gaze Following
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The Neural Basis for Visual Selective Attention in Young Infants: A Computational Account
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Responsive characters from motion fragments
ACM SIGGRAPH 2007 papers
Initial results from the use of learning classifier systems to control in vitro neuronal networks
Proceedings of the 9th annual conference on Genetic and evolutionary computation
Empirical analysis of generalization and learning in XCS with gradient descent
Proceedings of the 9th annual conference on Genetic and evolutionary computation
XCSF with computed continuous action
Proceedings of the 9th annual conference on Genetic and evolutionary computation
Practical learning from one-sided feedback
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning and adaptivity in interactive recommender systems
Proceedings of the ninth international conference on Electronic commerce
Learning to trade with insider information
Proceedings of the ninth international conference on Electronic commerce
Data acquisition and cost-effective predictive modeling: targeting offers for electronic commerce
Proceedings of the ninth international conference on Electronic commerce
Shaping multi-agent systems with gradient reinforcement learning
Autonomous Agents and Multi-Agent Systems
Metric embedding of view-graphs
Autonomous Robots
A reinforcement agent for threshold fusion
Applied Soft Computing
Introduction and control of subgoals in reinforcement learning
AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
A layered approach to learning coordination knowledge in multiagent environments
Applied Intelligence
Generalized multiagent learning with performance bound
Autonomous Agents and Multi-Agent Systems
Design of a peer-to-peer system for optimized content replication
Computer Communications
Elman Backpropagation as Reinforcement for Simple Recurrent Networks
Neural Computation
Usage-based web recommendations: a reinforcement learning approach
Proceedings of the 2007 ACM conference on Recommender systems
Reinforcement learning in multi-agent environment and ant colony for packet scheduling in routers
Proceedings of the 5th ACM international workshop on Mobility management and wireless access
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of reinforcement learning to the game of Othello
Computers and Operations Research
IEEE Transactions on Parallel and Distributed Systems
Self-organization for search in peer-to-peer networks: the exploitation-exploration dilemma
Proceedings of the 1st international conference on Bio inspired models of network, information and computing systems
EURASIP Journal on Embedded Systems
Policy-driven autonomic management of multi-component systems
CASCON '07 Proceedings of the 2007 conference of the center for advanced studies on Collaborative research
Adaptive evolutionary programming based on reinforcement learning
Information Sciences: an International Journal
Universal Intelligence: A Definition of Machine Intelligence
Minds and Machines
Modeling dopamine activity by Reinforcement Learning methods: implications from two recent models
Artificial Intelligence Review
Transfer via inter-task mappings in policy search reinforcement learning
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Parallel reinforcement learning with linear function approximation
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
An incentive mechanism for message relaying in unstructured peer-to-peer systems
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Model-based function approximation in reinforcement learning
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Reinforcement learning with utility-aware agents for market-based resource allocation
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Towards reinforcement learning representation transfer
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed path planning for mobile robots using a swarm of interacting reinforcement learners
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Convergence and rate of convergence of a simple ant model
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
IFSA: incremental feature-set augmentation for reinforcement learning tasks
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Distributed agent-based air traffic flow management
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
A reinforcement learning framework for online data migration in hierarchical storage systems
The Journal of Supercomputing
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Artificial Intelligence
Learning how to combine sensory-motor functions into a robust behavior
Artificial Intelligence
Simulating interactions of avatars in high dimensional state space
Proceedings of the 2008 symposium on Interactive 3D graphics and games
Robotics and Autonomous Systems
A study of mechanisms for improving robotic group performance
Artificial Intelligence
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning
International Journal of Robotics Research
Million Module March: Scalable Locomotion for Large Self-Reconfiguring Robots
International Journal of Robotics Research
Learning to Move in Modular Robots using Central Pattern Generators and Online Optimization
International Journal of Robotics Research
Adaptive building of decision trees by reinforcement learning
AIC'07 Proceedings of the 7th Conference on 7th WSEAS International Conference on Applied Informatics and Communications - Volume 7
An approach to fully automatic aircraft collision avoidance and navigation
ACS'07 Proceedings of the 7th Conference on 7th WSEAS International Conference on Applied Computer Science - Volume 7
Active audition using the parameter-less self-organising map
Autonomous Robots
Learning polite behavior with situation models
Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction
Optimizing time warp simulation with reinforcement learning techniques
Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come
Application of stochastic learning automata to intelligent vehicle control
ICCOMP'07 Proceedings of the 11th WSEAS International Conference on Computers
An ASML model for an intelligent vehicle control system
ICCOMP'07 Proceedings of the 11th WSEAS International Conference on Computers
Radial basis networks for the simulation of stand alone AC generators during no-break power transfer
Proceedings of the 2007 Summer Computer Simulation Conference
RL-MAC: a reinforcement learning based MAC protocol for wireless sensor networks
International Journal of Sensor Networks
Dynamic learning of action patterns for object acquisition
International Journal of Intelligent Systems Technologies and Applications
Controlling an autonomous agent using internal value based action selection
International Journal of Intelligent Systems Technologies and Applications
Workstation capacity tuning using reinforcement learning
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
AIKED'05 Proceedings of the 4th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering Data Bases
Proceedings of the 2008 ACM symposium on Applied computing
A hybrid web recommender system based on Q-learning
Proceedings of the 2008 ACM symposium on Applied computing
Extremal search of decision policies for scalable distributed applications
Proceedings of the 2nd international conference on Scalable information systems
Biologically-inspired adaptive learning control strategies: A rough set approach
International Journal of Hybrid Intelligent Systems
Knowledge propagation in a distributed omnidirectional vision system
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Marco Somalvico Memorial Issue
Adaptivity at every layer: a modular approach for evolving societies of learning autonomous systems
Proceedings of the 2008 international workshop on Software engineering for adaptive and self-managing systems
Artificial Intelligence techniques: An introduction to their use for modelling environmental systems
Mathematics and Computers in Simulation
Cooperation learning in Multi-Agent Systems with annotation and reward
International Journal of Knowledge-based and Intelligent Engineering Systems
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
International Journal of Knowledge-based and Intelligent Engineering Systems
Error bounds of optimization algorithms for semi-Markov decision processes
International Journal of Systems Science
Investigation of Q-learning in the context of a virtual learning environment
Informatics in education
Real-time dynamic fuzzy Q-learning and control of mobile robots
ICECS'03 Proceedings of the 2nd WSEAS International Conference on Electronics, Control and Signal Processing
Future Generation Computer Systems
Fuzzy Q-Learning with the modified fuzzy ART neural network
Web Intelligence and Agent Systems
A survey of autonomic computing—degrees, models, and applications
ACM Computing Surveys (CSUR)
Advancing the Layered Approach to Agent-Based Crowd Simulation
Proceedings of the 22nd Workshop on Principles of Advanced and Distributed Simulation
An adaptive approach for ensuring reliability in event based middleware
Proceedings of the second international conference on Distributed event-based systems
Proceedings of the 10th annual conference companion on Genetic and evolutionary computation
Scaling ant colony optimization with hierarchical reinforcement learning partitioning
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Towards efficient online reinforcement learning using neuroevolution
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Genetic algorithms for mentor-assisted evaluation function optimization
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Learning all optimal policies with multiple criteria
Proceedings of the 25th international conference on Machine learning
An object-oriented representation for efficient reinforcement learning
Proceedings of the 25th international conference on Machine learning
Proceedings of the 25th international conference on Machine learning
Hierarchical model-based reinforcement learning: R-max + MAXQ
Proceedings of the 25th international conference on Machine learning
Non-parametric policy gradients: a unified treatment of propositional and relational domains
Proceedings of the 25th international conference on Machine learning
Proceedings of the 25th international conference on Machine learning
Online kernel selection for Bayesian reinforcement learning
Proceedings of the 25th international conference on Machine learning
The many faces of optimism: a unifying approach
Proceedings of the 25th international conference on Machine learning
A semiparametric statistical approach to model-free policy evaluation
Proceedings of the 25th international conference on Machine learning
Preconditioned temporal difference learning
Proceedings of the 25th international conference on Machine learning
A bayesian logistic regression model for active relevance feedback
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective
The Journal of Machine Learning Research
Accelerated Neural Evolution through Cooperatively Coevolved Synapses
The Journal of Machine Learning Research
Rollout sampling approximate policy iteration
Machine Learning
An agent-based system for simulating dynamic choice-sets
Proceedings of the 2008 Spring simulation multiconference
Proceedings of the 2008 Spring simulation multiconference
A combined tactical and strategic hierarchical learning framework in multi-agent games
Sandbox '08 Proceedings of the 2008 ACM SIGGRAPH symposium on Video games
On updates that constrain the features' connections during learning
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Regulating air traffic flow with coupled agents
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Aligning social welfare and agent preferences to alleviate traffic congestion
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Autonomous transfer for reinforcement learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Analysis of an evolutionary reinforcement learning method in a multiagent domain
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
The utility of temporal abstraction in reinforcement learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Switching dynamics of multi-agent learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Dynamics based control with PSRs
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Controlling deliberation in a Markov decision process-based agent
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Expediting RL by using graphical structures
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Transfer of task representation in reinforcement learning using policy-based proto-value functions
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
A new perspective to the keepaway soccer: the takers
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Graph Laplacian based transfer learning in reinforcement learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Autonomous agent learning using an actor-critic algorithm and behavior models
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Social reward shaping in the prisoner's dilemma
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Identifying beneficial teammates using multi-dimensional trust
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Evolutionary dynamics for designing multi-period auctions
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Sensitivity derivatives for flexible sensorimotor learning
Neural Computation
Information Technology and Management
Adapting the interaction state model in conversational recommender systems
Proceedings of the 10th international conference on Electronic commerce
On the possibility of learning in reactive environments with arbitrary dependence
Theoretical Computer Science
Automating cyber-defense management
Proceedings of the 2nd workshop on Recent advances on intrusiton-tolerant systems
State space optimization using plan recognition and reinforcement learning on RTS game
AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
Application of the self organizing maps for visual reinforcement learning of mobile robot
AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
Reinforcement learning for appearance based visual servoing in robotic manipulation
ROCOM'08 Proceedings of the 8th WSEAS International Conference on Robotics, Control and Manufacturing Technology
Geodesic Gaussian kernels for value function approximation
Autonomous Robots
Incremental Learning of Planning Operators in Stochastic Domains
SOFSEM '07 Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science
Real World Multi-agent Systems: Information Sharing, Coordination and Planning
Logic, Language, and Computation
Checking Liveness Properties of Concurrent Systems by Reinforcement Learning
Model Checking and Artificial Intelligence
Reinforcement Learning in Fine Time Discretization
ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
Postural Control of Two-Stage Inverted Pendulum Using Reinforcement Learning and Self-organizing Map
ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part II
Autonomous Learning of Ball Trapping in the Four-Legged Robot League
RoboCup 2006: Robot Soccer World Cup X
Fuzzy Q-Map Algorithm for Reinforcement Learning
Computational Intelligence and Security
Towards Real-Time Distributed Signal Modeling for Brain-Machine Interfaces
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Reinforcement Learning Reward Functions for Unsupervised Learning
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
An Extremely Simple Reinforcement Learning Rule for Neural Networks
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Online Dynamic Value System for Machine Learning
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Intelligence Through Interaction: Towards a Unified Theory for Learning
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
State Space Partition for Reinforcement Learning Based on Fuzzy Min-Max Neural Network
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Reinforcement Learning in Nonstationary Environment Navigation Tasks
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Anticipations, Brains, Individual and Social Behavior: An Introduction to Anticipatory Systems
Anticipatory Behavior in Adaptive Learning Systems
Neural Correlates of Anticipation in Cerebellum, Basal Ganglia, and Hippocampus
Anticipatory Behavior in Adaptive Learning Systems
The Role of Anticipation in the Emergence of Language
Anticipatory Behavior in Adaptive Learning Systems
Anticipatory Behavior in Adaptive Learning Systems
Anticipatory Behavior in Adaptive Learning Systems
Anticipatory Behavior in Adaptive Learning Systems
On Affect and Self-adaptation: Potential Benefits of Valence-Controlled Action-Selection
IWINAC '07 Proceedings of the 2nd international work-conference on The Interplay Between Natural and Artificial Computation, Part I: Bio-inspired Modeling of Cognitive Tasks
Feed-Forward Learning: Fast Reinforcement Learning of Controllers
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Combining the Best of the Two Worlds: Inheritance Versus Experience
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Strategies for Affect-Controlled Action-Selection in Soar-RL
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Toward Approximate Adaptive Learning
RSEISP '07 Proceedings of the international conference on Rough Sets and Intelligent Systems Paradigms
Variable Selection for Optimal Decision Making
AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs
ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Co-operative Co-evolutionary System for Solving Dynamic VRPTW Problems with Crisis Situations
HoloMAS '07 Proceedings of the 3rd international conference on Industrial Applications of Holonic and Multi-Agent Systems: Holonic and Multi-Agent Systems for Manufacturing
Cognitive Technical Systems -- What Is the Role of Artificial Intelligence?
KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Making a Robot Learn to Play Soccer Using Reward and Punishment
KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Multi-agent Learning Dynamics: A Survey
CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
Graph-Based Domain Mapping for Transfer Learning in General Games
ECML '07 Proceedings of the 18th European conference on Machine Learning
Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs
ECML '07 Proceedings of the 18th European conference on Machine Learning
Planning and Learning in Environments with Delayed Feedback
ECML '07 Proceedings of the 18th European conference on Machine Learning
ECML '07 Proceedings of the 18th European conference on Machine Learning
Sequence Labeling with Reinforcement Learning and Ranking Algorithms
ECML '07 Proceedings of the 18th European conference on Machine Learning
Imitation Learning Using Graphical Models
ECML '07 Proceedings of the 18th European conference on Machine Learning
Uncovering Fraud in Direct Marketing Data with a Fraud Auditing Case Builder
PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions
AI*IA '07 Proceedings of the 10th Congress of the Italian Association for Artificial Intelligence on AI*IA 2007: Artificial Intelligence and Human-Oriented Computing
Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin
Neural Information Processing
Estimating Internal Variables of a Decision Maker's Brain: A Model-Based Approach for Neuroscience
Neural Information Processing
Neural Information Processing
Computational Modeling of Human-Robot Interaction Based on Active Intention Estimation
Neural Information Processing
Task Learning Based on Reinforcement Learning in Virtual Environment
Neural Information Processing
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others
RoboCup 2007: Robot Soccer World Cup XI
Model-Based Reinforcement Learning in a Complex Domain
RoboCup 2007: Robot Soccer World Cup XI
A Framework for Learning in Humanoid Simulated Robots
RoboCup 2007: Robot Soccer World Cup XI
Implementing Parametric Reinforcement Learning in Robocup Rescue Simulation
RoboCup 2007: Robot Soccer World Cup XI
Adaptive Power Management Based on Reinforcement Learning for Embedded System
IEA/AIE '08 Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence
Flexible Control Mechanism for Multi-DOF Robotic Arm Based on Biological Fluctuation
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Toward a Theory of Embodied Statistical Learning
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Mutual Development of Behavior Acquisition and Recognition Based on Value System
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
A Computational Model of the Amygdala Nuclei's Role in Second Order Conditioning
SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
Scheduling for Reliable Execution in Autonomic Systems
ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Simulation-Based Optimization Approach for Software Cost Model with Rejuvenation
ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Neural Approximation of Monte Carlo Policy Evaluation Deployed in Connect Four
ANNPR '08 Proceedings of the 3rd IAPR workshop on Artificial Neural Networks in Pattern Recognition
Toward Automatic Hint Generation for Logic Proof Tutoring Using Historical Student Data
ITS '08 Proceedings of the 9th international conference on Intelligent Tutoring Systems
Teaching Machine Learning to Design Students
Edutainment '08 Proceedings of the 3rd international conference on Technologies for E-Learning and Digital Entertainment
An Empirical Analysis of the Impact of Prioritised Sweeping on the DynaQ's Performance
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Epoch-Incremental Queue-Dyna Algorithm
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
On Using Reinforcement Learning to Solve Sparse Linear Systems
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
RLTE: Reinforcement Learning for Traffic-Engineering
AIMS '08 Proceedings of the 2nd international conference on Autonomous Infrastructure, Management and Security: Resilient Networks and Services
Online Phase-Adaptive Data Layout Selection
ECOOP '08 Proceedings of the 22nd European conference on Object-Oriented Programming
Mixture of Expert Used to Learn Game Play
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Multigrid Reinforcement Learning with Reward Shaping
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Robust Population Coding in Free-Energy-Based Reinforcement Learning
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
A Continuous Internal-State Controller for Partially Observable Markov Decision Processes
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Modular Neural Networks for Model-Free Behavioral Learning
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
Mimicking Go Experts with Convolutional Neural Networks
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
A Computational Model of Cortico-Striato-Thalamic Circuits in Goal-Directed Behaviour
ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
A Learning Automata Approach to Multi-agent Policy Gradient Learning
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Learning Grouping and Anti-predator Behaviors for Multi-agent Systems
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
QFCS: A Fuzzy LCS in Continuous Multi-step Environments with Continuous Vector Actions
Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Evolution Strategies for Direct Policy Search
Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Reinforcement Learning: Insights from Interesting Failures in Parameter Selection
Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Evolving Neural Networks for Online Reinforcement Learning
Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
A Steady-State Genetic Algorithm with Resampling for Noisy Inventory Control
Proceedings of the 10th international conference on Parallel Problem Solving from Nature: PPSN X
Learning Smooth, Human-Like Turntaking in Realtime Dialogue
IVA '08 Proceedings of the 8th international conference on Intelligent Virtual Agents
A New Natural Policy Gradient by Stationary Distribution Metric
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
State-Dependent Exploration for Policy Gradient Methods
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Transferring Instances for Model-Based Reinforcement Learning
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Rule-Based Analysis of Behaviour Learned by Evolutionary and Reinforcement Algorithms
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation
MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Robustness Analysis of SARSA(λ): Different Models of Reward and Initialisation
AIMSA '08 Proceedings of the 13th international conference on Artificial Intelligence: Methodology, Systems, and Applications
Robot Navigation Based on Fuzzy RL Algorithm
ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
Applying Reinforcement Learning to Multi-robot Team Coordination
HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Automated Generation of Knowledge Plane Components for Multimedia Access Networks
MACE '08 Proceedings of the 3rd IEEE international workshop on Modelling Autonomic Communications Environments
A Logical Framework to Reinforcement Learning Using Hybrid Probabilistic Logic Programs
SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
A comparison between ATNoSFERES and Learning Classifier Systems on non-Markov problems
Information Sciences: an International Journal
Value Function Based Reinforcement Learning in Changing Markovian Environments
The Journal of Machine Learning Research
Towards adaptive programming: integrating reinforcement learning into a programming language
Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
A reinforcement learning model for supply chain ordering management: An application to the beer game
Decision Support Systems
A reinforced learning control using iterative error compensation for uncertain dynamical systems
International Journal of Computer Mathematics
INFLUENCE OF TEMPERATURE ON SWARMBOTS THAT LEARN
Cybernetics and Systems
REINFORCEMENT LEARNING FOR POMDP USING STATE CLASSIFICATION
Applied Artificial Intelligence
Agent's actions as a classification criteria for the state space in a learning from rewards system
Journal of Experimental & Theoretical Artificial Intelligence
Hierarchical pathfinding and AI-based learning approach in strategy game design
International Journal of Computer Games Technology - Joint International Conference on Cyber Games and Interactive Entertainment 2006
State space segmentation for acquisition of agent behavior
Web Intelligence and Agent Systems
Itinerary determination of imprecise mobile agents with firm deadline
Web Intelligence and Agent Systems
CCMAC: coordinated cooperative MAC for wireless LANs
Proceedings of the 11th international symposium on Modeling, analysis and simulation of wireless and mobile systems
A self-adaptive placement protocol for mobile directories in MANETs
Proceedings of the 11th international symposium on Modeling, analysis and simulation of wireless and mobile systems
Implementation of a neural-based navigation approach on indoor and outdoor mobile robots
CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The Neuromodulatory System: A Framework for Survival and Adaptive Behavior in a Challenging World
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Eavesdropping: audience interaction in networked audio performance
MM '08 Proceedings of the 16th ACM international conference on Multimedia
A Cultural Algorithm for POMDPs from Stochastic Inventory Control
HM '08 Proceedings of the 5th International Workshop on Hybrid Metaheuristics
EURASIP Journal on Wireless Communications and Networking - Cognitive Radio and Dynamic Spectrum Sharing Systems
ART2 neural network interacting with environment
Neurocomputing
Using temporal-difference learning for multi-agent bargaining
Electronic Commerce Research and Applications
WSEAS Transactions on Information Science and Applications
Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets
Computational Linguistics
ICIRA '08 Proceedings of the First International Conference on Intelligent Robotics and Applications: Part I
Comparing active vision models
Image and Vision Computing
Experimental Analysis of Sample-Based Maps for Long-Term SLAM
International Journal of Robotics Research
Actor Critic Learning: A Near Set Approach
RSCTC '08 Proceedings of the 6th International Conference on Rough Sets and Current Trends in Computing
Revisiting UCS: Description, Fitness Sharing, and Comparison with XCS
Learning Classifier Systems
A Learning Classifier System with Mutual-Information-Based Fitness
Learning Classifier Systems
Individual and Social Behaviour in the IPA Market with RL
SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Reinforcement Learning with Markov Logic Networks
MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Learning the Filling Policy of a Biodegradation Process by Fuzzy Actor---Critic Learning Methodology
MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Reinforcement Learning on a Futures Market Simulator
KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Proposal of Exploitation-Oriented Learning PS-r#
IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
Simulating Interactions of Characters
Motion in Games
Learning to Attend -- From Bottom-Up to Top-Down
Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Biologically Inspired Framework for Learning and Abstract Representation of Attention Control
Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
On the Role of Dopamine in Cognitive Vision
Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Statistics and Computing
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
Recent Advances in Reinforcement Learning
Recent Advances in Reinforcement Learning
Basis Expansion in Natural Actor Critic Methods
Recent Advances in Reinforcement Learning
Reinforcement Learning with the Use of Costly Features
Recent Advances in Reinforcement Learning
Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem
Recent Advances in Reinforcement Learning
Optimistic Planning of Deterministic Systems
Recent Advances in Reinforcement Learning
Policy Iteration for Learning an Exercise Policy for American Options
Recent Advances in Reinforcement Learning
Tile Coding Based on Hyperplane Tiles
Recent Advances in Reinforcement Learning
Use of Reinforcement Learning in Two Real Applications
Recent Advances in Reinforcement Learning
Applications of Reinforcement Learning to Structured Prediction
Recent Advances in Reinforcement Learning
New Error Bounds for Approximations from Projected Linear Equations
Recent Advances in Reinforcement Learning
Partial Order Hierarchical Reinforcement Learning
AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Making Financial Trading by Recurrent Reinforcement Learning
KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
A Study of Reinforcement Learning in a New Multiagent Domain
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Formalizing Multi-state Learning Dynamics
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Learning-Rate Adjusting Q-Learning for Prisoner's Dilemma Games
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
An Information-Theoretic Class of Stochastic Decision Processes
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Towards a Self-Organising Mechanism for Learning Adaptive Decision-Making Rules
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Optimal Local Basis: A Reinforcement Learning Approach for Face Recognition
International Journal of Computer Vision
Learning and planning in environments with delayed feedback
Autonomous Agents and Multi-Agent Systems
Learning to trust in the competence and commitment of agents
Autonomous Agents and Multi-Agent Systems
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Towards end-to-end quality of service: controlling I/O interference in shared storage servers
Proceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware
International Journal of Intelligent Systems Technologies and Applications
Simulation and reinforcement learning with soccer agents
Multiagent and Grid Systems - Innovations in intelligent agent technology
A New Learning Algorithm for Optimal Stopping
Discrete Event Dynamic Systems
Strategy-acquisition system for video trading card game
ACE '08 Proceedings of the 2008 International Conference on Advances in Computer Entertainment Technology
Evolutionary computation using reinforced learning on image compression
ISTASC'08 Proceedings of the 8th conference on Systems theory and scientific computation
Unsupervised learning based feature points detection in ECG
ISTASC'08 Proceedings of the 8th conference on Systems theory and scientific computation
Evolutionary computation using reinforced learning on image compression
SSIP'08 Proceedings of the 8th conference on Signal, Speech and image processing
Unsupervised learning based feature points detection in ECG
SSIP'08 Proceedings of the 8th conference on Signal, Speech and image processing
Intentional learning agent architecture
Autonomous Agents and Multi-Agent Systems
General Game Playing with Ants
SEAL '08 Proceedings of the 7th International Conference on Simulated Evolution and Learning
Performance Evaluation of an Adaptive Ant Colony Optimization Applied to Single Machine Scheduling
SEAL '08 Proceedings of the 7th International Conference on Simulated Evolution and Learning
Improving the Exploration Strategy in Bandit Algorithms
Learning and Intelligent Optimization
Tuning Local Search by Average-Reward Reinforcement Learning
Learning and Intelligent Optimization
Hierarchical Classifiers for Complex Spatio-temporal Concepts
Transactions on Rough Sets IX
An adaptive middleware for supporting time-critical event response
Cluster Computing
Imitation guided learning in learning classifier systems
Natural Computing: an international journal
A Machine Learning Method for Dynamic Traffic Control and Guidance on Freeway Networks
CAR '09 Proceedings of the 2009 International Asia Conference on Informatics in Control, Automation and Robotics
A spiking neural network model of an actor-critic learning agent
Neural Computation
The factored policy-gradient planner
Artificial Intelligence
A new evolutionary reinforcement scheme for stochastic learning automata
ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
How people talk when teaching a robot
Proceedings of the 4th ACM/IEEE international conference on Human robot interaction
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Factored value iteration converges
Acta Cybernetica
Factored temporal difference learning in the new ties environment
Acta Cybernetica
A role-oriented BDI framework for real-time multiagent teaming
Intelligent Decision Technologies
Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Theoretical advances of intelligent paradigms
Some topics for simulation optimization
Proceedings of the 40th Conference on Winter Simulation
Predictive models in the brain
Connection Science
Letters: On the bias of batch Bellman residual minimisation
Neurocomputing
Gaussian process dynamic programming
Neurocomputing
Robotics and Autonomous Systems
Reinforcement distribution in fuzzy Q-learning
Fuzzy Sets and Systems
Natural Language Engineering
Reinforcement Learning with Orthonormal Basis Adaptation Based on Activity-Oriented Index Allocation
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Modeling reinforcement learning algorithms for performance analysis
Proceedings of the International Conference on Advances in Computing, Communication and Control
Robotics and Computer-Integrated Manufacturing
An Adaptable Oscillator-Based Controller for Autonomous Robots
Journal of Intelligent and Robotic Systems
Boosting the performance of computing systems through adaptive configuration tuning
Proceedings of the 2009 ACM symposium on Applied Computing
Comparing Learning Attention Control in Perceptual and Decision Space
Attention in Cognitive Systems
A case-based approach for coordinated action selection in robot soccer
Artificial Intelligence
Linear Bellman combination for control of character animation
ACM SIGGRAPH 2009 papers
Learning Actions through Imitation and Exploration: Towards Humanoid Robots That Learn from Humans
Creating Brain-Like Intelligence
Co-evolution of Rewards and Meta-parameters in Embodied Evolution
Creating Brain-Like Intelligence
Basal Ganglia Models for Autonomous Behavior Learning
Creating Brain-Like Intelligence
A Probabilistic Approach for Mining Drifting User Interest
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Decision Support Systems
COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS
Cybernetics and Systems
An Optimal Approximate Dynamic Programming Algorithm for the Lagged Asset Acquisition Problem
Mathematics of Operations Research
Journal of Cognitive Neuroscience
Simultaneous Optimal Control and Discrete Stochastic Sensor Selection
HSCC '09 Proceedings of the 12th International Conference on Hybrid Systems: Computation and Control
Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing
Color learning and illumination invariance on mobile robots: A survey
Robotics and Autonomous Systems
Multi-robot task allocation through vacancy chain scheduling
Robotics and Autonomous Systems
Fuzzy CMAC with automatic state partition for reinforcementlearning
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
CNSR '09 Proceedings of the 2009 Seventh Annual Communication Networks and Services Research Conference
Static strategy and dynamic adjustment: An effective method for Grid task scheduling
Future Generation Computer Systems
Designing autonomous layered video coders
Image Communication
Learning the IPA market with individual and social rewards
Web Intelligence and Agent Systems
Analysis and improvement of the genetic discovery component of XCS
International Journal of Hybrid Intelligent Systems - Data Mining and Hybrid Intelligent Systems
A model for the dynamic coordination of multiple competing goals
Journal of Experimental & Theoretical Artificial Intelligence
A new marketing strategy map for direct marketing
Knowledge-Based Systems
Dynamic analysis of multiagent Q-learning with ε-greedy exploration
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Hoeffding and Bernstein races for selecting policies in evolutionary direct policy search
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Regularization and feature selection in least-squares temporal difference learning
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Binary action search for learning continuous-action control policies
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Constraint relaxation in approximate linear programs
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning when to stop thinking and do something!
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Model-free reinforcement learning as mixture learning
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Discovering options from example trajectories
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
VCONF: a reinforcement learning approach to virtual machines auto-configuration
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Automatic exploration of datacenter performance regimes
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
GMAC '09 Proceedings of the 6th international conference industry session on Grids meets autonomic computing
Training a real-world POMDP-based dialogue system
NAACL-HLT-Dialog '07 Proceedings of the Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technologies
Technical support dialog systems: issues, problems, and solutions
NAACL-HLT-Dialog '07 Proceedings of the Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technologies
Improving recommender systems with adaptive conversational strategies
Proceedings of the 20th ACM conference on Hypertext and hypermedia
Stronger CDA strategies through empirical game-theoretic analysis and reinforcement learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Adaptive learning in evolving task allocation networks
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Solving multiagent assignment Markov decision processes
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Generalized model learning for reinforcement learning in factored domains
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Online exploration in least-squares policy iteration
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
An empirical analysis of value function-based and policy search reinforcement learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
State-coupled replicator dynamics
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
A task specification language for bootstrap learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Modelling the dynamics of multiagent Q-learning with ε-greedy exploration
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Fuzzy Kanerva-based function approximation for reinforcement learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Novel reinforcement learning-based approaches to reduce loss probability in buffer-less OBS networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Scheduling policy design for autonomic systems
International Journal of Autonomous and Adaptive Communications Systems
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings
Similarity-Based Clustering
ISNN '09 Proceedings of the 6th International Symposium on Neural Networks on Advances in Neural Networks
Reordering Sparsification of Kernel Machines in Approximate Policy Iteration
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Automatic control based on wasp behavioral model and stochastic learning automata
MAMECTIS'08 Proceedings of the 10th WSEAS international conference on Mathematical methods, computational techniques and intelligent systems
Demonstration of a POMDP voice dialer
HLT-Demonstrations '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session
An Inductive Logic Programming Approach to Statistical Relational Learning
Proceedings of the 2005 conference on An Inductive Logic Programming Approach to Statistical Relational Learning
Improving Batch Reinforcement Learning Performance through Transfer of Samples
Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
Direct Policy Search Reinforcement Learning for Robot Control
Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Transfer Learning and Intelligence: an Argument and Approach
Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Artificial general intelligence: an organism and level based position statement
Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
On the Broad Implications of Reinforcement Learning based AGI
Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference
Towards an Intelligent Tutoring System for Propositional Proof Construction
Proceedings of the 2008 conference on Current Issues in Computing and Philosophy
Proceedings of the 2008 conference on Knowledge-Based Software Engineering: Proceedings of the Eighth Joint Conference on Knowledge-Based Software Engineering
Fast Learning in an Actor-Critic Architecture with Reward and Punishment
Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Towards Automatic Model Generation by Optimization
Proceedings of the 2008 conference on Tenth Scandinavian Conference on Artificial Intelligence: SCAI 2008
Learning by Automatic Option Discovery from Conditionally Terminating Sequences
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Least Squares SVM for Least Squares TD Learning
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Multi-Agent Least-Squares Policy Iteration
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Reinforcement Learning with Classifier Selection for Focused Crawling
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Dynamic Multi-Armed Bandit with Covariates
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Reinforcement Learning with the Use of Costly Features
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Multi-Agent Reinforcement Learning for Intrusion Detection: A case study and evaluation
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning to Select Object Recognition Methods for Autonomous Mobile Robots
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games
KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Design and performance analysis of an inductive QoS routing algorithm
Computer Communications
Reinforcement learning for robot soccer
Autonomous Robots
Neuroevolutionary reinforcement learning for generalized helicopter control
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Novelty of behaviour as a basis for the neuro-evolution of operant reward learning
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
EDA-RL: estimation of distribution algorithms for reinforcement learning problems
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Evolving an autonomous agent for non-Markovian reinforcement learning
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Uncertainty handling CMA-ES for reinforcement learning
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Simulating human grandmasters: evolution and coevolution of evaluation functions
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
On the characteristics of sequential decision problems and their impact on evolutionary computation
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Learning in the time-dependent minority game
Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
Reinforcement learning for games: failures and successes
Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
TEMMAS: The Electricity Market Multi-Agent Simulator
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Two Steps Reinforcement Learning in Continuous Reinforcement Learning Tasks
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Social and Cognitive System for Learning Negotiation Strategies with Incomplete Information
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning*
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Contribution to the Control of a MAS's Global Behaviour: Reinforcement Learning Tools
Engineering Societies in the Agents World IX
Anticipatory Behavior in Adaptive Learning Systems
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
Anticipatory Behavior in Adaptive Learning Systems
The kNN-TD Reinforcement Learning Algorithm
IWINAC '09 Proceedings of the 3rd International Work-Conference on The Interplay Between Natural and Artificial Computation: Part I: Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira's Scientific Legacy
Multi-agent Reinforcement Learning in Network Management
AIMS '09 Proceedings of the 3rd International Conference on Autonomous Infrastructure, Management and Security: Scalability of Networks and Services
Finding Errors of Hybrid Systems by Optimising an Abstraction-Based Quality Estimate
TAP '09 Proceedings of the 3rd International Conference on Tests and Proofs
Learning Representation and Control in Markov Decision Processes: New Frontiers
Foundations and Trends® in Machine Learning
Using machine learning in a cooperative hybrid parallel strategy of metaheuristics
Information Sciences: an International Journal
International Journal of Robotics Research
Toward Rough-Granular Computing
RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Robotic Target Tracking with Approximation Space-Based Feedback During Reinforcement Learning
RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
A q-learning based adaptive bidding strategy in combinatorial auctions
Proceedings of the 11th International Conference on Electronic Commerce
Randomized shortest-path problems: Two related models
Neural Computation
Performance bounded reinforcement learning in strategic interactions
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
An instance-based state representation for network repair
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Reinforcement learning for a CPG-driven biped robot
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Machine learning for adaptive image interpretation
IAAI'04 Proceedings of the 16th conference on Innovative applications of artifical intelligence
Towards autonomic computing: adaptive job routing and scheduling
IAAI'04 Proceedings of the 16th conference on Innovative applications of artifical intelligence
Incremental least squares policy iteration for POMDPs
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Learning representation and control in continuous Markov decision processes
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
QUICR-learning for multi-agent coordination
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Real-time evolution of neural networks in the NERO video game
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Curiosity-driven exploration with planning trajectories
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Inter-task action correlation for reinforcement learning tasks
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Modeling human decision making in cliff-edge environments
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Incremental least-squares temporal difference learning
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
A simple and effective method for incorporating advice into kernel methods
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Using Homomorphisms to transfer options across continuous reinforcement learning domains
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Sample-efficient evolutionary function approximation for reinforcement learning
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Hard constrained semi-Markov decision processes
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Learning partially observable action schemas
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Interactively shaping agents via human reinforcement: the TAMER framework
Proceedings of the fifth international conference on Knowledge capture
A GeoAgent-based framework for knowledge-oriented representation: Embracing social rules in GIS
International Journal of Geographical Information Science
Online Markov Decision Processes
Mathematics of Operations Research
Case-Based Reasoning in Transfer Learning
ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Development of Symbiotic Brain-Machine Interfaces Using a Neurophysiology Cyberworkstation
Proceedings of the 13th International Conference on Human-Computer Interaction. Part II: Novel Interaction Methods and Techniques
NJFun: a reinforcement learning spoken dialogue system
ConversationalSys '00 Proceedings of the ANLP-NAACL 2000 Workshop on Conversational Systems
Natural language generation as planning under uncertainty for spoken dialogue systems
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Prediction of solar conditions by emotional learning
Intelligent Data Analysis
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games
Web Intelligence and Agent Systems
Learning lexical alignment policies for generating referring expressions in spoken dialogue systems
ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Machine learning in digital games: a survey
Artificial Intelligence Review
Hybrid least-squares algorithms for approximate policy evaluation
Machine Learning
A DR algorithm based on artificial potential field method
Multimedia Tools and Applications
HoloMAS '09 Proceedings of the 4th International Conference on Industrial Applications of Holonic and Multi-Agent Systems: Holonic and Multi-Agent Systems for Manufacturing
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Optimal Online Learning Procedures for Model-Free Policy Evaluation
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Learning the Difference between Partially Observable Dynamical Systems
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Extending the Strada Framework to Design an AI for ORTS
ICEC '09 Proceedings of the 8th International Conference on Entertainment Computing
Reinforcement Learning for Blackjack
ICEC '09 Proceedings of the 8th International Conference on Entertainment Computing
Efficient Sample Reuse in EM-Based Policy Search
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Feature Selection for Value Function Approximation Using Bayesian Model Selection
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Considering Unseen States as Impossible in Factored Reinforcement Learning
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Learning to become an expert: Reinforcement learning and the acquisition of perceptual expertise
Journal of Cognitive Neuroscience
Robust task-based control policies for physics-based characters
ACM SIGGRAPH Asia 2009 papers
Efficient no-regret multiagent learning
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Non-stationary policy learning in 2-player zero sum games
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Value functions for RL-based behavior transfer: a comparative study
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Samuel meets Amarel: automating value function approximation using global state space analysis
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Error bounds for approximate value iteration
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Improving action selection in MDP's via knowledge transfer
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Lazy approximation for solving continuous finite-horizon MDPs
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
The max K-armed bandit: a new model of exploration applied to search heuristic selection
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Adaptive modeling and planning for reactive agents
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Towards competence in autonomous agents
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Improving reinforcement learning function approximators via neuroevolution
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Compact spectral bases for value function approximation using Kronecker factorization
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Efficient reinforcement learning with relocatable action models
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Efficient structure learning in factored-state MDPs
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Thresholded rewards: acting optimally in timed, zero-sum games
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Temporal difference and policy search methods for reinforcement learning: an empirical comparison
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
On policy learning in restricted policy spaces
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Autonomous inter-task transfer in reinforcement learning domains
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Coordination and multi-tasking using EMT
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Markov decision processes for control of a sensor network-based health monitoring system
IAAI'05 Proceedings of the 17th conference on Innovative applications of artificial intelligence - Volume 3
RETALIATE: learning winning policies in first-person shooter games
IAAI'07 Proceedings of the 19th national conference on Innovative applications of artificial intelligence - Volume 2
Reinforcement learning for vulnerability assessment in peer-to-peer networks
IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
Adaptive treatment of epilepsy via batch-mode reinforcement learning
IAAI'08 Proceedings of the 20th national conference on Innovative applications of artificial intelligence - Volume 3
A case study on the critical role of geometric regularity in machine learning
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Adaptive importance sampling with automatic model selection in value function approximation
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Fast spectral learning using Lanczos eigenspace projections
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Adaptive management of air traffic flow: a multiagent coordination approach
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Autonomous robot skill acquisition
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Evaluation of a hierarchical reinforcement learning spoken dialogue system
Computer Speech and Language
ScaNaLU '06 Proceedings of the Third Workshop on Scalable Natural Language Understanding
Journal of Artificial Intelligence Research
OBDD-based universal planning for synchronized agents in non-deterministic domains
Journal of Artificial Intelligence Research
Hierarchical reinforcement learning with the MAXQ value function decomposition
Journal of Artificial Intelligence Research
Accelerating reinforcement learning by composing solutions of automatically identified subtasks
Journal of Artificial Intelligence Research
Optimizing dialogue management with reinforcement learning: experiments with the NJFun system
Journal of Artificial Intelligence Research
Efficient reinforcement learning using recursive least-squares methods
Journal of Artificial Intelligence Research
Collective intelligence, data routing and braess' paradox
Journal of Artificial Intelligence Research
Potential-based shaping and Q-value initialization are equivalent
Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
Risk-sensitive reinforcement learning applied to control under constraints
Journal of Artificial Intelligence Research
Perseus: randomized point-based value iteration for POMDPs
Journal of Artificial Intelligence Research
Integrating learning from examples into the search for diagnostic policies
Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems
Journal of Artificial Intelligence Research
Learning in real-time search: a unifying framework
Journal of Artificial Intelligence Research
Solving factored MDPs with hybrid state and action variables
Journal of Artificial Intelligence Research
Anytime point-based approximations for large POMDPs
Journal of Artificial Intelligence Research
Closed-loop learning of visual control policies
Journal of Artificial Intelligence Research
Learning to play using low-complexity rule-based policies: illustrations through Ms. Pac-Man
Journal of Artificial Intelligence Research
Optimal and approximate Q-value functions for decentralized POMDPs
Journal of Artificial Intelligence Research
Adaptive stochastic resource control: a machine learning approach
Journal of Artificial Intelligence Research
Learning partially observable deterministic action models
Journal of Artificial Intelligence Research
Learning to reach agreement in a continuous ultimatum game
Journal of Artificial Intelligence Research
Interactive policy learning through confidence-based autonomy
Journal of Artificial Intelligence Research
Infinite-horizon policy-gradient estimation
Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation
Journal of Artificial Intelligence Research
IJCAI'99 Proceedings of the 16th international joint conference on Artifical intelligence - Volume 1
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Learning and multiagent reasoning for autonomous agents
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A call admission control scheme using neuroevolution algorithm in cellular networks
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
General game learning using knowledge transfer
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Online learning and exploiting relational models in reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Utile distinctions for relational reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
State similarity based approach for improving performance in RL
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Building portable options: skill transfer in reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Transfer learning in real-time strategy games using hybrid CBR/RL
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Direct code access in self-organizing neural networks for reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Dynamics of temporal difference learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning policies for embodied virtual agents through demonstration
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning to walk through imitation
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Using linear programming for Bayesian exploration in Markov decision processes
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
An analysis of Laplacian methods for value function approximation in MDPs
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Bayesian inverse reinforcement learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Detecting and forecasting economic regimes in multi-agent automated exchanges
Decision Support Systems
Simultaneous adversarial multi-robot learning
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Generalizing plans to new environments in relational MDPs
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Use of off-line dynamic programming for efficient image interpretation
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Modular self-organization for a long-living autonomous agent
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Topology selection for stream mining systems
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Adaptive Learning Based on Exercises Fitness Degree
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Reinforcement Learning in RoboCup KeepAway with Partial Observability
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Tank War Using Online Reinforcement Learning
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Real-time planning for parameterized human motion
Proceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Customizing directions in an automated wayfinding system for individuals with cognitive impairment
Proceedings of the 11th international ACM SIGACCESS conference on Computers and accessibility
SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Automatic abstraction in reinforcement learning using data mining techniques
Robotics and Autonomous Systems
Operant matching as a nash equilibrium of an intertemporal game
Neural Computation
A neurocomputational model for cocaine addiction
Neural Computation
Learning Bayesian network equivalence classes with Ant Colony optimization
Journal of Artificial Intelligence Research
Towards a general framework for cross-layer decision making in multimedia systems
IEEE Transactions on Circuits and Systems for Video Technology
Temporal difference learning applied to a high-performance game-playing program
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Reinforcement learning in distributed domains: beyond team games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Fast concurrent reinforcement learners
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Multi-agent systems by incremental gradient reinforcement learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Exploiting multiple secondary reinforcers in policy gradient reinforcement learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Rational and convergent learning in stochastic games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
State abstraction discovery from irrelevant state variables
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Using predictive representations to improve generalization in reinforcement learning
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Two-sided bandits and the dating market
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Evolutionary behavior learning for action-based environment modeling by a mobile robot
Applied Soft Computing
Dynamic Customer Management and the Value of One-to-One Marketing
Marketing Science
Intelligence Dynamics: a concept and preliminary experiments for open-ended learning agents
Autonomous Agents and Multi-Agent Systems
Learning classifier systems: a complete introduction, review, and roadmap
Journal of Artificial Evolution and Applications
Finding optimal satisficing strategies for and-or trees
Artificial Intelligence
Effective learning in the presence of adaptive counterparts
Journal of Algorithms
Learning of shared attention in sociable robotics
Journal of Algorithms
Neuroevolution strategies for episodic reinforcement learning
Journal of Algorithms
Intensional dynamic programming. A Rosetta stone for structured dynamic programming
Journal of Algorithms
Short survey: Taxonomy and survey of RFID anti-collision protocols
Computer Communications
States representations with a hierarchical dependency in reinforcement learning
ISC '07 Proceedings of the 10th IASTED International Conference on Intelligent Systems and Control
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents
Computer Networks: The International Journal of Computer and Telecommunications Networking
IEEE Journal on Selected Areas in Communications - Special issue on wireless and pervasive communications for healthcare
Fuzzy-UCS: a Michigan-style learning fuzzy-classifier system for supervised learning
IEEE Transactions on Evolutionary Computation
Interactive evolution of particle systems for computer graphics and animation
IEEE Transactions on Evolutionary Computation
A reward field model generation in Q-learning by dynamic programming
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Ant colony optimization incorporated with fuzzy Q-learning for reinforcement fuzzy control
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Optimal contraction theorem for exploration-exploitation tradeoff in search and optimization
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Keeping the resident in the loop: adapting the smart home to the user
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Reinforcement learning versus model predictive control: a comparison on a power system problem
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Neural network output optimization using interval analysis
IEEE Transactions on Neural Networks
A Q-learning approach to derive optimal consumption and investment strategies
IEEE Transactions on Neural Networks
IEEE Transactions on Neural Networks
Learning Deep Architectures for AI
Foundations and Trends® in Machine Learning
Coordination motion-tasks using actual robot dynamics
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
To Elicit Or To Tell: Does It Matter?
Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Utility in hint generation: Selection of hints from a corpus of student work
Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Using neural gas for a better machine identity description
ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Efficient skill learning using abstraction selection
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Autonomously learning an action hierarchy using a learned qualitative state representation
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Solving POMDPs: RTDP-bel vs. point-based algorithms
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Learning hierarchical task networks for nondeterministic planning domains
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
An RL-based scheduling algorithm for video traffic in high-rate wireless personal area networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
FICA: A novel intelligent crawling algorithm based on reinforcement learning
Web Intelligence and Agent Systems
Structured prediction with reinforcement learning
Machine Learning
Imitation as a mechanism of cultural transmission
Artificial Life
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach
Fundamenta Informaticae - Swarm Intelligence
Coordination motion-tasks using actual robot dynamics
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A rules-based approach for configuring chains of classifiers in real-time stream mining systems
EURASIP Journal on Advances in Signal Processing
An efficient MAC protocol for throughput enhancement in dense RFID system
ISWPC'09 Proceedings of the 4th international conference on Wireless pervasive computing
Customized learning algorithms for episodic tasks withacyclic state spaces
CASE'09 Proceedings of the fifth annual IEEE international conference on Automation science and engineering
MDP based active localization for multiple robots
CASE'09 Proceedings of the fifth annual IEEE international conference on Automation science and engineering
A computational neuroscience model of working memory with application to robot perceptual learning
CI '07 Proceedings of the Third IASTED International Conference on Computational Intelligence
Exploration and exploitation balance management in fuzzy reinforcement learning
Fuzzy Sets and Systems
SIMBA: A simulator for business education and research
Decision Support Systems
Stochastic model for outcome prediction in acute illness
Computers in Biology and Medicine
Reinforcement learning for mapping instructions to actions
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Markov decision process frameworks for cooperative retransmission in wireless networks
WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Q-learning for joint access decision in heterogeneous networks
WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
Coevolving intelligent game players in a cultural framework
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Overcoming the bootstrap problem in evolutionary robotics using behavioral diversity
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
On-line neuroevolution applied to the open racing car simulator
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Memory-enhanced evolutionary robotics: the echo state network approach
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
An Additive Reinforcement Learning
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Learning Automata Based Intelligent Tutorial-like System
KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part I
Inference and Learning in Planning (Extended Abstract)
DS '09 Proceedings of the 12th International Conference on Discovery Science
EcoSimNet: A Multi-Agent System for Ecological Simulation and Optimization
EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Anytime Self-play Learning to Satisfy Functional Optimality Criteria
ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Effectiveness of Intrinsically Motivated Adaptive Agent for Sustainable Human-Agent Interaction
ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
PRIMA '09 Proceedings of the 12th International Conference on Principles of Practice in Multi-Agent Systems
Dynamic tuning of online data migration policies in hierarchical storage systems using reinforcement learning
A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning framework for utility-based scheduling in resource-constrained systems
A reinforcement learning approach to dynamic resource allocation
A reinforcement learning approach to dynamic resource allocation
Adaptive data-aware utility-based scheduling in resource-constrained systems
Adaptive data-aware utility-based scheduling in resource-constrained systems
A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments
Dynamic adaptation of user migration policies in distributed virtual environments
Dynamic adaptation of user migration policies in distributed virtual environments
Scalable approach for effective control of gene regulatory networks
Artificial Intelligence in Medicine
An artificial immune network approach for pinyin-to- character conversion
VECIMS'09 Proceedings of the 2009 IEEE international conference on Virtual Environments, Human-Computer Interfaces and Measurement Systems
Simulating sellers in online exchanges
Decision Support Systems
Approximate dynamic programming using Bellman residual elimination and Gaussian process regression
ACC'09 Proceedings of the 2009 conference on American Control Conference
Fuzzy ant colony optimization for optimal control
ACC'09 Proceedings of the 2009 conference on American Control Conference
Robust adaptive Markov decision processes in multi-vehicle applications
ACC'09 Proceedings of the 2009 conference on American Control Conference
A Q-learning model-independent flow controller for high-speed networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
Multiresolution state-space discretization method for Q-learning
ACC'09 Proceedings of the 2009 conference on American Control Conference
Nash Q-learning multi-agent flow control for high-speed networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
Which landmark is useful?: learning selection policies for navigation in unknown environments
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Least absolute policy iteration for robust value function approximation
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Architecture of behavior-based and robotics self-optimizing memory controller
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Transfer of knowledge for a climbing virtual human: a reinforcement learning approach
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Learning motor primitives for robotics
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Adaptive autonomous control using online value iteration with Gaussian processes
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Smoothed Sarsa: reinforcement learning for robot delivery tasks
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A motor learning neural model based on Bayesian network and reinforcement learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Relational reinforcement learning applied to shared attention
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Using continuous action spaces to solve discrete problems
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Coordinated multiple ramps metering based on neuro-fuzzy adaptive dynamic programming
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Reinforcement learning of multiple tasks using parametric bias
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A dynamical connectionist model of idea generation
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A parallel hybrid implementation using genetic algorithm, GRASP and reinforcement learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Generalized policy iteration for continuous-time systems
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Reconfigurable disruption tolerant routing via reinforcement learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Improving management of Anemia in end stage renal disease using reinforcement learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Goal-directed feature learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
From mirror neurons to computational neurolinguistics
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Dialogue act prediction using stochastic context-free grammar induction
CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
Novel runtime systems support for adaptive compositional modeling in PSEs
Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
k-nearest neighbor Monte-Carlo control algorithm for POMDP-based dialogue systems
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Agent-based buddy-finding methodology for knowledge sharing
Information and Management
Associating domain-dependent knowledge and Monte Carlo approaches within a Go program
Information Sciences: an International Journal
Layering and heterogeneity as design principles for animated embedded agents
Information Sciences: an International Journal
Computational intelligence for structured learning of a partner robot based on imitation
Information Sciences: an International Journal
Switching between different state representations in reinforcement learning
AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
Impacts of team size on role learning in multiagent systems
AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
RL-based superframe order adaptation algorithm for IEEE 802.15.4 networks
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
An adaptive inventory control for a supply chain
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
SI-CCMAC: sender initiating concurrent cooperative MAC for wireless LANs
WiOPT'09 Proceedings of the 7th international conference on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks
Bridging the gap between feature- and grid-based SLAM
Robotics and Autonomous Systems
An energy-efficient data gathering algorithm to prolong lifetime of wireless sensor networks
Computer Communications
Probabilistic fuzzy logic system: a tool to process stochastic and imprecise information
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
View estimation learning based on value system
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Human instruction recognition and self behavior acquisition based on state value
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Protecting buying agents in e-marketplaces by direct experience trust modelling
Knowledge and Information Systems
Temporal difference learning with interpolated table value functions
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Evolution versus temporal difference learning for learning to play Ms. Pac-Man
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Coevolutionary temporal difference learning for Othello
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Introducing a round robin tournament into Blondie24
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Fuzzy Q-learning in a nondeterministic environment: developing an intelligent Ms. Pac-Man agent
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Facetwise analysis of XCS for problems with class imbalances
IEEE Transactions on Evolutionary Computation
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
International Journal of Applied Mathematics and Computer Science - Selected Problems of Computer Science and Control
ACM Transactions on Embedded Computing Systems (TECS)
Adaptive dynamic programming: an introduction
IEEE Computational Intelligence Magazine
A survey of collaborative filtering techniques
Advances in Artificial Intelligence
Intercluster connection in cognitive wireless mesh networks based on intelligent network coding
EURASIP Journal on Advances in Signal Processing - Special issue on dynamic spectrum access for wireless networking
Multi-agent Q-learning of channel selection in multi-user cognitive radio systems: a two by two case
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Generation of roles in reinforcement learning considering redistribution of reward between agents
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A novel hybrid learning technique applied to a self-learning multi-robot system
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Dimensionality effects on the Markov property in shape memory alloy hysteretic environment
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Learning intialized by topologically correct representation
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A probabilistic fuzzy logic system: learning in the stochastic environment with incomplete dynamics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Real-valued Q-learning in multi-agent cooperation
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Behavioral-fusion control based on reinforcement learning
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Implementation of fuzzy Q-learning based on modular fuzzy model and parallel structured learning
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Prerequesites for symbiotic brain-machine interfaces
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Planning-based prediction for pedestrians
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Consideration on robotic giant-swing motion generated by reinforcement learning
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Decision-theoretic robot guidance for active cooperative perception
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Robot task switching under diminishing returns
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Using eigenposes for lossless periodic human motion imitation
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
A learning approach to integration of layers of a hybrid control architecture
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
From manipulation to communicative gesture
Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
Solving multiconstraint assignment problems using learning automata
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A new mobile robot navigation method using fuzzy logic and a modified Q-learning algorithm
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Rejoinder---The Languages of Stochastic Optimization
INFORMS Journal on Computing
Probability in the Engineering and Informational Sciences
The Knowledge Engineering Review
A framework for the design of a military operational supply network
CISDA'09 Proceedings of the Second IEEE international conference on Computational intelligence for security and defense applications
Online reinforcement learning for dynamic multimedia systems
IEEE Transactions on Image Processing
Reference traces by simulation for tracking control-logic
ETFA'09 Proceedings of the 14th IEEE international conference on Emerging technologies & factory automation
Information Technology and Management
Online adaptive policies for ensemble classifiers
Neurocomputing
Information Systems Research
Induction over Strategic Agents
Information Systems Research
CCMAC: Coordinated cooperative MAC for wireless LANs
Computer Networks: The International Journal of Computer and Telecommunications Networking
A model of portfolio optimization using time adapting genetic network programming
Computers and Operations Research
Managing Adaptive Versatile environments
Pervasive and Mobile Computing
Fuzzy decision tree function approximation in reinforcement learning
International Journal of Artificial Intelligence and Soft Computing
A Least-squares Approach to Direct Importance Estimation
The Journal of Machine Learning Research
Transfer Learning for Reinforcement Learning Domains: A Survey
The Journal of Machine Learning Research
Provably Efficient Learning with Typed Parametric Models
The Journal of Machine Learning Research
RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments
The Journal of Machine Learning Research
Reinforcement Learning in Finite MDPs: PAC Analysis
The Journal of Machine Learning Research
A Convergent Online Single Time Scale Actor Critic Algorithm
The Journal of Machine Learning Research
Bounding the population size in XCS to ensure reproductive opportunities
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Designing efficient exploration with MACS: modules and function approximation
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Simulating sellers' behavior in a reverse auction B2B exchange
ICCS'03 Proceedings of the 2003 international conference on Computational science
Reinforcement learning as a means of dynamic aggregate QoS provisioning
Art-QoS'03 Proceedings of the 2003 international conference on Architectures for quality of service in the internet
Learning and evolution affected by spatial structure
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
An adaptive inventory control model for a supply chain with nonstationary customer demands
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
A learning autonomous driver system on the basis of image classification and evolutional learning
MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition
Model-based least-squares policy evaluation
AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Knowledge discovery and emergent complexity in bioinformatics
KDECB'06 Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics
Integration of genetic programming and reinforcement learning for real robots
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartI
Analyzing parameter sensitivity and classifier representations for real-valued XCS
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Counter example for Q-bucket-brigade under prediction problem
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
An experimental comparison between ATNoSFERES and ACS
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Adaptive value function approximations in classifier systems
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Three architectures for continuous action
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
Parallelizing parallel rollout algorithm for solving Markov decision processes
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
A functional spiking neuron hardware oriented model
IWANN'03 Proceedings of the Artificial and natural neural networks 7th international conference on Computational methods in neural modeling - Volume 1
Reinforcement learning for online control of evolutionary algorithms
ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems
Defending DDoS attacks using hidden Markov models and cooperative reinforcement learning
PAISI'07 Proceedings of the 2007 Pacific Asia conference on Intelligence and security informatics
Unified criterion of state generalization for reactive autonomous agents
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Generating hierarchical structure in reinforcement learning from state variables
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Automatic development of robot behaviour using Monte Carlo methods
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
EvoWorkshops'03 Proceedings of the 2003 international conference on Applications of evolutionary computing
On a dynamical analysis of reinforcement learning in games: emergence of Occam's Razor
CEEMAS'03 Proceedings of the 3rd Central and Eastern European conference on Multi-agent systems
Evolving reinforcement learning-like abilities for robots
ICES'03 Proceedings of the 5th international conference on Evolvable systems: from biology to hardware
Using genetic programming to generate protocol adaptors for interprocess communication
ICES'03 Proceedings of the 5th international conference on Evolvable systems: from biology to hardware
ICANN/ICONIP'03 Proceedings of the 2003 joint international conference on Artificial neural networks and neural information processing
The adaptive web
Heuristic search based exploration in reinforcement learning
IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Learning autonomous behaviours for non-holonomic vehicles
IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
A novel burst assembly algorithm for optical burst switched networks based on learning automata
ONDM'07 Proceedings of the 11th international IFIP TC6 conference on Optical network design and modeling
Decentralized information aggregation and central control in networked production environments
HCI'07 Proceedings of the 12th international conference on Human-computer interaction: applications and services
Computing and using lower and upper bounds for action elimination in MDP planning
SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Model-based exploration in continuous state spaces
SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Active learning of dynamic Bayesian networks in Markov decision processes
SARA'07 Proceedings of the 7th International conference on Abstraction, reformulation, and approximation
Field-based coordination of mobile intelligent agents: an evolutionary game theoretic analysis
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
Reinforcement learning of competitive skills with soccer agents
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
Optimal convergence in multi-agent MDPs
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Reinforcement learning scheme for grouping and anti-predator behavior
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Learning evaluation functions of Shogi positions from different sets of games
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Grounding action-selection in event-based anticipation
ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Evolution and learning in an intrinsically motivated reinforcement learning robot
ECAL'07 Proceedings of the 9th European conference on Advances in artificial life
Efficient learning of neural networks with evolutionary algorithms
Proceedings of the 29th DAGM conference on Pattern recognition
IEEE Transactions on Signal Processing
Virtual markets: Q-learning sellers with simple state representation
AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
On-line agent teamwork training using immunological network model
AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Reinforcement learning-based load shared sequential routing
NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
Online learning of task-driven object-based visual attention control
Image and Vision Computing
Plan-based control of robotic agents: improving the capabilities of autonomous robots
Plan-based control of robotic agents: improving the capabilities of autonomous robots
Posterior weighted reinforcement learning with state uncertainty
Neural Computation
Convergence analysis on approximate reinforcement learning
KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Learning models of relational MDPs using graph kernels
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Reinforcement learning for cooperative actions in a partially observable multi-agent system
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Stochastic weights reinforcement learning for exploratory data analysis
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Cooperation between multiple agents based on partially sharing policy
ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Efficient selectivity and backup operators in Monte-Carlo tree search
CG'06 Proceedings of the 5th international conference on Computers and games
Feature construction for reinforcement learning in hearts
CG'06 Proceedings of the 5th international conference on Computers and games
Skill combination for reinforcement learning
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Clustering with reinforcement learning
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Independent factor reinforcement learning for portfolio management
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
A novel ANN model based on quantum computational MAS theory
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
A novel neural network based reinforcement learning
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
An agent reinforcement learning model based on neural networks
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Reinforcement learning algorithms based on mGA and EA with policy iterations
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Toward perception based computing: a rough-granular perspective
WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
Optimizing walking controllers for uncertain inputs and environments
ACM SIGGRAPH 2010 papers
Reducing trials by thinning-out in skill discovery
DS'07 Proceedings of the 10th international conference on Discovery science
Safe state abstraction and reusable continuing subtasks in hierarchical reinforcement learning
AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Validation of a reinforcement learning policy for dosage optimization of erythropoietin
AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Generalization and transfer learning in noise-affected robot navigation tasks
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Heuristic Q-learning soccer players: a new reinforcement learning approach to RoboCup simulation
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Intelligent farmer agent for multi-agent ecological simulations optimization
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Learning to use a perishable good as money
MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
MABS'06 Proceedings of the 2006 international conference on Multi-agent-based simulation VII
A k-NN based perception scheme for reinforcement learning
EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
A practical learning-based approach for dynamic storage bandwidth allocation
IWQoS'03 Proceedings of the 11th international conference on Quality of service
Semi-supervised speaker identification under covariate shift
Signal Processing
The MACS project: an approach to affordance-inspired robot control
Proceedings of the 2006 international conference on Towards affordance-based robot control
Temporal difference learning and simulated annealing for optimal control: a case study
KES-AMSTA'08 Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications
ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
WCCI'08 Proceedings of the 2008 IEEE world conference on Computational intelligence: research frontiers
Feature discovery in reinforcement learning using genetic programming
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
Opportunistic transmission for wireless sensor networks under delay constraints
ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III
Learning relational options for inductive transfer in relational reinforcement learning
ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Relational macros for transfer in reinforcement learning
ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Building relational world models for reinforcement learning
ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Probabilistic inductive logic programming
The evolution of cognition: from first order to second order embodiment
ZiF'06 Proceedings of the Embodied communication in humans and machines, 2nd ZiF research group international conference on Modeling communication with robots and virtual humans
Seeing the forest despite the trees: large scale spatial-temporal decision making
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Regret-based reward elicitation for Markov decision processes
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Temporal-difference networks for dynamical systems with continuous observations and actions
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Exploring compact reinforcement-learning representations with linear regression
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
A state-cluster based Q-learning
ICNC'09 Proceedings of the 5th international conference on Natural computation
Urban traffic signal learning control using fuzzy actor-critic methods
ICNC'09 Proceedings of the 5th international conference on Natural computation
Using cognition and learning to improve agents' reactions
Adaptive agents and multi-agent systems
Relational reinforcement learning for agents in worlds with objects
Adaptive agents and multi-agent systems
Character animation in two-player adversarial games
ACM Transactions on Graphics (TOG)
2006: celebrating 75 years of AI - history and outlook: the next 25 years
50 years of artificial intelligence
50 years of artificial intelligence
Intrinsically motivated machines
50 years of artificial intelligence
Reward-modulated hebbian learning of decision making
Neural Computation
Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs
Artificial Intelligence
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks
Information Sciences: an International Journal
Smartlocks: lock acquisition scheduling for self-aware synchronization
Proceedings of the 7th international conference on Autonomic computing
Applying reinforcement learning to scheduling strategies in an actual grid environment
International Journal of High Performance Systems Architecture
Reinforcement learning for training a computer program of Chinese chess
International Journal of Intelligent Information and Database Systems
Planning of diverse complex cooperative robot actions using multi-stage genetic algorithm
CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
Use of the knowledge which is independence on reward in reinforcement learning
CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
Opportunistic exploitation of bandwidth resources through reinforcement learning
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
Cooperative communications with relay selection for QoS provisioning in wireless sensor networks
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
On balancing exploration vs. exploitation in a cognitive engine for multi-antenna systems
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
A common-neural-pattern based reasoning for mobile robot cognitive mapping
ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Brain-inspired emergence of behaviors based on the desire for existence by reinforcement learning
ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Improving optimistic exploration in model-free reinforcement learning
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
A cat-like robot real-time learning to run
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Bounds for multistage stochastic programs using supervised learning strategies
SAGA'09 Proceedings of the 5th international conference on Stochastic algorithms: foundations and applications
KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Emulation and behavior understanding through shared values
Robotics and Autonomous Systems
Q-learning for opportunistic spectrum access
Proceedings of the 6th International Wireless Communications and Mobile Computing Conference
Joint path and wavelength selection using Q-learning in optical burst switching networks
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
A multi-agent reinforcement learning approach to path selection in optical burst switching networks
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Needle target-insertion trajectory planning based on reforcement learning expert's skill
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
IEEE Transactions on Evolutionary Computation
Impedance learning for robotic contact tasks using natural actor-critic algorithm
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Monotonicity of constrained optimal transmission policies in correlated fading channels with ARQ
IEEE Transactions on Signal Processing
On-line learning and optimization for wireless video transmission
IEEE Transactions on Signal Processing
A systematic framework for dynamically optimizing multi-user wireless video transmission
IEEE Journal on Selected Areas in Communications
MLeXAI: A Project-Based Application-Oriented Model
ACM Transactions on Computing Education (TOCE)
Autonomous Agents and Multi-Agent Systems
What the 2007 TAC Market Design Game tells us about effective auction mechanisms
Autonomous Agents and Multi-Agent Systems
Efficient vision-based navigation
Autonomous Robots
Finding and transferring policies using stored behaviors
Autonomous Robots
Non-parametric Learning to Aid Path Planning over Slopes
International Journal of Robotics Research
Evolving agent behavior in multiobjective domains using fitness-based shaping
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Neural mechanisms of the mind, Aristotle, Zadeh, and fMRI
IEEE Transactions on Neural Networks
A MDP approach to fault-tolerant routing
WD'09 Proceedings of the 2nd IFIP conference on Wireless days
Exploitation and exploration in a performance based contextual advertising system
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Optimizing debt collections using constrained reinforcement learning
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Error Bounds for Approximations from Projected Linear Equations
Mathematics of Operations Research
Spectrum management of cognitive radio using multi-agent reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Industry track
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
High-level reinforcement learning in strategy games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
To teach or not to teach?: decision making under uncertainty in ad hoc teams
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Frequency adjusted multi-agent Q-learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using spatial hints to improve policy reuse in a reinforcement learning agent
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
PAC-MDP learning with knowledge-based admissible models
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using graph analysis to study networks of adaptive agent
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Optimal policy switching algorithms for reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning multi-agent state space representations
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Strategy generation in multi-agent imperfect-information pursuit games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Cultivating desired behaviour: policy teaching via environment-dynamics tweaks
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
A reward function generation method using genetic algorithms: a robot soccer case study
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Online model learning in adversarial Markov decision processes
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action discovery for reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Model-based direct policy search
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Adaptive Auction Mechanism Design and the Incorporation of Prior Knowledge
INFORMS Journal on Computing
Action selection and task sequence learning for hybrid dynamical cognitive agents
Robotics and Autonomous Systems
Combining active learning and reactive control for robot grasping
Robotics and Autonomous Systems
Adaptive data-aware utility-based scheduling in resource-constrained systems
Journal of Parallel and Distributed Computing
Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
A step toward an adaptive composition of query suggestion approaches
Proceedings of the third symposium on Information interaction in context
MRL-CC: a novel cooperative communication protocol for QoS provisioning in wireless sensor networks
International Journal of Sensor Networks
Learning and Reversal Learning in the Subcortical Limbic System: A Computational Model
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Self-Organizing Sensorimotor Maps Plus Internal Motivations Yield Animal-Like Behavior
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Coordinated learning in multiagent MDPs with infinite state-space
Autonomous Agents and Multi-Agent Systems
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Decision-theoretic design space exploration of multiprocessor platforms
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
A survey of Tactile Human-Robot Interactions
Robotics and Autonomous Systems
Computer Networks: The International Journal of Computer and Telecommunications Networking
Reinforcement learning intellectual agent of protection for adapting to surrounding environment
Proceedings of the 3rd international conference on Security of information and networks
An adaptive link layer for heterogeneous multi-radio mobile sensor networks
IEEE Journal on Selected Areas in Communications - Special issue on simple wireless sensor networking solutions
Model-free control based on reinforcement learning for a wastewater treatment problem
Applied Soft Computing
Reinforcement learning of competitive and cooperative skills in soccer agents
Applied Soft Computing
Learning to adapt to unknown users: referring expression generation in spoken dialogue systems
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Importance-Driven Turn-Bidding for spoken dialogue systems
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Learning to follow navigational directions
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Optimising information presentation for spoken dialogue systems
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Reading between the lines: learning to map high-level instructions to commands
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Near-optimal Regret Bounds for Reinforcement Learning
The Journal of Machine Learning Research
Evolving Static Representations for Task Transfer
The Journal of Machine Learning Research
EA2: The Winning Strategy for the Inaugural Lemonade Stand Game Tournament
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Uncertainty Propagation for Efficient Exploration in Reinforcement Learning
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
The Dynamics of Multi-Agent Reinforcement Learning
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
A NEAT Way for Evolving Echo State Networks
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
ANTIPA: an agent architecture for intelligent information assistance
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
A reinforcement learning with switching controllers for a continuous action space
Artificial Life and Robotics
Artificial Life and Robotics
IEEE Transactions on Neural Networks
Adapting and evaluating distributed real-time and embedded systems in dynamic environments
Proceedings of the First International Workshop on Data Dissemination for Large Scale Complex Critical Infrastructures
Optimizing a new nonlinear reinforcement scheme with Breeder genetic algorithm
NN'10/EC'10/FS'10 Proceedings of the 11th WSEAS international conference on nural networks and 11th WSEAS international conference on evolutionary computing and 11th WSEAS international conference on Fuzzy systems
Motion fields for interactive character locomotion
ACM SIGGRAPH Asia 2010 papers
Proceedings of the 3rd ACM workshop on Artificial intelligence and security
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Cognitive engine design for link adaptation: an application to multi-antenna systems
IEEE Transactions on Wireless Communications
Using reinforcement learning to create communication channel management strategies for diverse users
SLPAT '10 Proceedings of the NAACL HLT 2010 Workshop on Speech and Language Processing for Assistive Technologies
Learning in closed-loop brain-machine interfaces: modeling and experimental validation
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
The neuronal replicator hypothesis
Neural Computation
Functional Optimization Through Semilocal Approximate Minimization
Operations Research
Measuring universal intelligence: Towards an anytime intelligence test
Artificial Intelligence
Photonic Network Communications
A Human-Robot Collaborative Reinforcement Learning Algorithm
Journal of Intelligent and Robotic Systems
Reinforcement learning using Voronoi space division
Artificial Life and Robotics
A study of Q-learning considering negative rewards
Artificial Life and Robotics
Hierarchical reinforcement learning for adaptive text generation
INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Towards a programmable instrumented generator
INLG '10 Proceedings of the 6th International Natural Language Generation Conference
End-to-end stochastic scheduling of scalable video overtime-varying channels
Proceedings of the international conference on Multimedia
Rule acquisition for cognitive agents by using estimation of distribution algorithms
International Journal of Knowledge Engineering and Soft Data Paradigms
Leveling-up in heroes of might and magic III
FUN'10 Proceedings of the 5th international conference on Fun with algorithms
An autonomic testing framework for IPv6 configuration protocols
AIMS'10 Proceedings of the Mechanisms for autonomous management of networks and services, and 4th international conference on Autonomous infrastructure, management and security
An algorithmic game theory study of wholesale electricity markets based on central auction
Integrated Computer-Aided Engineering - Multi-Agent Systems for Energy Management
Agent-based coordination techniques for matching supply and demand in energy networks
Integrated Computer-Aided Engineering - Multi-Agent Systems for Energy Management
Unsupervised learning of background modeling parameters in multicamera systems
Computer Vision and Image Understanding
Learning adaptive referring expression generation policies for spoken dialogue systems
Empirical methods in natural language generation
Natural language generation as planning under uncertainty for spoken dialogue systems
Empirical methods in natural language generation
Time-based reward shaping in real-time strategy games
ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
Multi-policy optimization in self-organizing systems
SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Adaptive ε-greedy exploration in reinforcement learning based on value differences
KI'10 Proceedings of the 33rd annual German conference on Advances in artificial intelligence
Bayesian reasoning for software testing
Proceedings of the FSE/SDP workshop on Future of software engineering research
Learning to coordinate in complex networks
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
EA'09 Proceedings of the 9th international conference on Artificial evolution
AdQL - anomaly detection Q-learning in control multi-queue systems with QoS constraints
KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Three-subagent adapting architecture for fighting videogames
PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
The penalty avoiding rational policy making algorithm in continuous action spaces
IDEAL'10 Proceedings of the 11th international conference on Intelligent data engineering and automated learning
Tug-of-war model for multi-armed bandit problem
UC'10 Proceedings of the 9th international conference on Unconventional computation
From mirror writing to mirror neurons
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Minimal model of strategy switching in the plus-maze navigation task
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Noisy-or nodes for conditioning models
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
TeXDYNA: hierarchical reinforcement learning in factored MDPs
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
A novel information measure for predictive learning in a social system setting
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Anticipation as a strategy: a design paradigm for robotics
KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
Improving reinforcement learning agents using genetic algorithms
AMT'10 Proceedings of the 6th international conference on Active media technology
A model of basal ganglia in saccade generation
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part I
Generating adaptive route instructions using hierarchical reinforcement learning
SC'10 Proceedings of the 7th international conference on Spatial cognition
Evolving a single scalable controller for an octopus arm with a variable number of segments
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Social conformity and its convergence for reinforcement learning
MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evaluation of techniques for a learning-driven modeling methodology in multiagent simulation
MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Smarter sampling in model-based Bayesian reinforcement learning
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Adaptive bases for reinforcement learning
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Evolutionary dynamics of regret minimization
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Hidden Markov model for human decision process in a partially observable environment
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
An incremental probabilistic neural network for regression and reinforcement learning tasks
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Exploring continuous action spaces with diffusion trees for reinforcement learning
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
One-shot supervised reinforcement learning for multi-targeted tasks: RL-SAS
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
An oscillatory neural network model for birdsong learning and generation
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
Incorporating domain models into Bayesian optimization for RL
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
On the potential of process simulation in software project schedule optimization
COMPSAC-W'05 Proceedings of the 29th annual international conference on Computer software and applications conference
Simultaneous learning of perception and action in mobile robots
Robotics and Autonomous Systems
Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging
Journal of Intelligent and Robotic Systems
Reducing reinforcement learning to KWIK online regression
Annals of Mathematics and Artificial Intelligence
Resource-driven mission-phasing techniques for constrained agents in stochastic environments
Journal of Artificial Intelligence Research
A minimum relative entropy principle for learning and acting
Journal of Artificial Intelligence Research
Automatic induction of bellman-error features for probabilistic planning
Journal of Artificial Intelligence Research
Pagerank optimization in polynomial time by stochastic shortest path reformulation
ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Prediction with expert advice under discounted loss
ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Optimality issues of universal greedy agents with static priors
ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Consistency of feature Markov processes
ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Developing strategies for the ART domain
CAEPIA'09 Proceedings of the Current topics in artificial intelligence, and 13th conference on Spanish association for artificial intelligence
Transfer learning via relational templates
ILP'09 Proceedings of the 19th international conference on Inductive logic programming
Policy transfer via Markov logic networks
ILP'09 Proceedings of the 19th international conference on Inductive logic programming
Algorithm selection as a bandit problem with unbounded losses
LION'10 Proceedings of the 4th international conference on Learning and intelligent optimization
Coaching to enhance the online behavior learning of a robotic agent
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part I
ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
Emotion and reinforcement: affective facial expressions facilitate robot learning
ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
An unsupervised, online learning framework for moving object detection
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Reinforcement learning based resource allocation in business process management
Data & Knowledge Engineering
Self-learning fuzzy logic controllers for pursuit-evasion differential games
Robotics and Autonomous Systems
Generalized learning automata for multi-agent reinforcement learning
AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Continuous-state reinforcement learning with fuzzy approximation
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Parallel reinforcement learning with linear function approximation
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Bifurcation analysis of reinforcement learning agents in the Selten's horse game
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Solving multi-stage games with hierarchical learning automata that bootstrap
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Multi-agent reinforcement learning for intrusion detection
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
A new feature for approximate dynamic programming traffic light controller
Proceedings of the Second International Workshop on Computational Transportation Science
Generating three binary addition algorithms using reinforcement programming
Proceedings of the 48th Annual Southeast Regional Conference
Adaptive case-based reasoning using retention and forgetting strategies
Knowledge-Based Systems
Web-based multi-agent system architecture in a dynamic environment
International Journal of Knowledge-based and Intelligent Engineering Systems
User and noise adaptive dialogue management using hybrid system actions
IWSDS'10 Proceedings of the Second international conference on Spoken dialogue systems for ambient environments
Autonomous discovery of subgoals using acyclic state trajectories
ICICA'10 Proceedings of the First international conference on Information computing and applications
Teaching a robot to perform tasks with voice commands
MICAI'10 Proceedings of the 9th Mexican international conference on Advances in artificial intelligence: Part I
On-line adaptive algorithms in autonomic restart control
ATC'10 Proceedings of the 7th international conference on Autonomic and trusted computing
Agent-augmented co-space: toward merging of real world and cyberspace
ATC'10 Proceedings of the 7th international conference on Autonomic and trusted computing
Multiagent Q-learning for aloha-like spectrum access in cognitive radio systems
EURASIP Journal on Wireless Communications and Networking
Adaptation-based programming in java
Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
Improving space representation in multiagent learning via tile coding
SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
Structural knowledge transfer by spatial abstraction for reinforcement learning agents
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
Proceedings of the fourth ACM international conference on Web search and data mining
Human-inspired computational fairness
Autonomous Agents and Multi-Agent Systems
Learning the behavior model of a robot
Autonomous Robots
Expert-driven genetic algorithms for simulating evaluation functions
Genetic Programming and Evolvable Machines
Stochastic control via direct comparison
Discrete Event Dynamic Systems
Evaluating Q-learning policies for multi-objective foraging task in a multi-agent environment
ICIRA'10 Proceedings of the Third international conference on Intelligent robotics and applications - Volume Part II
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
An information-spectrum approach to analysis of return maximization in reinforcement learning
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Adaptive decision making in ant colony system by reinforcement learning
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Multivariate decision tree function approximation for reinforcement learning
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Creating long gait animation sequences through Reinforcement Learning
Proceedings of the 2011 conference on Neural Nets WIRN10: Proceedings of the 20th Italian Workshop on Neural Nets
Improved AP association management using machine learning
ACM SIGMOBILE Mobile Computing and Communications Review
Computer Networks: The International Journal of Computer and Telecommunications Networking
Modeling basal ganglia for understanding parkinsonian reaching movements
Neural Computation
A reinforcement learning framework for answering complex questions
Proceedings of the 16th international conference on Intelligent user interfaces
Learning dialogue strategies from older and younger simulated users
SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Adaptive referring expression generation in spoken dialogue systems: evaluation with real users
SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Gaussian processes for fast policy optimisation of POMDP-based dialogue managers
SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Two coupled neural-networks-based solution of the Hamilton-Jacobi-Bellman equation
Applied Soft Computing
A novel multi-agent reinforcement learning approach for job scheduling in Grid computing
Future Generation Computer Systems
Particle swarm optimization in exploratory data analysis
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
Learning visual representations for perception-action systems
International Journal of Robotics Research
Solving non-stationary bandit problems by random sampling from sibling Kalman filters
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III
Temporal vagueness, coordination and communication
ViC'09 Proceedings of the 2009 international conference on Vagueness in communication
Planning with noisy probabilistic relational rules
Journal of Artificial Intelligence Research
The inverse classification problem
Journal of Computer Science and Technology
Swarm reinforcement learning method based on an actor-critic method
SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
SIMPAR'10 Proceedings of the Second international conference on Simulation, modeling, and programming for autonomous robots
Reduct based Q-learning: an introduction
Proceedings of the 2011 International Conference on Communication, Computing & Security
Studying the emergence of money by means of swarm multi-agent simulation
IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Dynamic reward shaping: training a robot by voice
IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
State representation with perceptual constancy based on active motion
ICSR'10 Proceedings of the Second international conference on Social robotics
Selection of actions for an autonomous social robot
ICSR'10 Proceedings of the Second international conference on Social robotics
A Markovian process modeling for Pickomino
CG'10 Proceedings of the 7th international conference on Computers and games
Enhancements for multi-player Monte-Carlo tree search
CG'10 Proceedings of the 7th international conference on Computers and games
Teacher feedback to scaffold and refine demonstrated motion primitives on a mobile robot
Robotics and Autonomous Systems
Empowerment for continuous agent-environment systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Combining Constraint Programming and Local Search for Job-Shop Scheduling
INFORMS Journal on Computing
A CMOS current-mode dynamic programming circuit
IEEE Transactions on Circuits and Systems Part I: Regular Papers - Special section on 2009 IEEE system-on-chip conference
A Generalized Path Integral Control Approach to Reinforcement Learning
The Journal of Machine Learning Research
Hessian matrix distribution for Bayesian policy gradient reinforcement learning
Information Sciences: an International Journal
Wireless Personal Communications: An International Journal
Wireless Personal Communications: An International Journal
A bionic model of adaptive searching behavior
Journal of Computer and Systems Sciences International
Representing trust in cognitive social simulations
SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
Introduction to special issue on machine learning for adaptivity in spoken dialogue systems
ACM Transactions on Speech and Language Processing (TSLP)
Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager
ACM Transactions on Speech and Language Processing (TSLP)
Spatially-aware dialogue control using hierarchical reinforcement learning
ACM Transactions on Speech and Language Processing (TSLP)
ACM Transactions on Speech and Language Processing (TSLP)
Comparing user simulations for dialogue strategy learning
ACM Transactions on Speech and Language Processing (TSLP)
Modeling spoken decision support dialogue and optimization of its dialogue strategy
ACM Transactions on Speech and Language Processing (TSLP)
Self-organizing state aggregation for architecture design of Q-learning
Information Sciences: an International Journal
A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems
Neural Processing Letters
Nonverbal acoustic communication in human-computer interaction
Artificial Intelligence Review
User Modeling and User-Adapted Interaction
Efficient program generation by evolving graph structures with multi-start nodes
Applied Soft Computing
Learning and using domain-specific heuristics in ASP solvers
AI Communications - Answer Set Programming
Darwinian embodied evolution of the learning ability for survival
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Behavioural analysis in network formation using agent-based simulation systems
International Journal of Knowledge Engineering and Soft Data Paradigms
Sampled fictitious play for approximate dynamic programming
Computers and Operations Research
Noisy reinforcements in reinforcement learning: some case studies based on gridworlds
ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
TELE-INFO'06 Proceedings of the 5th WSEAS international conference on Telecommunications and informatics
Adaptive navigation for autonomous robots
Robotics and Autonomous Systems
Reinforcement learning for joint radio resource management in LTE-UMTS scenarios
Computer Networks: The International Journal of Computer and Telecommunications Networking
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
ICOSSSE'05 Proceedings of the 4th WSEAS/IASME international conference on System science and simulation in engineering
LearnPNP: a tool for learning agent behaviors
RoboCup 2010
A nonlinear reinforcement scheme for stochastic learning automata
MMACTEE'06 Proceedings of the 8th WSEAS international conference on Mathematical methods and computational techniques in electrical engineering
CIMMACS'05 Proceedings of the 4th WSEAS international conference on Computational intelligence, man-machine systems and cybernetics
Knowledge of opposite actions for reinforcement learning
Applied Soft Computing
An educational tool for artificial neural networks
Computers and Electrical Engineering
AutoBlackTest: a tool for automatic black-box testing
Proceedings of the 33rd International Conference on Software Engineering
Short term memories and forcing the re-use of knowledge for generalization
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Stochastic processes for return maximization in reinforcement learning
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
An off-policy natural policy gradient method for a partial observable Markov decision process
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Back-propagation as reinforcement in prediction tasks
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Evolving optimal feature set by interactive reinforcement learning for image retrieval
ISNN'05 Proceedings of the Second international conference on Advances in neural networks - Volume Part II
Generic reinforcement schemes and their optimization
ECC'11 Proceedings of the 5th European conference on European computing conference
Self-adaptive provisioning of virtualized resources in cloud computing
Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Decentralized MDPs with sparse interactions
Artificial Intelligence
Reinforcement learning for model building and variance-penalized control
Winter Simulation Conference
Using genetic algorithms to limit the optimism in time warp
Winter Simulation Conference
Winter Simulation Conference
Balancing exploration and exploitation in learning to rank online
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
ICANNGA'11 Proceedings of the 10th international conference on Adaptive and natural computing algorithms - Volume Part II
Fault oblivious high performance computing with dynamic task replication and substitution
Computer Science - Research and Development
Smart data structures: an online machine learning approach to multicore data structures
Proceedings of the 8th ACM international conference on Autonomic computing
Decision making in autonomic computing systems: comparison of approaches and techniques
Proceedings of the 8th ACM international conference on Autonomic computing
Using reinforcement learning for controlling an elastic web application hosting platform
Proceedings of the 8th ACM international conference on Autonomic computing
Proceedings of the 2011 workshop on Organic computing
A framework of intentional characters for simulation of social behavior
Proceedings of the 2010 Summer Computer Simulation Conference
Automatic abstraction and fault tolerance in cortical microachitectures
Proceedings of the 38th annual international symposium on Computer architecture
FQL-RED: an adaptive scalable schema for active queue management
International Journal of Network Management
Use of infeasible individuals in probabilistic model building genetic network programming
Proceedings of the 13th annual conference on Genetic and evolutionary computation
On the relationships between synaptic plasticity and generative systems
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Policy learning in resource-constrained optimization
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Evolution of reward functions for reinforcement learning
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Evolution for modeling: a genetic programming framework for sesam
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Learning to win by reading manuals in a Monte-Carlo framework
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Optimization of heuristic search using recursive algorithm selection and reinforcement learning
Annals of Mathematics and Artificial Intelligence
A dynamic programming strategy to balance exploration and exploitation in the bandit problem
Annals of Mathematics and Artificial Intelligence
Adaptive co-construction of state and action spaces in reinforcement learning
Artificial Life and Robotics
Self-adaptive provisioning of virtualized resources in cloud computing
ACM SIGMETRICS Performance Evaluation Review - Performance evaluation review
Planning with incomplete information
MoChArt'10 Proceedings of the 6th international conference on Model checking and artificial intelligence
Training neural networks to play backgammon variants using reinforcement learning
EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Learning chasing behaviours of non-player characters in games using SARSA
EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Adaptive kernel-width selection for kernel-based least-squares policy iteration algorithm
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Concurrent modular Q-learning with local rewards on linked multi-component robotic systems
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Study of a multi-robot collaborative task through reinforcement learning
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Reinforcement learning techniques for the control of wastewater treatment plants
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Journal of Computational Methods in Sciences and Engineering - Intelligent Systems and Knowledge Management (Part II)
Semi-automatic end-user tools for construction of virtual avatar behaviors
Proceedings of the 16th International Conference on 3D Web Technology
Selecting Simulation Algorithm Portfolios by Genetic Algorithms
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
A Multi-State Q-Learning Approach for the Dynamic Load Balancing of Time Warp
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Dynamic game difficulty balancing for backgammon
Proceedings of the 49th Annual Southeast Regional Conference
Non-deterministic policies in Markovian decision processes
Journal of Artificial Intelligence Research
A Monte-Carlo AIXI approximation
Journal of Artificial Intelligence Research
Learning in minority games with multiple resources
ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part II
A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
The Journal of Machine Learning Research
The Journal of Machine Learning Research
Exploiting Best-Match Equations for Efficient Reinforcement Learning
The Journal of Machine Learning Research
On-line classification of data streams with missing values based on reinforcement learning
IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Using artificial intelligence techniques for strategy generation in the commons game
HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part I
Empirical study of Q-learning based elemental hose transport control
HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Towards concurrent Q-learning on linked multi-component robotic systems
HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
HCII'11 Proceedings of the 1st international conference on Human interface and the management of information: interacting with information - Volume Part II
ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part I
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Voting in multi-agent system for improvement of partial observations
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
An agent-based approach to the dynamic price problem
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Proceedings of the 48th Design Automation Conference
Dynamic thermal management for multimedia applications using machine learning
Proceedings of the 48th Design Automation Conference
Experimental evaluation of automatic hint generation for a logic tutor
AIED'11 Proceedings of the 15th international conference on Artificial intelligence in education
Learning culture-specific dialogue models from non culture-specific data
UAHCI'11 Proceedings of the 6th international conference on Universal access in human-computer interaction: users diversity - Volume Part II
Multiagent reactive plan application learning in dynamic environments
Proceedings of the 15th WSEAS international conference on Computers
A distributed reinforcement learning approach for solving optimization problems
CIT'11 Proceedings of the 5th WSEAS international conference on Communications and information technology
Theoretical considerations of potential-based reward shaping for multi-agent systems
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Evolving subjective utilities: Prisoner's Dilemma game examples
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Information systems in modeling interactive computations on granules
Theoretical Computer Science
Empirical evaluation of ad hoc teamwork in the pursuit domain
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Integrating reinforcement learning with human demonstrations of varying ability
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Metric learning for reinforcement learning agents
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Argumentation-based reasoning in agents with varying degrees of trust
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Sequential constant size compressors for reinforcement learning
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Comparing humans and AI agents
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Compression and intelligence: social environments and communication
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Reinforcement learning and the Bayesian control rule
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
AGI and neuroscience: open sourcing the brain
AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
Investigation in transfer learning: better way to apply transfer learning between agents
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Preference-based policy learning
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Datum-wise classification: a sequential approach to sparsity
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Lagrange dual decomposition for finite horizon Markov decision processes
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Reinforcement learning through global stochastic search in N-MDPs
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Sparse Kernel-SARSA(λ) with an eligibility trace
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Solving delayed coordination problems in MAS
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Agent-based resource allocation in dynamically formed CubeSat constellations
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Agent sensing with stateful resources
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Journal of Intelligent and Robotic Systems
TAROS'11 Proceedings of the 12th Annual conference on Towards autonomous robotic systems
Real-world reinforcement learning for autonomous humanoid robot charging in a home environment
TAROS'11 Proceedings of the 12th Annual conference on Towards autonomous robotic systems
Heliza: talking dirty to the attackers
Journal in Computer Virology
On the Curse of Dimensionality in Supervised Learning of Smooth Regression Functions
Neural Processing Letters
Personalized pricing recommender system: multi-stage epsilon-greedy approach
Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems
Economic learning for thermal-aware power budgeting in many-core architectures
CODES+ISSS '11 Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
A framework for Resource-Aware Data Accumulation in sparse wireless sensor networks
Computer Communications
Study of SOM-based intelligent multi-controller for real-time scheduling
Applied Soft Computing
Ensemble methods for reinforcement learning with function approximation
MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Policy gradient reinforcement learning with environmental dynamics and action-values in policies
KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part I
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part I
Agent-based system with learning capabilities for transport problems
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Evolving equilibrium policies for a multiagent reinforcement learning problem with state attractors
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Modeling agents and agent systems
Transactions on computational collective intelligence V
International Journal of Computer Games Technology
The Effect of Robust Decisions on the Cost of Uncertainty in Military Airlift Operations
ACM Transactions on Modeling and Computer Simulation (TOMACS)
On-line regression algorithms for learning mechanical models of robots: A survey
Robotics and Autonomous Systems
Strategic points to minimize time cost for decision making under asynchronous time constraints
WISS'10 Proceedings of the 2010 international conference on Web information systems engineering
Reinforcement learning for context aware segmentation
MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III
Principled methods for biasing reinforcement learning agents
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Balancing exploration and exploitation ratio in reinforcement learning
Proceedings of the 2011 Military Modeling & Simulation Symposium
Events, neural systems and time series
ServiceWave'10 Proceedings of the 2010 international conference on Towards a service-based internet
SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
Deviations of stochastic bandit regret
ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
Universal knowledge-seeking agents
ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
On the power of global reward signals in reinforcement learning
MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Learning complex concepts using crowdsourcing: a Bayesian approach
ADT'11 Proceedings of the Second international conference on Algorithmic decision theory
Value-difference based exploration: adaptive control between epsilon-greedy and softmax
KI'11 Proceedings of the 34th Annual German conference on Advances in artificial intelligence
EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
Market-based dynamic task allocation using heuristically accelerated reinforcement learning
EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
Policy invariance under reward transformations for general-sum stochastic games
Journal of Artificial Intelligence Research
A Zeroth-Level Classifier System for Real Time Strategy Games
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Emotion-based intrinsic motivation for reinforcement learning agents
ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
A probabilistic method for inferring preferences from clicks
Proceedings of the 20th ACM international conference on Information and knowledge management
A self-adaptive routing paradigm for wireless mesh networks based on reinforcement learning
Proceedings of the 14th ACM international conference on Modeling, analysis and simulation of wireless and mobile systems
Self-teaching adaptive dynamic programming for Gomoku
Neurocomputing
Computers and Operations Research
On the complexity of policy iteration
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Approximate planning for factored POMDPs using belief state simplification
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Solving POMDPs by searching the space of finite policies
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning finite-state controllers for partially observable environments
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Qualitative MDPs and POMDPs: an order-of-magnitude approximation
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
PEGASUS: a policy search method for large MDPs and POMDPs
UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
Learning to cooperate via policy search
UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
The optimal reward baseline for gradient-based reinforcement learning
UAI'01 Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence
Evaluating a reinforcement learning algorithm with a general intelligence test
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Journal of Cognitive Neuroscience
Vigor in the face of fluctuating rates of reward: An experimental examination
Journal of Cognitive Neuroscience
Convergence Rates of Efficient Global Optimization Algorithms
The Journal of Machine Learning Research
Robust Approximate Bilinear Programming for Value Function Approximation
The Journal of Machine Learning Research
The application of learning algorithms in the development of natural interaction
Procedings of the Second Conference on Creativity and Innovation in Design
Quantum reinforcement learning
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
Learning in BDI multi-agent systems
CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
The apriori stochastic dependency detection (ASDD) algorithm for learning stochastic logic rules
CLIMA IV'04 Proceedings of the 4th international conference on Computational Logic in Multi-Agent Systems
Behavior recognition and opponent modeling for adaptive table soccer playing
KI'05 Proceedings of the 28th annual German conference on Advances in Artificial Intelligence
Adaptive Scheduling on Power-Aware Managed Data-Centers Using Machine Learning
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Handling camera movement constraints in reinforcement learning based active object recognition
DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Cooperative behavior of agents based on potential field
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
A multi-agent fuzzy-reinforcement learning method for continuous domains
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
An adaptive approach for the exploration-exploitation dilemma for learning agents
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
General discounting versus average reward
ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Is there an elegant universal theory of prediction?
ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Asymptotic learnability of reinforcement problems with arbitrary dependence
ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Probabilistic generalization of simple grammars and its application to reinforcement learning
ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
BRA: An Algorithm for Simulating Bounded Rational Agents
Computational Economics
Intrinsically motivated intelligent sensed environments
EG-ICE'06 Proceedings of the 13th international conference on Intelligent Computing in Engineering and Architecture
A novel self-organizing neural fuzzy network for automatic generation of fuzzy inference systems
ISNN'05 Proceedings of the Second international conference on Advances in Neural Networks - Volume Part I
Applying neural network to reinforcement learning in continuous spaces
ISNN'05 Proceedings of the Second international conference on Advances in Neural Networks - Volume Part I
Task-Driven discretization of the joint space of visual percepts and continuous actions
ECML'06 Proceedings of the 17th European conference on Machine Learning
Patching approximate solutions in reinforcement learning
ECML'06 Proceedings of the 17th European conference on Machine Learning
Skill acquisition via transfer learning and advice taking
ECML'06 Proceedings of the 17th European conference on Machine Learning
Reinforcement learning for MDPs with constraints
ECML'06 Proceedings of the 17th European conference on Machine Learning
Efficient non-linear control through neuroevolution
ECML'06 Proceedings of the 17th European conference on Machine Learning
Scaling model-based average-reward reinforcement learning for product delivery
ECML'06 Proceedings of the 17th European conference on Machine Learning
Improvement of systems management policies using hybrid reinforcement learning
ECML'06 Proceedings of the 17th European conference on Machine Learning
A sparse kernel-based least-squares temporal difference algorithm for reinforcement learning
ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part I
Unique state and automatical action abstracting based on logical MDPs with negation
ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II
Analyzing fault monitoring policy for hierarchical network with MMDP environment
FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Using meta-level control with reinforcement learning to improve the performance of the agents
FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Testing probabilistic equivalence through reinforcement learning
FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
Cognitive agents for sense and respond logistics
DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Opponent learning for multi-agent system simulation
RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Adaptive learning in complex trade networks
SEAL'06 Proceedings of the 6th international conference on Simulated Evolution And Learning
Context adaptive self-configuration system
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part III
Performance bounds for mobile cellular networks with handover prediction
MMNS'05 Proceedings of the 8th international conference on Management of Multimedia Networks and Services
Learning to segment document images
PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
Ensemble pruning using reinforcement learning
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Monte Carlo matrix inversion policy evaluation
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Natural inspiration for artificial adaptivity: some neurocomputing experiences in robotics
UC'05 Proceedings of the 4th international conference on Unconventional Computation
A tutoring system for commercial games
ICEC'05 Proceedings of the 4th international conference on Entertainment Computing
An RLS-based natural actor-critic algorithm for locomotion of a two-linked robot arm
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
An autonomous mobile robot based on quantum algorithm
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Global versus local constructive function approximation for on-line reinforcement learning
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Structural abstraction experiments in reinforcement learning
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Adaptive utility-based scheduling in resource-constrained systems
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
COLT'06 Proceedings of the 19th annual conference on Learning Theory
MABS'04 Proceedings of the 2004 international conference on Multi-Agent and Multi-Agent-Based Simulation
An architecture for multi-agent based self-adaptive system in mobile environment
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Effect of synthetic emotions on agents’ learning speed and their survivability
ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Valency for adaptive homeostatic agents: relating evolution and learning
ECAL'05 Proceedings of the 8th European conference on Advances in Artificial Life
Fast reinforcement learning of dialogue policies using stable function approximation
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Investigation of evolving populations of adaptive agents
ICANN'05 Proceedings of the 15th international conference on Artificial Neural Networks: biological Inspirations - Volume Part I
An analytic research on secondary-spectrum trading mechanisms based on technical and market changes
Computer Networks: The International Journal of Computer and Telecommunications Networking
URL: A unified reinforcement learning approach for autonomic cloud management
Journal of Parallel and Distributed Computing
Robotic grasping and manipulation through human visuomotor learning
Robotics and Autonomous Systems
A hybrid learning strategy for discovery of policies of action
IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
An analysis of the different components of the anthocnet routing algorithm
ANTS'06 Proceedings of the 5th international conference on Ant Colony Optimization and Swarm Intelligence
Machine learning for spoken dialogue management: an experiment with speech-based database querying
AIMSA'06 Proceedings of the 12th international conference on Artificial Intelligence: methodology, Systems, and Applications
Mining paths of complex crowd scenes
ISVC'05 Proceedings of the First international conference on Advances in Visual Computing
AlchemistJ: a framework for self-adaptive software
EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
An intelligent adaptation system based on a self-growing engine
EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Aspects of optimal viewpoint selection and viewpoint fusion
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part II
Grey reinforcement learning for incomplete information processing
TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Self-organizing neural architecture for reinforcement learning
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
On the efficient implementation biologic reinforcement learning using eligibility traces
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Q-Learning with FCMAC in multi-agent cooperation
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Reinforcement learning-based tuning algorithm applied to fuzzy identification
ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
On the selection of a transversal to solve nonlinear systems with interval arithmetic
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Discovery of stable peers in a self-organising peer-to-peer gradient topology
DAIS'06 Proceedings of the 6th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems
Teamwork formation for keepaway in robotics soccer (reinforcement learning approach)
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Multiagent reinforcement learning for a planetary exploration multirobot system
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Selecting actions for resource-bounded information extraction using reinforcement learning
Proceedings of the fifth ACM international conference on Web search and data mining
A multiagent approach to managing air traffic flow
Autonomous Agents and Multi-Agent Systems
Intelligent Service Robotics
Sequentially optimal repeated coalition formation under uncertainty
Autonomous Agents and Multi-Agent Systems
Neuroevolution with manifold learning for playing Mario
International Journal of Bio-Inspired Computation
Optimal tuning of continual online exploration in reinforcement learning
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Feature extraction for decision-theoretic planning in partially observable environments
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Reinforcement learning with echo state networks
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Nearly optimal exploration-exploitation decision thresholds
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
A neural network module with pretuning for search and reproduction of input-output mapping
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
On the relation of slow feature analysis and laplacian eigenmaps
Neural Computation
The equilibrium of agent mind: the balance between agent theories and practice
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Quantitative µ-calculus analysis of power management in wireless networks
ICTAC'06 Proceedings of the Third international conference on Theoretical Aspects of Computing
Bounded rational search for on-the-fly model checking of LTL properties
FSEN'09 Proceedings of the Third IPM international conference on Fundamentals of Software Engineering
Multiple overlapping tiles for contextual monte carlo tree search
EvoApplicatons'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part I
An adaptive mobile system using mobile grid computing in wireless network
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Rough sets and higher order vagueness
RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part I
Behavioral pattern identification through rough set modelling
RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
Towards finite-sample convergence of direct reinforcement learning
ECML'05 Proceedings of the 16th European conference on Machine Learning
ECML'05 Proceedings of the 16th European conference on Machine Learning
ECML'05 Proceedings of the 16th European conference on Machine Learning
Using advice to transfer knowledge acquired in one reinforcement learning task to another
ECML'05 Proceedings of the 16th European conference on Machine Learning
The investigation of the agent in the artificial market
AIS'04 Proceedings of the 13th international conference on AI, Simulation, and Planning in High Autonomy Systems
Optimising natural language generation decision making for situated dialogue
SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
ACM Transactions on Applied Perception (TAP)
Automatic optimization of web recommendations using feedback and ontology graphs
ICWE'05 Proceedings of the 5th international conference on Web Engineering
The novel feature selection method based on emotion recognition system
ICIC'06 Proceedings of the 2006 international conference on Computational Intelligence and Bioinformatics - Volume Part III
Multiobjective water pinch analysis of the cuernavaca city water distribution network
EMO'05 Proceedings of the Third international conference on Evolutionary Multi-Criterion Optimization
Learning action sequences through imitation in behavior based architectures
ARCS'05 Proceedings of the 18th international conference on Architecture of Computing Systems conference on Systems Aspects in Organic and Pervasive Computing
Adaptive modeling: an approach and a method for implementing adaptive agents
MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
Agent based decision support system using reinforcement learning under emergency circumstances
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
A virtual reality platform for modeling cognitive development
Biomimetic Neural Learning for Intelligent Robots
Reinforcement learning using a grid based function approximator
Biomimetic Neural Learning for Intelligent Robots
Spatial representation and navigation in a bio-inspired robot
Biomimetic Neural Learning for Intelligent Robots
Autonomous vehicle steering based on evaluative feedback by reinforcement learning
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Cost integration in multi-step viewpoint selection for object recognition
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Abstract policy evaluation for reactive agents
SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Function approximation via tile coding: automating parameter choice
SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Feature-Discovering approximate value iteration methods
SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Multiagent association rules mining in cooperative learning systems
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
CBR for state value function approximation in reinforcement learning
ICCBR'05 Proceedings of the 6th international conference on Case-Based Reasoning Research and Development
Evolving small-board Go players using coevolutionary temporal difference learning with archives
International Journal of Applied Mathematics and Computer Science
K-Shortest paths q-routing: a new QoS routing algorithm in telecommunication networks
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
A genetic approach to data dimensionality reduction using a special initial population
IWINAC'05 Proceedings of the First international work-conference on the Interplay Between Natural and Artificial Computation conference on Artificial Intelligence and Knowledge Engineering Applications: a bioinspired approach - Volume Part II
Reinforcement learning based on multi-agent in robocup
ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Evolving agent societies through imitation controlled by artificial emotions
ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Nonlinear prediction by reinforcement learning
ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I
Enhanced therapeutic interactivity using social robot Zeno
Proceedings of the 4th International Conference on PErvasive Technologies Related to Assistive Environments
A combined reactive and reinforcement learning controller for an autonomous tracked vehicle
Robotics and Autonomous Systems
The design and implementation of SAMIR
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Cognitive hybrid reasoning intelligent agent system
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Learning plans with patterns of actions in bounded-rational agents
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Finding hidden hierarchy in reinforcement learning
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
A neurobiologically motivated model for self-organized learning
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Hybrid fuzzy/expert system to control grasping with deformation detection
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Adaptive neuro-fuzzy-expert controller of a robotic gripper
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Enhancing the automatic generation of hints with expert seeding
ITS'10 Proceedings of the 10th international conference on Intelligent Tutoring Systems - Volume Part II
A dynamic allocation method of basis functions in reinforcement learning
AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Error bounds in reinforcement learning policy evaluation
AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Stabilising hebbian learning with a third factor in a food retrieval task
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
An adaptive robot motivational system
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Incremental skill acquisition for self-motivated learning animats
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
A model of reaching that integrates reinforcement learning and population encoding of postures
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Experimental study on task teaching to real rats through interaction with a robotic rat
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Simbad: an autonomous robot simulation package for education and research
SAB'06 Proceedings of the 9th international conference on From Animals to Animats: simulation of Adaptive Behavior
Self-organizing relays in LTE networks: queuing analysis and algorithms
Proceedings of the 7th International Conference on Network and Services Management
Reinforcement learning by chaotic exploration generator in target capturing task
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Automatic extraction system of a kidney region based on the q-learning
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Time does not always buy quality in co-evolutionary learning
SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Inducing effective pedagogical strategies using learning context features
UMAP'10 Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
MoCoA: customisable middleware for context-aware mobile applications
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques
Hierarchical neuro-fuzzy models based on reinforcement learning for intelligent agents
IWANN'05 Proceedings of the 8th international conference on Artificial Neural Networks: computational Intelligence and Bioinspired Systems
Learning teleoreactive logic programs from problem solving
ILP'05 Proceedings of the 15th international conference on Inductive Logic Programming
Learning multi-modal control programs
HSCC'05 Proceedings of the 8th international conference on Hybrid Systems: computation and control
ITS'10 Proceedings of the 10th international conference on Intelligent Tutoring Systems - Volume Part I
Adaptive scalable video streaming in wireless networks
Proceedings of the 3rd Multimedia Systems Conference
Data mining techniques for robocup soccer agents
AIS-ADM 2005 Proceedings of the 2005 international conference on Autonomous Intelligent Systems: agents and Data Mining
Modeling the brain's operating system
BVAI'05 Proceedings of the First international conference on Brain, Vision, and Artificial Intelligence
Artificial Life and Robotics
Robot learning from demonstration by constructing skill trees
International Journal of Robotics Research
International Journal of Robotics Research
A review of long-term memory in natural and synthetic systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Generating inspiration for agent design by reinforcement learning
Information and Software Technology
Dynamic cooperator selection in cognitive radio networks
Ad Hoc Networks
Learning to negotiate optimally in non-stationary environments
CIA'06 Proceedings of the 10th international conference on Cooperative Information Agents
Learning-Based spectrum selection in cognitive radio ad hoc networks
WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Multi-agent case-based reasoning for cooperative reinforcement learners
ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
A cooperation online reinforcement learning approach in ant-q
ICONIP'06 Proceedings of the 13 international conference on Neural Information Processing - Volume Part I
The interactive feature selection method development for an ANN based emotion recognition system
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Intelligent pairing assistant for air operation centers
Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
Learning automata-based approach to learn dialogue policies in large state space
International Journal of Intelligent Information and Database Systems
Emergent consensus in decentralised systems using collaborative reinforcement learning
Self-star Properties in Complex Information Systems
A multi-agent approach to controlling a smart environment
Designing Smart Homes
Rough sets and vague concept approximation: from sample approximation to adaptive learning
Transactions on Rough Sets V
Adaptive stock trading with dynamic asset allocation using reinforcement learning
Information Sciences: an International Journal
Dynamic alternation of primate response properties during trial-and-error knowledge updating
Robotics and Autonomous Systems
Adaptive fraud detection using benford's law
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Partial local friendq multiagent learning: application to team automobile coordination problem
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Trace equivalence characterization through reinforcement learning
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Adaptive critic neural networks for identification of wheeled mobile robot
ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Efficient ant reinforcement learning using replacing eligibility traces
ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
A distributed learning control system for elevator groups
ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Online testing with reinforcement learning
FATES'06/RV'06 Proceedings of the First combined international conference on Formal Approaches to Software Testing and Runtime Verification
A time-frame based trust model for p2p systems
ICISC'06 Proceedings of the 9th international conference on Information Security and Cryptology
Abstraction and generalization in reinforcement learning: a summary and framework
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Replicator dynamics for multi-agent learning: an orthogonal approach
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Recursive adaptation of stepsize parameter for non-stationary environments
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Coordinating learning agents for multiple resource job scheduling
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Effectiveness of considering state similarity for reinforcement learning
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Imitating inscrutable enemies: learning from stochastic policy observation, retrieval and reuse
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
A general introspective reasoning approach to web search for case adaptation
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part I
Efficient deep web crawling using reinforcement learning
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
A reinforcement learning approach for the flexible job shop scheduling problem
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Learning heuristic policies – a reinforcement learning problem
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Teaching a robot to perform task through imitation and on-line feedback
CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Adaption of stepsize parameter using newton's method
PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
Stochastic abstract policies for knowledge transfer in robotic navigation tasks
MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Three automated stock-trading agents: a comparative study
AAMAS'04 Proceedings of the 6th AAMAS international conference on Agent-Mediated Electronic Commerce: theories for and Engineering of Distributed Mechanisms and Systems
IICS'04 Proceedings of the 4th international conference on Innovative Internet Community Systems
Experimentation system for efficient job performing in veterinary medicine area
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part IV
*-MINIMAX performance in backgammon
CG'04 Proceedings of the 4th international conference on Computers and Games
Reinforcement distribution in continuous state action space fuzzy Q–learning: a novel approach
WILF'05 Proceedings of the 6th international conference on Fuzzy Logic and Applications
EURO-NGI'05 Proceedings of the Second international conference on Wireless Systems and Network Architectures in Next Generation Internet
An overview of cooperative and competitive multiagent learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning automata as a basis for multi agent reinforcement learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Dealing with errors in a cooperative multi-agent learning system
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Sift and sort: climbing the semantic pyramid
ESOA'05 Proceedings of the Third international conference on Engineering Self-Organising Systems
Actor-Critic algorithm based on incremental least-squares temporal difference with eligibility trace
ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
A multi-agent reinforcement learning with weighted experience sharing
ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
Adaptive and non-adaptive distribution functions for DSA
PRIMA'10 Proceedings of the 13th international conference on Principles and Practice of Multi-Agent Systems
Learning form experience: a bayesian network based reinforcement learning approach
ICICA'11 Proceedings of the Second international conference on Information Computing and Applications
Adaptive multi-robot team reconfiguration using a policy-reuse reinforcement learning approach
AAMAS'11 Proceedings of the 10th international conference on Advanced Agent Technology
Exploration strategies for learning in multi-agent foraging
SEMCCO'11 Proceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part II
Admission control policies for a multi-class QoS-aware service oriented architecture
ACM SIGMETRICS Performance Evaluation Review
Tactile Guidance for Policy Adaptation
Foundations and Trends in Robotics
A new class of ε-optimal learning automata
ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing
Homeokinetic reinforcement learning
PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Co-learning segmentation in marketplaces
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Basis function discovery using spectral clustering and bisimulation metrics
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Solving sparse delayed coordination problems in multi-agent reinforcement learning
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Adaptive information presentation for spoken dialogue systems: evaluation with human subjects
ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
TATM: a trust mechanism for social traders in double auctions
AI'11 Proceedings of the 24th international conference on Advances in Artificial Intelligence
A novel crawling algorithm for web pages
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Information Sciences: an International Journal
Coverage rewarded: Test input generation via adaptation-based programming
ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
Application-aware dynamic spectrum access
Wireless Networks
Autonomous Agents and Multi-Agent Systems
Improving behavior of computer game bots using fictitious play
International Journal of Automation and Computing
Stochastic enforced hill-climbing
Journal of Artificial Intelligence Research
A reinforcement learning framework for spiking networks with dynamic synapses
Computational Intelligence and Neuroscience
ORACLE: Mobility control in wireless sensor and actor networks
Computer Communications
TIRAMOLA: elastic nosql provisioning through a cloud management platform
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Agile strategic information systems based on axiomatic agent architecture
iUBICOM'11 Proceedings of the 6th international conference on Ubiquitous and Collaborative Computing
Multifaceted web services: an approach to secure and scalable grid scheduling
EuroWeb'02 Proceedings of the 2002 international conference on EuroWeb
Enabling opportunistic and dynamic spectrum access through learning techniques
Wireless Communications & Mobile Computing
Computational Intelligence
Reinforcement learning as heuristic for action-rule preferences
ProMAS'10 Proceedings of the 8th international conference on Programming Multi-Agent Systems
Probabilistic argumentation frameworks
TAFA'11 Proceedings of the First international conference on Theory and Applications of Formal Argumentation
Self-Organizing reinforcement learning model
ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Centralized and distributed task allocation in multi-robot teams via a stochastic clustering auction
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Multi-agent framework for real-time processing of large and dynamic search spaces
Proceedings of the 27th Annual ACM Symposium on Applied Computing
HIS'12 Proceedings of the First international conference on Health Information Science
Adaptive optimal control without weight transport
Neural Computation
The successor representation and temporal context
Neural Computation
Tracking the evolution of cooperation in complex networked populations
EvoBIO'12 Proceedings of the 10th European conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics
Monte-Carlo swarm policy search
SIDE'12 Proceedings of the 2012 international conference on Swarm and Evolutionary Computation
A competitive strategy for function approximation in Q-learning
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Using cases as heuristics in reinforcement learning: a transfer learning application
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Q-error as a selection mechanism in modular reinforcement-learning systems
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Sample efficient on-line learning of optimal dialogue policies with kalman temporal differences
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Risk-sensitive policies for sustainable renewable resource allocation
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Non-linear Monte-Carlo search in civilization II
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Integrated learning for goal-driven autonomy
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Integrating learning into a BDI Agent for environments with changing dynamics
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Regret minimization in multiplayer extensive games
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives
Robotics and Autonomous Systems
Integrating particle swarm optimization with reinforcement learning in noisy problems
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Hierarchical task decomposition through symbiosis in reinforcement learning
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Sample aware embedded feature selection for reinforcement learning
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Two-cornered learning classifier systems for pattern generation and classification
Proceedings of the 14th annual conference on Genetic and evolutionary computation
CMA-TWEANN: efficient optimization of neural networks via self-adaptation and seamless augmentation
Proceedings of the 14th annual conference on Genetic and evolutionary computation
International Journal of Artificial Intelligence in Education - Special issue on Best of ITS 2010
Enhancing the automatic generation of hints with expert seeding
International Journal of Artificial Intelligence in Education - Special issue on Best of ITS 2010
Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Multiple levels of spatial organization: World Graphs and spatial difference learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Bisimulation Metrics for Continuous Markov Decision Processes
SIAM Journal on Computing
The Knowledge Gradient Algorithm for a General Class of Online Learning Problems
Operations Research
Automatic discovery of ranking formulas for playing with multi-armed bandits
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Goal-Directed online learning of predictive models
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Gradient based algorithms with loss functions and kernels for improved on-policy control
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Feature reinforcement learning in practice
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reinforcement learning with a bilinear q function
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
ℓ1-Penalized projected bellman residual
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Regularized least squares temporal difference learning with nested ℓ2 and ℓ1 penalization
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Recursive least-squares learning with eligibility traces
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Value function approximation through sparse bayesian modeling
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Automatic construction of temporally extended actions for MDPs using bisimulation metrics
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transferring evolved reservoir features in reinforcement learning tasks
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transfer learning via multiple inter-task mappings
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Multi-Task reinforcement learning: shaping and feature selection
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Batch, off-policy and model-free apprenticeship learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
MapReduce for parallel reinforcement learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Compound reinforcement learning: theory and an application to finance
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Fuzzy epoch-incremental reinforcement learning algorithm
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part I
DCOPs and bandits: exploration and exploitation in decentralised coordination
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
V-MAX: tempered optimism for better PAC reinforcement learning
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Reinforcement learning transfer via sparse coding
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Decentralized Bayesian reinforcement learning for online agent collaboration
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Reinforcement learning from simultaneous human and MDP reward
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Transfer in reinforcement learning via shared features
The Journal of Machine Learning Research
Integrating a partial model into model free reinforcement learning
The Journal of Machine Learning Research
Optimistic Bayesian sampling in contextual-bandit problems
The Journal of Machine Learning Research
Memory formation, consolidation, and forgetting in learning agents
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Dynamic channel selection with reinforcement learning for cognitive WLAN over fiber
International Journal of Communication Systems
Strategy-Based learning through communication with humans
KES-AMSTA'12 Proceedings of the 6th KES international conference on Agent and Multi-Agent Systems: technologies and applications
Community of scientist optimization: An autonomy oriented approach to distributed optimization
AI Communications - 18th RCRA International Workshop on “Experimental evaluation of algorithms for solving problems with combinatorial explosion”
Selecting vision operators and fixing their optimal parameters values using reinforcement learning
ICISP'12 Proceedings of the 5th international conference on Image and Signal Processing
Decentralised reinforcement learning for energy-efficient scheduling in wireless sensor networks
International Journal of Communication Networks and Distributed Systems
Beyond reward: the problem of knowledge and data
ILP'11 Proceedings of the 21st international conference on Inductive Logic Programming
Multiagent learning through neuroevolution
WCCI'12 Proceedings of the 2012 World Congress conference on Advances in Computational Intelligence
A novel feature sparsification method for kernel-based approximate policy iteration
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A rapid sparsification method for kernel machines in approximate policy iteration
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
A modular hierarchical reinforcement learning algorithm
ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach
Fundamenta Informaticae - Swarm Intelligence
Single-player Monte-Carlo tree search for SameGame
Knowledge-Based Systems
A New Architecture for Learning Classifier Systems to Solve POMDP Problems
Fundamenta Informaticae
Optimal radio channel recommendations with explicit and implicit feedback
Proceedings of the sixth ACM conference on Recommender systems
Rough Set Approach to Behavioral Pattern Identification
Fundamenta Informaticae - New Frontiers in Scientific Discovery - Commemorating the Life and Work of Zdzislaw Pawlak
A fuzzy reinforcement learning approach for pre-congestion notification based admission control
AIMS'12 Proceedings of the 6th IFIP WG 6.6 international autonomous infrastructure, management, and security conference on Dependable Networks and Services
Distributed self-organized collaboration of autonomous IDS sensors
AIMS'12 Proceedings of the 6th IFIP WG 6.6 international autonomous infrastructure, management, and security conference on Dependable Networks and Services
An online kernel-based clustering approach for value function approximation
SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
An adaptive dialogue system with online dialogue policy learning
SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
Unstable path routing in urban-scale WSN
ACM SIGBED Review - Special Issue on the 3rd International Workshop on Networks of Cooperating Objects (CONET 2012)
Levels of realism for cooperative multi-agent reinforcement learning
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part I
Reinforcement Learning with Approximation Spaces
Fundamenta Informaticae
Behavioral Pattern Identification Through Rough Set Modeling
Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Calculi of Approximation Spaces
Fundamenta Informaticae - SPECIAL ISSUE ON CONCURRENCY SPECIFICATION AND PROGRAMMING (CS&P 2005) Ruciane-Nide, Poland, 28-30 September 2005
Interactive information systems: Toward perception based computing
Theoretical Computer Science
Fundamenta Informaticae - Contagious Creativity - In Honor of the 80th Birthday of Professor Solomon Marcus
Faster program adaptation through reward attribution inference
Proceedings of the 11th International Conference on Generative Programming and Component Engineering
A diversity dilemma in evolutionary markets
Proceedings of the 13th International Conference on Electronic Commerce
Market niching in multi-attribute computational resource allocation systems
Proceedings of the 13th International Conference on Electronic Commerce
A comparative study of reinforcement learning techniques on dialogue management
EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
A Flexible and Adaptive Hyper-heuristic Approach for (Dynamic) Capacitated Vehicle Routing Problems
Fundamenta Informaticae - Emergent Computing
The Journal of Supercomputing
A multi-agent reinforcement learning approach to robot soccer
Artificial Intelligence Review
A cognitive WSN framework for highway safety based on weighted cognitive maps and Q-learning
Proceedings of the second ACM international symposium on Design and analysis of intelligent vehicular networks and applications
Managing Femto to Macro Interference without X2 Interface Support through POMDP
Mobile Networks and Applications
Learning and reasoning with action-related places for robust mobile manipulation
Journal of Artificial Intelligence Research
Learning to win by reading manuals in a monte-carlo framework
Journal of Artificial Intelligence Research
Real-world reinforcement learning for autonomous humanoid robot docking
Robotics and Autonomous Systems
Learning high-level planning from text
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Generative goal-driven user simulation for dialog management
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Optimising incremental dialogue decisions using information density for interactive systems
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Framework of automatic text summarization using reinforcement learning
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
More for your money: exploiting performance heterogeneity in public clouds
Proceedings of the Third ACM Symposium on Cloud Computing
Constructing test collections by inferring document relevance via extracted relevant information
Proceedings of the 21st ACM international conference on Information and knowledge management
Function optimisation by learning automata
Information Sciences: an International Journal
Mobile robot navigation: neural Q-learning
International Journal of Computer Applications in Technology
SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology, and Policy
INFORMS Journal on Computing
Estimating interleaved comparison outcomes from historical click data
Proceedings of the 21st ACM international conference on Information and knowledge management
Improving the performance of the reinforcement learning model for answering complex questions
Proceedings of the 21st ACM international conference on Information and knowledge management
Analysis of solutions to the time-optimal planning and execution problem
Intelligent Service Robotics
Thinking Inside the Box: Controlling and Using an Oracle AI
Minds and Machines
Information Sciences: an International Journal
Computers & Mathematics with Applications
Multi-agent learning and control system using ants colony for packet scheduling in routers
APNOMS'07 Proceedings of the 10th Asia-Pacific conference on Network Operations and Management Symposium: managing next generation networks and services
Multi-armed bandit formulation of the task partitioning problem in swarm robotics
ANTS'12 Proceedings of the 8th international conference on Swarm Intelligence
Improving scheduling performance using a q-learning-based leasing policy for clouds
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Autonomous shaping via coevolutionary selection of training experience
PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Distributed learning of best response behaviors in concurrent iterated many-object negotiations
MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Evolutionary dynamics of ant colony optimization
MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Q-Tree: automatic construction of hierarchical state representation for reinforcement learning
ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
APRIL: active preference learning-based reinforcement learning
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Autonomous data-driven decision-making in smart electricity markets
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Bayesian nonparametric inverse reinforcement learning
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Learning policies for battery usage optimization in electric vehicles
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Policy iteration based on a learned transition model
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Active learning of inverse models with intrinsically motivated goal exploration in robots
Robotics and Autonomous Systems
Adaptive reservoir computing through evolution and learning
Neurocomputing
Adaptive exploration using stochastic neurons
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
Integration of static and self-motion-based depth cues for efficient reaching and locomotor actions
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Making a reinforcement learning agent believe
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Biologically plausible multi-dimensional reinforcement learning in neural networks
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Understanding the role of serotonin in basal ganglia through a unified model
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
A computational model of motor areas based on bayesian networks and most probable explanations
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Learning-Based test programming for programmers
ISoLA'12 Proceedings of the 5th international conference on Leveraging Applications of Formal Methods, Verification and Validation: technologies for mastering change - Volume Part I
Gradient algorithms for exploration/exploitation trade-offs: global and local variants
ANNPR'12 Proceedings of the 5th INNS IAPR TC 3 GIRPR conference on Artificial Neural Networks in Pattern Recognition
Extracting key gene regulatory dynamics for the direct control of mechanical systems
PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part I
Multi-agent task division learning in hide-and-seek games
AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Learning motion controllers with adaptive depth perception
EUROSCA'12 Proceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation
Learning motion controllers with adaptive depth perception
Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Video search and indexing with reinforcement agent for interactive multimedia services
ACM Transactions on Embedded Computing Systems (TECS) - Special issue on embedded systems for interactive multimedia services (ES-IMS)
Reinforcement learning approach to multi-stage decision making problems with changes in action sets
Artificial Life and Robotics
Continuous strategy replicator dynamics for multi-agent Q-learning
Autonomous Agents and Multi-Agent Systems
A distributed Q-learning approach for variable attention to multiple critics
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Cooperative behavior acquisition in multi-agent reinforcement learning system using attention degree
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
Sparse gradient-based direct policy search
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
Learn to swing up and balance a real pole based on raw visual input data
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Robot dancing: adapting robot dance to human preferences
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Upper confidence tree-based consistent reactive planning application to minesweeper
LION'12 Proceedings of the 6th international conference on Learning and Intelligent Optimization
Evaluation of a family of reinforcement learning cross-domain optimization heuristics
LION'12 Proceedings of the 6th international conference on Learning and Intelligent Optimization
Reinforcement learning transfer using a sparse coded inter-task mapping
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Local coordination in online distributed constraint optimization problems
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Multi-agent learning and the reinforcement gradient
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Recognizing internal states of other agents to anticipate and coordinate interactions
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Observer effect from stateful resources in agent sensing
Autonomous Agents and Multi-Agent Systems
Applying a framework for healthcare incentives simulation
Proceedings of the Winter Simulation Conference
An adaptive simulator for ML-rules
Proceedings of the Winter Simulation Conference
Model-based adaptive spatial sampling for occurrence map construction
Statistics and Computing
A learning strategy for software testing optimization based on dynamic programming
Proceedings of the Fourth Asia-Pacific Symposium on Internetware
Adaptive value function approximation for continuous-state stochastic dynamic programming
Computers and Operations Research
Scheduling fighter aircraft maintenance with reinforcement learning
Proceedings of the Winter Simulation Conference
Stochastic policy search for variance-penalized semi-Markov control
Proceedings of the Winter Simulation Conference
A sampled fictitious play based learning algorithm for infinite horizon Markov decision processes
Proceedings of the Winter Simulation Conference
Organizational Learning as Credit Assignment: A Model and Two Experiments
Organization Science
Learning classifier system with average reward reinforcement learning
Knowledge-Based Systems
A "Society of Mind" Cognitive Architecture Based on the Principles of Artificial Economics
International Journal of Artificial Life Research
Reusing historical interaction data for faster online learning to rank for IR
Proceedings of the sixth ACM international conference on Web search and data mining
Reinforcement Learning with Reward Shaping and Mixed Resolution Function Approximation
International Journal of Agent Technologies and Systems
A Reinforcement Learning Approach to Setting Multi-Objective Goals for Energy Demand Management
International Journal of Agent Technologies and Systems
Simulation Analysis for Choice of Binary Lotteries
Computational Economics
Asymptotic non-learnability of universal agents with computable horizon functions
Theoretical Computer Science
Two-step gradient-based reinforcement learning for underwater robotics behavior learning
Robotics and Autonomous Systems
Simulating Cooperative Behaviors in Dynamic Networks
International Journal of Agent Technologies and Systems
Adaptive Kansei Search Method Using User's Subjective Criterion Deviation
International Journal of Computer Vision and Image Processing
Game designers training first person shooter bots
AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Exploration / exploitation trade-off in mobile context-aware recommender systems
AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
On-Line model-based continuous state reinforcement learning using background knowledge
AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
XCS with adaptive action mapping
SEAL'12 Proceedings of the 9th international conference on Simulated Evolution and Learning
Exploiting user feedback for adapting mobile interaction obtrusiveness
UCAmI'12 Proceedings of the 6th international conference on Ubiquitous Computing and Ambient Intelligence
Modular value iteration through regional decomposition
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Avoiding unintended AI behaviors
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Decision support for safe AI design
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
On measuring social intelligence: experiments on competition and cooperation
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Space-Time embedded intelligence
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Memory issues of intelligent agents
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Deconstructing reinforcement learning in sigma
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
On ensemble techniques for AIXI approximation
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Motivation management in AGI systems
AGI'12 Proceedings of the 5th international conference on Artificial General Intelligence
Optimizing spectrum trading in cognitive mesh network using machine learning
Journal of Electrical and Computer Engineering - Special issue on Resource Allocation in Communications and Computing
A survey of point-based POMDP solvers
Autonomous Agents and Multi-Agent Systems
Safe exploration of state and action spaces in reinforcement learning
Journal of Artificial Intelligence Research
From dynamic movement primitives to associative skill memories
Robotics and Autonomous Systems
Sourcing strategies in supply risk management: An approximate dynamic programming approach
Computers and Operations Research
Wireless Personal Communications: An International Journal
Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
International Journal of Ad Hoc and Ubiquitous Computing
Learning non-myopically from human-generated reward
Proceedings of the 2013 international conference on Intelligent user interfaces
A hierarchical representation policy iteration algorithm for reinforcement learning
IScIDE'12 Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
Applying reinforcement learning for web pages ranking algorithms
Applied Soft Computing
Transferring task models in Reinforcement Learning agents
Neurocomputing
A state-dependent time evolving multi-constraint routing algorithm
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Policy sharing between multiple mobile robots using decision trees
Information Sciences: an International Journal
EvoApplications'13 Proceedings of the 16th European conference on Applications of Evolutionary Computation
Non-reciprocating Sharing Methods in Cooperative Q-Learning Environments
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
A Hybrid Cooperative Behavior Learning Method for a Rule-Based Shout-Ahead Architecture
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Knowledge-Based Exploration for Reinforcement Learning in Self-Organizing Neural Networks
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Abstraction in Model Based Partially Observable Reinforcement Learning Using Extended Sequence Trees
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Routing in Distributed Cognitive Radio Networks: A Survey
Wireless Personal Communications: An International Journal
Affective touch gesture recognition for a furry zoomorphic machine
Proceedings of the 7th International Conference on Tangible, Embedded and Embodied Interaction
Analysis of strategy in robot soccer game
Neurocomputing
Neuroevolution results in emergence of short-term memory in multi-goal environment
Proceedings of the 15th annual conference on Genetic and evolutionary computation
Selection strategy for XCS with adaptive action mapping
Proceedings of the 15th annual conference on Genetic and evolutionary computation
Hybrid POMDP based evolutionary adaptive framework for efficient visual tracking algorithms
Proceedings of the 15th annual conference on Genetic and evolutionary computation
Solving the distal reward problem with rare correlations
Neural Computation
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Extended rule-based genetic network programming
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Dynamic Pay-Per-Action Mechanisms and Applications to Online Advertising
Operations Research
DCOB: Action space for reinforcement learning of high DoF robots
Autonomous Robots
Learning with configurable operators and RL-based heuristics
NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
Information Systems
Neural representation of reward probability: Evidence from the illusion of control
Journal of Cognitive Neuroscience
Popularity-based relevance propagation
Journal of Web Engineering
Simulation, learning, and optimization techniques in Watson's game strategies
IBM Journal of Research and Development
Self-organized collaboration of distributed IDS sensors
DIMVA'12 Proceedings of the 9th international conference on Detection of Intrusions and Malware, and Vulnerability Assessment
Forward and backward feature selection in gradient-based MDP algorithms
MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Interaction-based group identity detection via reinforcement learning and artificial evolution
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Generation of tests for programming challenge tasks using multi-objective optimization
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Testing probabilistic equivalence through Reinforcement Learning
Information and Computation
Finding your way in the testing jungle: a learning approach to web security testing
Proceedings of the 2013 International Symposium on Software Testing and Analysis
Learning classifier systems: introducing the user-friendly textbook
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Efficient sample reuse in policy gradients with parameter-based exploration
Neural Computation
From occasional choices to inevitable musts: a computational model of nicotine addiction
Computational Intelligence and Neuroscience
Towards a deeper understanding of cooperative equilibrium: characterization and complexity
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Emergence of social norms through collective learning in networked agent societies
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using informative behavior to increase engagement in the tamer framework
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Smart exploration in reinforcement learning using absolute temporal difference errors
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Addressing the policy-bias of q-learning by repeating updates
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Teaching on a budget: agents advising agents in reinforcement learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Learning exploration strategies in model-based reinforcement learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
CLEAN rewards for improving multiagent coordination in the presence of exploration
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Learning in non-stationary MDPs as transfer learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Decentralized coordination via task decomposition and reward shaping
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using response probability to build system redundancy in multiagent systems
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
A generic adaptive simulation algorithm for component-based simulation systems
Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Supporting adaptation of decentralized software based on application scenarios
Journal of Systems and Software
Online learning in a chemical perceptron
Artificial Life
On the complexity of trial and error
Proceedings of the forty-fifth annual ACM symposium on Theory of computing
Learning resources in federated environments: a broken link checker based on URL similarity
International Journal of Metadata, Semantics and Ontologies
Building a social multi-agent system simulation management toolbox
Proceedings of the 6th Balkan Conference in Informatics
Corticostriatal contributions to musical expectancy perception
Journal of Cognitive Neuroscience
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Autonomous task partitioning in robot foraging: an approach based on cost estimation
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Inverse reinforcement learning for interactive systems
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Machine learning for interactive systems and robots: a brief introduction
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Shared control of a robot using EEG-based feedback signals
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Emotion-oriented agent in mental state transition learning network
International Journal of Computational Intelligence Studies
Architecture of a cyberphysical avatar
Proceedings of the ACM/IEEE 4th International Conference on Cyber-Physical Systems
Performance bounds for λ policy iteration and application to the game of Tetris
The Journal of Machine Learning Research
On Potential Cognitive Abilities in the Machine Kingdom
Minds and Machines
Finite-sample analysis of least-squares policy iteration
The Journal of Machine Learning Research
The Journal of Machine Learning Research
Linear fitted-Q iteration with multiple reward functions
The Journal of Machine Learning Research
Using historical click data to increase interleaving sensitivity
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
On the moral responsibility of military robots
Ethics and Information Technology
Wireless Personal Communications: An International Journal
Multi-criteria expertness based cooperative Q-learning
Applied Intelligence
Scenario Trees and Policy Selection for Multistage Stochastic Programming Using Machine Learning
INFORMS Journal on Computing
Modelling mental rotation in cognitive robots
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A vision for a stochastic reasoner for autonomic cloud deployment
Proceedings of the Second Nordic Symposium on Cloud Computing & Internet Technologies
ACM Transactions on Interactive Intelligent Systems (TiiS)
A novel reinforcement learning architecture for continuous state and action spaces
Advances in Artificial Intelligence
Vicarious reinforcement and ex ante law enforcement: a study in norm-governed learning agents
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law
Robust Regulation Adaptation in Multi-Agent Systems
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Spike-timing-dependent construction
Neural Computation
Probabilistic model-based imitation learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Review: Vulnerabilities in cognitive radio networks: A survey
Computer Communications
Engineering Applications of Artificial Intelligence
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination
Artificial Intelligence
Evolutionary robotics approach to odor source localization
Neurocomputing
Reinforcement learning in robotics: A survey
International Journal of Robotics Research
A study of ex ante law enforcement in norm-governed learning agents
JSAI-isAI'12 Proceedings of the 2012 international conference on New Frontiers in Artificial Intelligence
Living Machines'13 Proceedings of the Second international conference on Biomimetic and Biohybrid Systems
Reward shaping for statistical optimisation of dialogue management
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
A model of emotional intelligent agent for cooperative goal exploration
ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories
AHPM as a proposal to improve interaction with air traffic controllers
HCI'13 Proceedings of the 15th international conference on Human-Computer Interaction: interaction modalities and techniques - Volume Part IV
Technical Section: Goal directed multi-finger manipulation: Control policies and analysis
Computers and Graphics
Toward nonlinear local reinforcement learning rules through neuroevolution
Neural Computation
Fidelity, Soundness, and Efficiency of Interleaved Comparison Methods
ACM Transactions on Information Systems (TOIS)
Models of gaze control for manipulation tasks
ACM Transactions on Applied Perception (TAP)
Strategic cognitive sequencing: a computational cognitive neuroscience approach
Computational Intelligence and Neuroscience - Special issue on Neurocognitive Models of Sense Making
An intelligent broker agent for energy trading: an MDP approach
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Efficiently solving joint activity based security games
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Monte-Carlo expectation maximization for decentralized POMDPs
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Sufficiency-based selection strategy for MCTS
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Prior-free exploration bonus for and beyond near bayes-optimal behavior
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Online expectation maximization for reinforcement learning in POMDPs
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Meta-interpretive learning of higher-order dyadic datalog: predicate invention revisited
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Lifelong learning for acquiring the wisdom of the crowd
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Towards a second generation random walk planner: an experimental exploration
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Fault-tolerant planning under uncertainty
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Hedonic value: enhancing adaptation for motivated agents
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Extending sensorimotor contingency theory: prediction, planning, and action generation
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Using reinforcement learning to find an optimal set of features
Computers & Mathematics with Applications
Behavior Selection Using Utility-Based Reinforcement Learning in Irregular Warfare Simulation Models
International Journal of Operations Research and Information Systems
Adaptivity on the robot brain architecture level using reinforcement learning
Robot Soccer World Cup XV
Camera modeling technique of 3D sensing based on tile coding for computer vision
BodyNets '13 Proceedings of the 8th International Conference on Body Area Networks
Proceedings of the 19th international conference on Intelligent User Interfaces
Reduction of state space in reinforcement learning by sensor selection
Artificial Life and Robotics
Reinforcement learning models for scheduling in wireless networks
Frontiers of Computer Science: Selected Publications from Chinese Universities
Wireless Personal Communications: An International Journal
Monte-Carlo tree search for Bayesian reinforcement learning
Applied Intelligence
Learning via human feedback in continuous state and action spaces
Applied Intelligence
Reinforcement learning based routing in wireless mesh networks
Wireless Networks
Fast damage recovery in robotics with the T-resilience algorithm
International Journal of Robotics Research
Towards a real-time interface between a biomimetic model of sensorimotor cortex and a robotic arm
Pattern Recognition Letters
Scheduling a dynamic aircraft repair shop with limited repair resources
Journal of Artificial Intelligence Research
Distributed reasoning for multiagent simple temporal problems
Journal of Artificial Intelligence Research
Analysis of watson's strategies for playing Jeopardy!
Journal of Artificial Intelligence Research
The arcade learning environment: an evaluation platform for general agents
Journal of Artificial Intelligence Research
Learning by observation of agent software images
Journal of Artificial Intelligence Research
Bi-LCQ: A low-weight clustering-based Q-learning approach for NoCs
Microprocessors & Microsystems
Robustness of stochastic bandit policies
Theoretical Computer Science
Universal knowledge-seeking agents
Theoretical Computer Science
General time consistent discounting
Theoretical Computer Science
Construction of approximation spaces for reinforcement learning
The Journal of Machine Learning Research
Counterfactual reasoning and learning systems: the example of computational advertising
The Journal of Machine Learning Research
Journal of Cognitive Neuroscience
Journal of Cognitive Neuroscience
A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation
Artificial Life and Robotics
Efficient bidding strategies for Cliff-Edge problems
Autonomous Agents and Multi-Agent Systems
Multiagent learning in the presence of memory-bounded agents
Autonomous Agents and Multi-Agent Systems
Dopamine ramps are a consequence of reward prediction errors
Neural Computation
Learning potential functions and their representations for multi-task reinforcement learning
Autonomous Agents and Multi-Agent Systems
Optimal learning for sequential sampling with non-parametric beliefs
Journal of Global Optimization
Embodied imitation-enhanced reinforcement learning in multi-agent systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Computer Networks: The International Journal of Computer and Telecommunications Networking
Hierarchical control of traffic signals using Q-learning with tile coding
Applied Intelligence
Wireless Personal Communications: An International Journal
Journal of Intelligent and Robotic Systems
Simulation Analysis for Network Formulation
Computational Economics
Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous Teams
Journal of Intelligent and Robotic Systems
Multiagent meta-level control for radar coordination
Web Intelligence and Agent Systems
MineralMiner: An active sensing simulation environment
Multiagent and Grid Systems
Analysis of emission right prices in greenhouse gas emission trading via agent-based model
Multiagent and Grid Systems
A survey of multi-objective sequential decision-making
Journal of Artificial Intelligence Research
Scalable and efficient bayes-adaptive reinforcement learning based on monte-carlo tree search
Journal of Artificial Intelligence Research
A tour of machine learning: An AI perspective
AI Communications - ECAI 2012 Turing and Anniversary Track
Artificial Intelligence: From programs to solvers
AI Communications - ECAI 2012 Turing and Anniversary Track
Interactive activity recognition and prompting to assist people with cognitive disabilities
Journal of Ambient Intelligence and Smart Environments - Home-based Health and Wellness Measurement and Monitoring
Adaptive function approximation in reinforcement learning with an interpolating growing neural gas
International Journal of Hybrid Intelligent Systems
Integrated Computer-Aided Engineering
Automatic skill acquisition in reinforcement learning using graph centrality measures
Intelligent Data Analysis
A comparison between a communication-based and a data mining-based learning approach for agents
Intelligent Decision Technologies
METAL: A framework for mixture-of-experts task and attention learning
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Similarity of learned helplessness in human being and fuzzy reinforcement learning algorithms
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Computational intelligence models for image processing and information reasoning
Active noise control system via multi-agent credit assignment
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Experimental Evaluation of Automatic Hint Generation for a Logic Tutor
International Journal of Artificial Intelligence in Education - Best of AIED 2011
A multi-agent control architecture for a robotic wheelchair
Applied Bionics and Biomechanics
Behaviour generation in humanoids by learning potential-based policies from constrained motion
Applied Bionics and Biomechanics
Multi-timescale nexting in a reinforcement learning robot
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Mobile Information Systems
Hi-index | 0.02 |
From the Publisher:In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.