Sequencing with earliness and tardiness penalties: a review
Operations Research
Practical Issues in Temporal Difference Learning
Machine Learning
Technical Note: \cal Q-Learning
Machine Learning
A study on decision rules of a scheduling model in an FMS
Computers in Industry
Asynchronous Stochastic Approximation and Q-Learning
Machine Learning
Learning to act using real-time dynamic programming
Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Average reward reinforcement learning: foundations, algorithms, and empirical results
Machine Learning - Special issue on reinforcement learning
Early/tardy scheduling with sequence dependent setups on uniform parallel machines
Computers and Operations Research
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Distributed Learning and Control for Manufacturing Systems Scheduling
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Multi-Machine Scheduling - A Multi-Agent Learning Approach
ICMAS '98 Proceedings of the 3rd International Conference on Multi Agent Systems
Voting in multi-agent system for improvement of partial observations
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Computers and Operations Research
A closed-loop feedback simulation for RFID-based manufacturing planning and control system
International Journal of Information Technology and Management
Monte-Carlo tree search for Bayesian reinforcement learning
Applied Intelligence
Learning via human feedback in continuous state and action spaces
Applied Intelligence
Hi-index | 0.00 |
This paper presents an approach that is suitable for Just-In-Time (JIT) production for multi-objective scheduling problem in dynamically changing shop floor environment. The proposed distributed learning and control (DLC) approach integrates part-driven distributed arrival time control (DATC) and machine-driven distributed reinforcement learning based control. With DATC, part controllers adjust their associated parts' arrival time to minimize due-date deviation. Within the restricted pattern of arrivals, machine controllers are concurrently searching for optimal dispatching policies. The machine control problem is modeled as Semi Markov Decision Process (SMDP) and solved using Q-learning. The DLC algorithms are evaluated using simulation for two types of manufacturing systems: family scheduling and dynamic batch sizing. Results show that DLC algorithms achieve significant performance improvement over usual dispatching rules in complex real-time shop floor control problems for JIT production.