Distributed Reinforcement Learning Control for Batch Sequencing and Sizing in Just-In-Time Manufacturing Systems

Authors:
Joonki Hong;Vittaldas V. Prabhu
Affiliations:
Department of Industrial and Manufacturing Engineering, The Pennsylvania State University, University Park, PA 16802,USA;Department of Industrial and Manufacturing Engineering, The Pennsylvania State University, University Park, PA 16802,USA. prabhu@engr.psu.edu
Venue:
Applied Intelligence
Year:
2004

Citing 15
Cited 8

Minimizing mean squared deviation of completion times about a common due date

Management Science
Sequencing with earliness and tardiness penalties: a review

Operations Research
Practical Issues in Temporal Difference Learning

Machine Learning
Technical Note: \cal Q-Learning

Machine Learning
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Machine Learning
A study on decision rules of a scheduling model in an FMS

Computers in Industry
Asynchronous Stochastic Approximation and Q-Learning

Machine Learning
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Average reward reinforcement learning: foundations, algorithms, and empirical results

Machine Learning - Special issue on reinforcement learning
Early/tardy scheduling with sequence dependent setups on uniform parallel machines

Computers and Operations Research
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Distributed Learning and Control for Manufacturing Systems Scheduling

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Multi-Machine Scheduling - A Multi-Agent Learning Approach

ICMAS '98 Proceedings of the 3rd International Conference on Multi Agent Systems

Voting in multi-agent system for improvement of partial observations

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning

Computers and Operations Research
CLA-DE: a hybrid model based on cellular learning automata for numerical optimization

Applied Intelligence
A closed-loop feedback simulation for RFID-based manufacturing planning and control system

International Journal of Information Technology and Management
Intelligent controllers for bi-objective dynamic scheduling on a single machine with sequence-dependent setups

Applied Soft Computing
Monte-Carlo tree search for Bayesian reinforcement learning

Applied Intelligence
Learning via human feedback in continuous state and action spaces

Applied Intelligence
On incorporating the paradigms of discretization and Bayesian estimation to create a new family of pursuit learning automata

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an approach that is suitable for Just-In-Time (JIT) production for multi-objective scheduling problem in dynamically changing shop floor environment. The proposed distributed learning and control (DLC) approach integrates part-driven distributed arrival time control (DATC) and machine-driven distributed reinforcement learning based control. With DATC, part controllers adjust their associated parts' arrival time to minimize due-date deviation. Within the restricted pattern of arrivals, machine controllers are concurrently searching for optimal dispatching policies. The machine control problem is modeled as Semi Markov Decision Process (SMDP) and solved using Q-learning. The DLC algorithms are evaluated using simulation for two types of manufacturing systems: family scheduling and dynamic batch sizing. Results show that DLC algorithms achieve significant performance improvement over usual dispatching rules in complex real-time shop floor control problems for JIT production.