An adaptive inventory control for a supply chain

Authors:
Junqin Xu;Jihui Zhang;Yushuang Liu
Affiliations:
School of Mathematical Science, Qingdao University, Qingdao, China;Institute of Complexity Science, Qingdao University, Qingdao, China;College of Science, Qingdao University of Science & Technology, Qingdao, China
Venue:
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Year:
2009

Citing 8
Cited 0

Dyna, an integrated architecture for learning, planning, and reacting

ACM SIGART Bulletin
Technical Note: \cal Q-Learning

Machine Learning
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Competitive and Cooperative Inventory Policies in a Two-Stage Supply Chain

Management Science
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Learning to Predict by the Methods of Temporal Differences

Machine Learning
A Single-Item Inventory Model for a Nonstationary Demand Process

Manufacturing & Service Operations Management
Asynchronous action-reward learning for nonstationary serial supply chain inventory control

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Uncertainties inherent in customer demands make it difficult for supply chains to achieve just-in-time inventory replenishment, resulting in loosing sales opportunities or keeping excessive chain-wide inventories. In this paper, two adaptive inventory-control models, a centralized model and a decentralized one, are proposed for a supply chain consisting of one supplier and one retailers. The objective of the two models is to satisfy a target service level predefined for each retailer and to minimize the whole inventory cost. The inventory-control parameters of the supplier and retailers are safety lead time and safety stocks, respectively. Unlike most extant inventory-control approaches, modelling the uncertainty of customer demand as a statistical distribution is not a prerequisite in the two models. Instead, using a reinforcement learning technique called action-reward method, the control parameters are designed to adaptively change as customer demand patterns changes. A simulation-based experiment was performed to compare the performance of the two inventory control models.