Informational dynamics of censored observations
Management Science
Finite-time Analysis of the Multiarmed Bandit Problem
Machine Learning
Adaptive Ordering and Pricing for Perishable Products
Operations Research
The Censored Newsvendor and the Optimal Acquisition of Information
Operations Research
Inventory Control in Directed Networks: A Note on Linear Costs
Operations Research
Learning Algorithms for Separable Approximations of Discrete Stochastic Optimization Problems
Mathematics of Operations Research
Online convex optimization in the bandit setting: gradient descent without a gradient
SODA '05 Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms
Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models
Mathematics of Operations Research
Manufacturing & Service Operations Management
Regret in the Newsvendor Model with Partial Information
Operations Research
Analysis of Perishable-Inventory Systems with Censored Demand Data
Operations Research
Logarithmic regret algorithms for online convex optimization
COLT'06 Proceedings of the 19th annual conference on Learning Theory
Solving operational statistics via a Bayesian analysis
Operations Research Letters
A practical inventory control policy using operational statistics
Operations Research Letters
Mathematics of Operations Research
Robust Controls for Network Revenue Management
Manufacturing & Service Operations Management
Fully Distribution-Free Profit Maximization: The Inventory Management Case
Mathematics of Operations Research
Weak aggregating algorithm for the distribution-free perishable inventory problem
Operations Research Letters
Online lot-sizing problems with ordering, holding and shortage costs
Operations Research Letters
Hi-index | 0.00 |
We study stochastic inventory planning with lost sales and instantaneous replenishment where, contrary to the classical inventory theory, knowledge of the demand distribution is not available. Furthermore, we observe only the sales quantity in each period and lost sales are unobservable, that is, demand data are censored. The manager must make an ordering decision in each period based only on historical sales data. Excess inventory is either perishable or carried over to the next period. In this setting, we propose nonparametric adaptive policies that generate ordering decisions over time. We show that the T-period average expected cost of our policy differs from the benchmark newsvendor cost---the minimum expected cost that would have incurred if the manager had known the underlying demand distribution---by at most O(1/T0.5).