A reinforcement learning approach for individualizing erythropoietin dosages in hemodialysis patients

Authors:
José D. Martín-Guerrero;Faustino Gomez;Emilio Soria-Olivas;Jürgen Schmidhuber;Mónica Climente-Martí;N. Víctor Jiménez-Torres
Affiliations:
Department of Electronic Engineering, University of Valencia, CL. Dr. Moliner, 50, 46100 Burjassot, Valencia, Spain;Istituto Dalle Molle di Studi sull'Intelligenza Artificiale (IDSIA), Lugano, Switzerland;Department of Electronic Engineering, University of Valencia, CL. Dr. Moliner, 50, 46100 Burjassot, Valencia, Spain;Istituto Dalle Molle di Studi sull'Intelligenza Artificiale (IDSIA), Lugano, Switzerland;Pharmacy Unit, University Hospital, Dr. Peset, Valencia, Spain;Pharmacy Unit, University Hospital, Dr. Peset, Valencia, Spain and Pharmacy and Pharmaceutical Technology Department, University of Valencia, Spain
Venue:
Expert Systems with Applications: An International Journal
Year:
2009

Citing 2
Cited 1

Drug delivery optimization through Bayesian networks: an application to erythropoietin therapy in uremic anemia

Computers and Biomedical Research - Papers presented at the 16th symposium on computer applications in medical care (SCAMC)
2005 Special Issue: Individualization of pharmacological anemia management using reinforcement learning

Neural Networks - 2005 Special issue: IJCNN 2005

Reinforcement learning based resource allocation in business process management

Data & Knowledge Engineering

Quantified Score

Hi-index	12.05

Visualization

Abstract

This paper presents a reinforcement learning (RL) approach for anemia management in patients undergoing chronic renal failure. Erythropoietin (EPO) is the treatment of choice for this kind of anemia but it is an expensive drug and with some dangerous side-effects that should be considered especially for patients who do not respond to the treatment. Therefore, an individualized treatment appears to be necessary. RL is a suitable approach to tackle this problem. Moreover, resulting policies are similar to medical protocols, and hence, they can easily be transferred to daily practice. A cohort of 64 patients are included in the study. An implementation of the Q-learning algorithm based on a state-aggregation table and another implementation using the multi-layer perceptron as a function approximator (Q-MLP) are compared with the protocols followed in the Nephrology Unit. The policy obtained by the Q-MLP approach outperforms the hospital policy in terms of the ratio of patients that are within the targeted range of hemoglobin (11.5-12.5g/dl) at the end of the analyzed period, since an increase of 25% is observed. It ensures an improvement in patients' quality-of-life and considerable economic savings for the health care system due to both the expensiveness of EPO treatment and the costs incurred by the health care system in order to alleviate problems related to EPO over-dosing. It should be pointed out that the approach presented here is completely general, and therefore, it can be applied to any problem of drug dosage optimization.