Abstract
In this paper, we investigate the use of model-based reinforcement learning to assist people with Type 1 Diabetes with insulin dose decisions. The proposed architecture consists of multiple Echo State Networks that predict blood glucose levels, combined with a Model Predictive Controller for planning. An Echo State Network is a type of recurrent neural network that allows us to learn long-term dependencies in time series input data in an online manner. Additionally, we address the quantification of uncertainty for more robust control: we use ensembles of Echo State Networks to capture model (epistemic) uncertainty. We evaluated the approach with the FDA-approved UVa/Padova Type 1 Diabetes simulator and compared the results against baseline algorithms such as a Basal-Bolus controller and Deep Q-learning. The results suggest that the model-based reinforcement learning algorithm can perform as well as or better than the baseline algorithms for the majority of the virtual Type 1 Diabetes person profiles tested.