Abstract
Sequential decision making under uncertainty is central to many ProcessSystems Engineering (PSE) challenges, where traditional methods often facelimitations related to controlling and optimizing complex and stochasticsystems. Reinforcement Learning (RL) offers a data-driven approach to derivecontrol policies for such challenges. This paper presents a survey and tutorialon RL methods, tailored for the PSE community. We deliver a tutorial on RL,covering fundamental concepts and key algorithmic families includingvalue-based, policy-based and actor-critic methods. Subsequently, we surveyexisting applications of these RL techniques across various PSE domains, suchas in fed-batch and continuous process control, process optimization, andsupply chains. We conclude with PSE focused discussion of specializedtechniques and emerging directions. By synthesizing the current state of RLalgorithm development and implications for PSE this work identifies successes,challenges, trends, and outlines avenues for future research at the interfaceof these fields.