Regret-optimal control in dynamic environments

Abstract

We consider the control of linear time-varying dynamical systems from theperspective of regret minimization. Unlike most prior work in this area, wefocus on the problem of designing an online controller which competes with thebest dynamic sequence of control actions selected in hindsight, instead of thebest controller in some specific class of controllers. This formulation isattractive when the environment changes over time and no single controllerachieves good performance over the entire time horizon. We derive the structureof the regret-optimal online controller via a novel reduction to $H_{\infty}$control and present a clean data-dependent bound on its regret. We also presentnumerical simulations which confirm that our regret-optimal controllersignificantly outperforms the $H_2$ and $H_{\infty}$ controllers in dynamicenvironments.

Quick Read (beta)

loading the full paper ...