Regret-optimal control in dynamic environments

Creators: Goel, Gautam; Hassibi, Babak

Abstract

We consider the control of linear time-varying dynamical systems from the perspective of regret minimization. Unlike most prior work in this area, we focus on the problem of designing an online controller which competes with the best dynamic sequence of control actions selected in hindsight, instead of the best controller in some specific class of controllers. This formulation is attractive when the environment changes over time and no single controller achieves good performance over the entire time horizon. We derive the structure of the regret-optimal online controller via a novel reduction to H_∞ control and present a clean data-dependent bound on its regret. We also present numerical simulations which confirm that our regret-optimal controller significantly outperforms the H_2 and H_∞ controllers in dynamic environments.
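The benchmark described in the abstract, the best dynamic sequence of control actions selected in hindsight, can be illustrated with a small numerical sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the dynamics form x_{t+1} = A_t x_t + B_t u_t + w_t, identity cost weights, the toy matrices, and the helper names `rollout_cost`, `hindsight_controls`, and `feedback_controls`. The causal policy is an arbitrary fixed-gain feedback, not the regret-optimal controller; the sketch only shows how dynamic regret against the hindsight-optimal sequence could be measured.

```python
import numpy as np

def rollout_cost(A, B, w, u, x0):
    """Cost sum_t (|x_{t+1}|^2 + |u_t|^2) under x_{t+1} = A[t] x_t + B[t] u_t + w[t].
    Identity cost weights are an assumption of this sketch."""
    x, cost = x0, 0.0
    for t in range(len(w)):
        x = A[t] @ x + B[t] @ u[t] + w[t]
        cost += float(x @ x + u[t] @ u[t])
    return cost

def hindsight_controls(A, B, w, x0):
    """Best control sequence given the full disturbance sequence w.
    Stacks the states as X = F u + g and solves min |F u + g|^2 + |u|^2."""
    T, n, m = len(w), x0.size, B[0].shape[1]
    F = np.zeros((T * n, T * m))
    g = np.zeros(T * n)
    F_prev, g_prev = np.zeros((n, T * m)), x0
    for t in range(T):
        F_t = A[t] @ F_prev                  # effect of earlier controls on x_{t+1}
        F_t[:, t * m:(t + 1) * m] += B[t]    # direct effect of u_t on x_{t+1}
        g_t = A[t] @ g_prev + w[t]           # response with all controls set to zero
        F[t * n:(t + 1) * n], g[t * n:(t + 1) * n] = F_t, g_t
        F_prev, g_prev = F_t, g_t
    lhs = np.vstack([F, np.eye(T * m)])
    rhs = np.concatenate([-g, np.zeros(T * m)])
    u = np.linalg.lstsq(lhs, rhs, rcond=None)[0]
    return u.reshape(T, m)

def feedback_controls(A, B, w, x0, K):
    """Controls produced by a causal fixed-gain policy u_t = -K x_t (hypothetical baseline)."""
    x, u = x0, []
    for t in range(len(w)):
        u.append(-K @ x)
        x = A[t] @ x + B[t] @ u[-1] + w[t]
    return np.array(u)

# Toy time-varying system (made-up numbers, for illustration only).
rng = np.random.default_rng(0)
T, n, m = 30, 2, 1
A = [np.array([[1.0, 0.1], [0.0, 1.0 + 0.3 * np.sin(0.2 * t)]]) for t in range(T)]
B = [np.array([[0.0], [1.0]]) for _ in range(T)]
w = rng.normal(size=(T, n))
x0 = np.zeros(n)

u_star = hindsight_controls(A, B, w, x0)
u_fb = feedback_controls(A, B, w, x0, K=np.array([[0.5, 1.0]]))

# Dynamic regret: online cost minus the cost of the best sequence in hindsight.
dynamic_regret = rollout_cost(A, B, w, u_fb, x0) - rollout_cost(A, B, w, u_star, x0)
print(f"dynamic regret of the fixed-gain policy: {dynamic_regret:.3f}")
```

Because the hindsight sequence minimizes the cost over all control sequences, the regret computed this way is nonnegative for any causal policy, which is what makes it a meaningful yardstick when no single fixed controller performs well over the whole horizon.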

Additional Information

Attached Files

Submitted - 2010.10473.pdf

