Qu, Guannan and Yu, Chenkai and Low, Steven and Wierman, Adam (2021) Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach. In: 2021 60th IEEE Conference on Decision and Control (CDC). IEEE , Piscataway, NJ, pp. 6539-6546. ISBN 978-1-6654-3659-5. https://resolver.caltech.edu/CaltechAUTHORS:20220628-677879700
Full text is not posted in this repository. Consult Related URLs below.
Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20220628-677879700
Abstract
Model-free learning-based control methods have seen great success recently. However, such methods typically suffer from poor sample complexity and limited convergence guarantees. This is in sharp contrast to classical model-based control, which has a rich theory but typically requires strong modeling assumptions. In this paper, we combine the two approaches. We consider a dynamical system with both linear and non-linear components and use the linear model to define a warm start for a model-free, policy gradient method. We show this hybrid approach outperforms the model-based controller while avoiding the convergence issues associated with model-free approaches via both numerical experiments and theoretical analyses, in which we derive sufficient conditions on the non-linear component such that our approach is guaranteed to converge to the (nearly) global optimal controller.
Item Type: | Book Section | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Related URLs: |
| ||||||||||
ORCID: |
| ||||||||||
Additional Information: | © 2021 IEEE. | ||||||||||
DOI: | 10.1109/cdc45484.2021.9683735 | ||||||||||
Record Number: | CaltechAUTHORS:20220628-677879700 | ||||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20220628-677879700 | ||||||||||
Official Citation: | G. Qu, C. Yu, S. Low and A. Wierman, "Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach," 2021 60th IEEE Conference on Decision and Control (CDC), 2021, pp. 6539-6546, doi: 10.1109/CDC45484.2021.9683735 | ||||||||||
Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | ||||||||||
ID Code: | 115283 | ||||||||||
Collection: | CaltechAUTHORS | ||||||||||
Deposited By: | Tony Diaz | ||||||||||
Deposited On: | 28 Jun 2022 17:49 | ||||||||||
Last Modified: | 28 Jun 2022 17:49 |
Repository Staff Only: item control page