Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach

Qu, Guannan; Yu, Chenkai; Low, Steven; Wierman, Adam

doi:10.1109/cdc45484.2021.9683735

Published December 14, 2021 | Version public

Book Section - Chapter Metadata-only

Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach

1. California Institute of Technology
2. Tsinghua University

Model-free learning-based control methods have seen great success recently. However, such methods typically suffer from poor sample complexity and limited convergence guarantees. This is in sharp contrast to classical model-based control, which has a rich theory but typically requires strong modeling assumptions. In this paper, we combine the two approaches. We consider a dynamical system with both linear and non-linear components and use the linear model to define a warm start for a model-free, policy gradient method. We show this hybrid approach outperforms the model-based controller while avoiding the convergence issues associated with model-free approaches via both numerical experiments and theoretical analyses, in which we derive sufficient conditions on the non-linear component such that our approach is guaranteed to converge to the (nearly) global optimal controller.

Additional Information

Additional details

Eprint ID: 115283
Resolver ID: CaltechAUTHORS:20220628-677879700

Created: 2022-06-28

Created from EPrint's datestamp field
Updated: 2022-06-28

Created from EPrint's last_modified field

	All versions	This version
Views	10	10
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach

Additional Information

Additional details

Identifiers

Dates

Exploiting Linear Models for Model-Free Nonlinear Control: A Provably Convergent Policy Gradient Approach

Creators

Abstract

Additional Information

Additional details

Identifiers

Dates