Published September 27, 2021 | Version Submitted
Book Section - Chapter Open

Learning to Control an Unstable System with One Minute of Data: Leveraging Gaussian Process Differentiation in Predictive Control

Abstract

We present a straightforward and efficient way to control unstable robotic systems using an estimated dynamics model. Specifically, we show how to exploit the differentiability of Gaussian Processes to create a state-dependent linearized approximation of the true continuous dynamics that can be integrated with model predictive control. Our approach is compatible with most Gaussian process approaches for system identification, and can learn an accurate model using modest amounts of training data. We validate our approach by learning the dynamics of an unstable system such as a segway with a 7-D state space and 2-D input space (using only one minute of data), and we show that the resulting controller is robust to unmodelled dynamics and disturbances, while state-of-the-art control methods based on nominal models can fail under small perturbations. Code is open sourced at https://github.com/learning-and-control/core.

Additional Information

© 2021 IEEE. This work was supported by NSF awards 1637598, 1645832, 1932091, 1924526, and 1923239, and funding from AeroVironment, JPL and BMW. We would like to thank Andrew Singletary and Ellen Novoseller for their help in setting up the robotics and the mathematical derivations, respectively. We would also like to thank the follwing software packages: PyTorch [43], CVXPY [44] and Gurobi [45].

Attached Files

Submitted - 2103.04548.pdf

Files

2103.04548.pdf

Files (3.6 MB)

Name Size Download all
md5:66ae6cb2565568115fdce53255df71d8
3.6 MB Preview Download

Additional details

Additional titles

Alternative title
Learning Unstable Dynamics with One Minute of Data: A Differentiation-based Gaussian Process Approach

Identifiers

Eprint ID
109029
Resolver ID
CaltechAUTHORS:20210510-095024771

Related works

Funding

NSF
CCF-1637598
NSF
CNS-1645832
NSF
CNS-1932091
NSF
ECCS-1924526
NSF
CMMI-1923239
AeroVironment
JPL
BMW

Dates

Created
2021-05-10
Created from EPrint's datestamp field
Updated
2022-04-04
Created from EPrint's last_modified field