Information Aggregation for Constrained Online Control
Abstract
We consider a two-controller online control problem where a central controller chooses an action from a feasible set that is determined by time-varying and coupling constraints, which depend on all past actions and states. The central controller's goal is to minimize the cumulative cost; however, the controller has access to neither the feasible set nor the dynamics directly, which are determined by a remote local controller. Instead, the central controller receives only an aggregate summary of the feasibility information from the local controller, which does not know the system costs. We show that it is possible for an online algorithm using feasibility information to nearly match the dynamic regret of an online algorithm using perfect information whenever the feasible sets satisfy a causal invariance criterion and there is a sufficiently large prediction window size. To do so, we use a form of feasibility aggregation based on entropic maximization in combination with a novel online algorithm, named Penalized Predictive Control (PPC).
Additional Information
© 2021 Copyright held by the owner/author(s).Attached Files
Published - 3410220.3461737.pdf
Files
3410220.3461737.pdf
Files
(990.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:708820011d887c70fc0949646337ddc1
|
990.8 kB | Preview Download |
Additional details
Identifiers
- Eprint ID
- 109416
- Resolver ID
- CaltechAUTHORS:20210607-115053999
Related works
- Describes
- https://resolver.caltech.edu/CaltechAUTHORS:20210604-111535691 (URL)
Funding
- NSF
- CCF-1637598
- Amazon Web Services
- Hong Kong Research Grant Council
- 16207318
- PIMCO
- ECCS-1931662
- NSF
- ECCS-1932611
- NSF
- CCF-1637598
- NSF
- CNS-1518941
Dates
- Created
-
2021-06-07Created from EPrint's datestamp field
- Updated
-
2021-11-16Created from EPrint's last_modified field