26
Online Adversarial Stabilization of
Unknown Networked Systems
JING YU,
California Institute of Technology, United States
DIMITAR HO,
California Institute of Technology, United States
ADAM WIERMAN,
California Institute of Technology, United States
We investigate the problem of stabilizing an unknown networked linear system under communication con-
straints and adversarial disturbances. We propose the first provably stabilizing algorithm for the problem.
The algorithm uses a distributed version of nested convex body chasing to maintain a consistent estimate of
the network dynamics and applies system level synthesis to determine a distributed controller based on this
estimated model. Our approach avoids the need for system identification and accommodates a broad class of
communication delay while being fully distributed and scaling favorably with the number of subsystems.
CCS Concepts:
•
Theory of computation
→
Adversary models
;
Distributed algorithms
; Online learning
algorithms; Quadratic programming.
Additional Key Words and Phrases: online control; learning-based control; adversarial control; distributed
control; stability; communication delay
ACM Reference Format:
Jing Yu, Dimitar Ho, and Adam Wierman. 2023. Online Adversarial Stabilization of Unknown Networked
Systems.
Proc. ACM Meas. Anal. Comput. Syst.
7, 1, Article 26 (March 2023), 43 pages. https://doi.org/10.1145/
3579452
1 INTRODUCTION
Large-scale networked dynamical systems play a crucial role in many emerging engineering systems
such as the power grid [
26
], autonomous vehicles [
50
], and swarm robots [
57
]. Motivated by the
success of learning-based control methods for single-agent (centralized) linear systems, there has
been growing interest in learning distributed controllers for unknown networked systems composed
of interconnected and spatially distributed linear time-invariant (LTI) subsystems [
16
,
31
,
32
,
52
,
87
].
However, since most existing literature ports centralized learning-based control techniques over
to the distributed setting, almost all previous work assumes that the underlying dynamics are stable,
or that a stabilizing and distributed controller is known. For a large-scale networked system, such
assumptions are often unrealistic, because designing stabilizing distributed controllers itself is a
significant task even if the dynamics model is available [7, 30, 35, 64, 81, 92].
Recent work has begun to lift the assumption of the knowledge of a stabilizing controller in the
centralized case, e.g. [
18
,
40
,
70
]. This line of work follows the approach of system identification,
either by letting the unstable system run open-loop or by exciting the system via control inputs.
However, such approaches induce explosive transient behaviors due to the instability of the under-
lying system. Without proper generalization to the networked setting, such explosive behavior can
cause catastrophic system degradation before a proper stabilizing controller can be learned.
Authors’ addresses: Jing Yu, California Institute of Technology, Pasadena, United States, jing@caltech.edu; Dimitar Ho,
California Institute of Technology, Pasadena, United States, dho@caltech.edu; Adam Wierman, California Institute of
Technology, Pasadena, United States, adamw@caltech.edu.
This work is licensed under a Creative Commons Attribution International 4.0 License.
©
2023 Copyright held by the owner/author(s).
2476-1249/2023/3-ART26
https://doi.org/10.1145/3579452
Proc. ACM Meas. Anal. Comput. Syst., Vol. 7, No. 1, Article 26. Publication date: March 2023.
26:2
Jing Yu, Dimitar Ho, and Adam Wierman
Table 1. Maximum and top
90%
infinity norm of the state (
∥
푥
(
푡
)
∥
∞
) for different disturbance profiles averaged
over 10 runs. Simulation details are provided in Section 5.
Algorithm
Correlated Gaussian (Top
90%
)
Uniform (Top
90%
)
State-dependent (Top
90%
)
This work
1
.
21
×
10
1
(
0
.
31
×
10
1
)
2
.
30
×
10
1
(
0
.
36
×
10
1
)
7
.
14
×
10
1
(
0
.
54
×
10
1
)
SysID
5
.
12
×
10
11
(
1
.
71
×
10
11
)
5
.
12
×
10
11
(
1
.
71
×
10
11
)
5
.
12
×
10
11
(
1
.
71
×
10
11
)
Further, until now, scalability and information constraints have only been considered separately
in learning-based distributed controller design; no general approach exists. On the other hand,
information constraints and scalability have been the central topics in distributed control for the
past decade due to their theoretical challenge and practical importance [
56
,
63
,
72
,
72
,
82
,
93
].
Therefore, it is crucial to simultaneously consider such constraints when designing learning-based
distributed control algorithms for networked systems.
1.1 Contributions
In this work, we overcome the aforementioned challenges by leveraging recent advances in online
learning and distributed control. In particular, we propose an approach that combines a distributed
version of nested convex body chasing (NCBC), in order to maintain a consistent estimate of
the network dynamics, with system level synthesis (SLS), in order to determine a distributed
controller based on the selected consistent model. This combination yields the first online algorithm
that provably stabilizes a networked LTI system with information constraints under adversarial
disturbances (Theorem 4.4). The proposed algorithm (Algorithm 2) is distributed and scales favorably
to the number of subsystems in the network.
The proposed approach in this paper is fundamentally different than traditional system identifica-
tion based methods, which incur prohibitively large state norm under adversarial disturbances, even
in the simplest setting (see Table 1). The reason is that system identification-based approaches seek
to learn the full system dynamics, which requires full excitation of the system against worst-case
disturbances. On the other hand, our approach does not require precise knowledge of the system.
Instead, we maintain model estimates that are consistent with the observations generated by the
unknown system at all times. A consequence of focusing on consistency is a natural endogenous
exploration-exploitation scheme where our algorithm performs well (small state norm) while the
selected model stays consistent, and gains information about the system whenever it observes a
large state norm that renders the selected model inconsistent.
The main result of this paper is an input-to-state stability guarantee (Theorem 4.4), where we
draw novel connections between the path length property of NCBC techniques and system stability
analysis. This follows from a set of novel technical results for SLS in the learning-based control
context. In particular, we generalize a previous result [
7
] on the characterization of the closed loop
under SLS controllers that are synthesized from an arbitrary and potentially incorrect system model
(Lemma 3.2). This result enables the analysis of our algorithm when each subsystem uses local,
asynchronous, and wrong model information for local controller synthesis. Further, we derive a
novel perturbation result with explicit constants for finite-horizon SLS synthesis (Theorem 3.4) that
globally bounds the sensitivity of the optimal solution to the SLS problem (a quadratic program
with equality and sparsity constraints) with respect to the model. This result is also applicable in
other contexts such as a class of MPC problems studied in [6, 15, 68].
1.2 Related work
This work contributes to a large and growing body of work on the topics related to learning-based
control design, online control, and distributed control. We briefly review the literature most related
to this work below.
Proc. ACM Meas. Anal. Comput. Syst., Vol. 7, No. 1, Article 26. Publication date: March 2023.
Online Adversarial Stabilization of Unknown Networked Systems
26:3
Stabilization of unknown systems.
Stabilizing unknown linear systems has long been a
fundamental problem studied in adaptive control theory [
42
]. It recently reemerged as a learning
problem and received considerable attention from the machine learning community [
40
,
59
,
75
,
91
].
Most works have been developed under single-agent setting, with a no-noise assumption [
47
,
74
] or
Gausssian noise models [
28
,
46
]. Under the adversarial noise setting, which is the focus of this paper,
the only work that guarantees stabilization for LTI systems is [
18
], with a system identification-
based approach that achieves order-optimal regret. In contrast, we propose a novel framework
for stabilization under adversarial noise that does not rely on accurate identification of the true
dynamics. In particular, our method is the first algorithm to stabilize a networked LTI system under
adversarial disturbances with information constraints while simultaneously achieving magnitudes
of improvement in empirical performance over the state-of-the-art identification-based approach
[18] in the single-agent setting, despite the regret-optimal guarantee in [18].
Distributed control.
Motivated by large-scale cyberphysical systems that are composed of
physically distributed subsystems with local dynamical interactions, there is a large body of work
on control design for networked systems [
7
,
45
,
92
]. Cyberphysical systems such as the power grid
are commonly constrained by a communication layer that allows specific structure of information
exchange among the subsystems. such information structure imposes significant challenges for
optimal control design, often rendering the problem NP-hard [
76
]. In [
64
], it was shown that a large
class of practically relevant distributed control problems is convex and tractable to solve. Since
then, many works have focused on this class of problems [
30
,
48
]. However, [
83
] observes that the
complexity of computation and implementation of distributed controllers developed under this
setting can be prohibitively expensive, thus not scalable to large-scale systems. The System Level
Synthesis (SLS) framework is developed as a scalable alternative to distributed control design [
7
].
In particular, SLS allows order-constant complexity for synthesis and implementation, due to its
special parameterization and implementation of the feedback controller. As a result, many works
have adopted SLS as the basis for novel (learning-based) control algorithms in both distributed and
centralized setting [
6
,
21
,
24
,
79
]. We contribute to the literature on SLS by developing a suit of
technical results for SLS controllers that can find applications beyond the setting of this work.
Learning distributed controllers.
Many learning-based control algorithms for networked
systems adopt a centralized learning or computational approach with the objective of regret
minimization, e.g., [
16
,
27
,
31
,
32
,
87
]. All prior work use the stochastic noise or no-noise model and
assume a known stabilizing distributed controller is given [
4
–
6
,
44
,
52
,
73
]. As far as we are aware,
no previous work accommodates communication delay while doing both learning and control. The
most related to our work are [
37
] and [
31
], where learning-based SLS controllers are designed to
control unknown networked systems. Both of the methods require the knowledge of a stabilizing
and distributed controller. [
37
] is only applicable to small-uncertainty scenarios, while [
31
] requires
a stabilizing distributed controller and performs centralized learning. In this work, we focus on
stabilization and propose the first distributed learning-based control algorithm that guarantees
stability for unknown networked systems under adversarial disturbances.
Online learning.
The problem of online stabilization for unknown dynamical systems is an
instance of online decision making problems, where an agent makes a sequence of decisions based
on the feedback from an unknown environment with the goal of cost minimization. Online decision
making is studied extensively in the online learning literature, with a line of work [
34
,
53
,
66
,
88
] that
makes interesting connections between convex function and body chasing [
9
,
10
] and linear control
theory. In particular, [
38
] proposes an online nonlinear robust control method based on convex
body chasing that guarantees finite mistakes under adversarial disturbances without the need for
system identification. While [
38
] considers binary cost functions, we present novel technical results
Proc. ACM Meas. Anal. Comput. Syst., Vol. 7, No. 1, Article 26. Publication date: March 2023.
26:4
Jing Yu, Dimitar Ho, and Adam Wierman
that establish the first connection between convex body chasing and stability analysis for both
single-agent and networked linear dynamical systems.
1.3 Notation
Let
∥ · ∥
be the
ℓ
2
norm and
∥ · ∥
퐹
be the Frobenius norm. We denote the
(
푖, 푗
)
th position of a
matrix
푀
as
푀
(
푖, 푗
)
and use
푀
(
:
, 푗
)
,푀
(
푖,
:
)
for the
푗
th column and
푖
th row of
푀
respectively. We
use
[
푁
]
for the set of positive integers up to
푁
. Positive integers are denoted as
N
+
. Bold face
lower cases are reserved for vector signal of the form
x
:
=
[
푥
(
0
)
푇
,푥
(
1
)
푇
, . . .
]
푇
with
푥
(
푡
) ∈
R
푛
is
an infinite sequence of vectors indexed by time
푡
. We reserve bold face capital letters for causal
linear operators/transfer matrices with components
퐾
[
0
]
,퐾
[
1
]
, . . .,
such that
K
:
=
퐾
[
0
]
0
. . .
퐾
[
1
]
퐾
[
0
]
0
. . .
.
.
.
.
.
.
.
.
.
.
.
.
.
We write
y
=
Gx
to mean that
푦
(
푡
)
=
Í
푡
푘
=
0
퐺
[
푘
]
푥
(
푡
−
푘
)
. Given any binary matrix
C ∈ {
1
,
0
}
푁
×
푁
,
we say
푀
∈ C
for a matrix
푀
∈
R
푁
×
푁
if the sparsity of
푀
is
C
. We use
{
푒
푗
}
푛
푗
=
1
for the standard
basis in
R
푛
.
2 PRELIMINARIES AND PROBLEM SETUP
We consider the task of stabilizing an unknown networked system made up of
푁
interconnected,
heterogeneous linear time-invariant (LTI) subsystems, illustrated in Figure 1(a). For each subsystem
푖
∈ [
푁
]
, let
푥
푖
(
푡
) ∈
R
푛
푖
,
푢
푖
(
푡
) ∈
R
푚
푖
,
푤
푖
(
푡
) ∈
R
푛
푖
be the local state, control, and disturbance vectors
respectively. Each subsystem
푖
has dynamics,
푥
푖
(
푡
+
1
)
=
∑︁
푗
∈N(
푖
)