Moving Obstacle Avoidance: a Data-Driven
Risk-Aware Approach
Skylar X. Wei
, Anushri Dixit
, Shashank Tomar, and Joel W. Burdick,
Member, IEEE
Abstract
—
This paper proposes a new structured method
for a moving agent to predict the paths of dynamically
moving obstacles and avoid them using a risk-aware model
predictive control (MPC) scheme. Given noisy measure-
ments of the a priori unknown obstacle trajectory, a boot-
strapping technique predicts a set of obstacle trajectories.
The bootstrapped predictions are incorporated in the MPC
optimization using a risk-aware methodology so as to pro-
vide probabilistic guarantees on obstacle avoidance. We
validate our methods using simulations of a multi-rotor
drone that avoids various moving obstacles.
Index Terms
—
Predictive control for linear systems,
Stochastic optimal control, Uncertain systems, Robotics.
I. INTRODUCTION
E
MERGING applications of robots in urban, cluttered, and
potentially hostile environments have increased the im-
portance of online path planning with obstacle behavior clas-
sification and avoidance [1]. Traditionally, robot-obstacle in-
teraction is formulated as the problem of planning a collision-
free path from a starting position to a goal [2]. In environments
with an arbitrary number of moving obstacles and agents with
bounded velocity, this problem is known to be NP-hard [3].
One way to handle dynamic obstacles is to limit their modeled
motions. In [4], the authors assumed a priori knowledge of the
obstacle dynamics or motion patterns. Or, one can plan the
agent’s path off-line using a Probabilistic Roadmap (PRM) in
a field of static obstacles and then replan when dynamical be-
haviors are observed [5]. However, without prior knowledge of
an obstacle’s behavior, a worst-case analysis of unsafe set can
lead to conservative behavior. Potential fields (PFs) are actively
used for dynamic obstacle avoidance: e.g., Lam et al. [6] apply
artificial PFs with stochastic reachable sets in Human-Centered
environments. Slow moving and simple (linear or double
integrator-like) dynamics are assumed. Switching-based plan-
ning methods detect and classify dynamic obstacle behavior
against a set of trajectories, such as constant speed, linear, and
projectile-like motion [7], [8]. Classification-based methods
require distinguishable obstacle behaviors and prior knowledge
about the dynamic environment to generate set trajectories.
This paper presents a new framework for discovering the
dynamics of a priori unknown moving obstacles, forecasting
their trajectories, and providing risk-aware optimal avoidance
strategies. It replaces the need for obstacle trajectory/model
classification while allowing online computation. Extracting
a dynamics model from data is challenging [9], especially
Both authors contributed equally.
The authors are with the Division of Engineering
&
Applied Science,
California Institute of Technology, MC 104-44, Pasadena, CA 91125,
({swei, adixit, stomar, jburdick}@caltech.edu).
when the available data is limited, noisy, and partial. To
tackle partial measurements, we leverage Takens’ embedding
theorem [10], which uses partial observations to produce
an attractor that is diffeomorphic to the full-state attractor.
We then use Singular Spectrum Analysis (SSA) [11], [12]
to separate noise from the underlying signal and to extract
a predictive model of obstacle behavior. Our use of time
delay embedding is also the basis of Eigensystem Realization
Algorithm (ERA) in linear system identification [13]. Inspired
by [14], we use a classical bootstrap to forecast a set of
obstacle trajectories with statistical quantification. An MPC
planner then incorporates the set of obstacle forecasts as an
affine conservative approximation of a distributionally-robust
chance constraint (DRCC). This constraint is then efficiently
recast in a risk-aware manner, allowing an MPC optimization
based on sequential convex programming [15], [16].
We demonstrate our approach on three scenarios that exhibit
increasingly complicated dynamical behavior. Monte-Carlo
simulations verify the planner’s ability to uphold the user cho-
sen chance constraint. The risk-aware reformulation not only
gives provable probabilistic collision avoidance guarantees, but
also allows an on-line execution of the planner.
Notation:
The set of positive integers, natural numbers, real
numbers, and positive reals are denoted as
Z
+
,
N
,
R
, and
R
+
,
respectively. We denote the sequence of consecutive integers
f
i;
;i
+
k
g
as
Z
i
:
i
+
k
. The finite sequence
f
a
1
;
;a
k
g
of
scalars or vectors
a
is denoted as
f
a
g
k
1
. The expression
I
n
n
denotes
n
by
n
identity matrices and
1
= [1
;
1
;
1]
T
:
II. SSA P
RELIMINARIES
Consider a discrete-time multivariate stochastic process
f
o
m
g
N
1
where
m
denotes the
m
th
observable of the process out
of the total
N
o
available observables, and
N
is the number of
available observations. Suppose that the true stochastic process
model of the observables is
^
o
m
=
o
m
+
m
, where
m
de-
notes a random discrete-time zero-mean measurement noise,
and
o
m
is the noiseless observable that captures the governing
laws, which can be composed of
trends
,
seasons
, and
sta-
tionary
time series. Singular Spectrum Analysis [11] separates
the true signal
o
m
and the noise
m
and extracts a recursive
governing dynamic model of
o
m
that can generate a short term
accurate forecast.
Fig.
1
describes this method
.
1) Time Delay Embedding
:
Takens’ method of delays [10] can
reconstruct qualitative features of the full-state phase-space
from delayed partial observations. The
m
th
-state observables
^
o
m
are delay embedded into a trajectory (Hankel) matrix
H
m
[
L
N
]
,
Fig.
1
gives an example of the Hankel matrix for
state
x
. Parameter
L
is the time delay length, and
N
is the
This article has been accepted for publication in IEEE Control Systems Letters. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/LCSYS.2022.3181191
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Fig. 1
: A description of bootstrap-SSA-forecast architecture in forecasting the trajectory of a Frisbee where the stochastic observables (corrupted by zero-mean,
noise) consist of
f
^
o
g
N
1
= [
f
^
x
g
N
1
;
f
^
y
g
N
1
;
f
^
z
g
N
1
]
, the Frisbee’s center positions with respect to an inertial frame. The SSA analysis and bootstrap forecast
is applied to every observable state. Despite its 12-state governing dynamics [17] and with only center position measurements of the Frisbee, we show an
example
N
strap
forecasts of the Frisbee trajectory for future time steps
f
1
;
2
;
; N
h
g
using our proposed framework.
time series length. Repeating patterns in the Hankel matrix
represent underlying trends and oscillations, which can be
extracted from its covariance matrix:
X
m
=
H
m
[
L;N
]
(
H
m
[
L;N
]
)
T
.
2) Eigen Decomposition
:
To recover the true signal
o
m
, we
seek the best, low-rank matrix approximation of this sig-
nal by thresholding the eigenvalues of
X
m
[18]. The sym-
metric covariance matrix
X
m
has a spectral decomposition
U
U
T
, where
is a diagonal matrix with real eigenvalues
1
2
L
. The matrix of left eigenvectors
U
=
[
1
;
;
L
]
is orthogonal. The truncated right eigenvectors
V
= [
1
;
;
L
]
T
2
R
L
N
of
X
m
can be found as
V
=
U
.
Suppose
is the optimal threshold and
n
n
+1
, which
partitions the Hankel matrix
H
m
[
L;N
]
as:
H
m
[
L;N
]
=
n
∑
p
=1
√
p
p
T
p
|
{z
}
,
H
o
[
L;N
]
+
L
∑
p
=
n
+1
√
p
p
T
p
|
{z
}
,
H
[
L;N
]
:
(1)
3) Hankelization
:
Matrix
H
o
[
L;N
]
in (
1
) should maintain a Han-
kel structure: minor variations in its
k
th
secondary diagonals
result from insufficient noise removal.
1
A
Hankelization step
performs secondary diagonal averaging in order to find the
matrix
H
O
that is closest to
H
o
[
L;K
]
with respect to the
Frobenius norm among all
L
N
Hankel matrices [11]. The
operator
H
acting on
L
N
matrix
H
y
[
L;N
]
entry wise is
defined as: for the
(
i;j
)
th
element of matrix
H
o
[
L;N
]
and
i
+
j
=
s
, define a set
D
s
,
f
(
l;n
) :
l
+
n
=
s;l
2
Z
1:
L
;n
2
Z
1:
N
g
, is
mapped to
(
i;j
)
th
element of the Hankelized
H
H
o
[
L;N
]
via the
expression in Fig.
1
(for the case of
o
m
=
x
), where
j
D
s
j
de-
notes the number of elements in set
D
s
.
4) Forecast with Linear Recurrence Formula
:
Definition 1.
A time series
Y
N
=
f
y
g
N
1
admits an L-
decomposition of order not larger than d, denoted by
ord
L
(
Y
N
)
d
, if there exist two systems of functions
ρ
k
:
Z
0:
L