Moving Obstacle Avoidance: a Data-Driven
Risk-Aware Approach
Skylar X. Wei, Anushri Dixit, Shashank Tomar, and Joel W. Burdick, Member, IEEE

Both authors contributed equally. The authors are with the Division of Engineering & Applied Science, California Institute of Technology, MC 104-44, Pasadena, CA 91125 ({swei, adixit, stomar, jburdick}@caltech.edu).
Abstract— This paper proposes a new structured method for a moving agent to predict the paths of dynamically moving obstacles and avoid them using a risk-aware model predictive control (MPC) scheme. Given noisy measurements of the a priori unknown obstacle trajectory, a bootstrapping technique predicts a set of obstacle trajectories. The bootstrapped predictions are incorporated in the MPC optimization using a risk-aware methodology so as to provide probabilistic guarantees on obstacle avoidance. We validate our methods using simulations of a multi-rotor drone that avoids various moving obstacles.
Index Terms— Predictive control for linear systems, stochastic optimal control, uncertain systems, robotics.
I. INTRODUCTION

Emerging applications of robots in urban, cluttered, and potentially hostile environments have increased the importance of online path planning with obstacle behavior classification and avoidance [1]. Traditionally, robot-obstacle interaction is formulated as the problem of planning a collision-free path from a starting position to a goal [2]. In environments with an arbitrary number of moving obstacles and agents with bounded velocity, this problem is known to be NP-hard [3]. One way to handle dynamic obstacles is to limit their modeled motions. In [4], the authors assumed a priori knowledge of the obstacle dynamics or motion patterns. Alternatively, one can plan the agent's path off-line using a Probabilistic Roadmap (PRM) in a field of static obstacles and then replan when dynamical behaviors are observed [5]. However, without prior knowledge of an obstacle's behavior, a worst-case analysis of the unsafe set can lead to conservative behavior. Potential fields (PFs) are actively used for dynamic obstacle avoidance: e.g., Lam et al. [6] apply artificial PFs with stochastic reachable sets in human-centered environments, assuming slow-moving and simple (linear or double-integrator-like) dynamics. Switching-based planning methods detect and classify dynamic obstacle behavior against a set of trajectories, such as constant speed, linear, and projectile-like motion [7], [8]. Classification-based methods require distinguishable obstacle behaviors and prior knowledge about the dynamic environment to generate set trajectories.
This paper presents a new framework for discovering the
dynamics of a priori unknown moving obstacles, forecasting
their trajectories, and providing risk-aware optimal avoidance
strategies. It replaces the need for obstacle trajectory/model
classification while allowing online computation. Extracting
a dynamics model from data is challenging [9], especially when the available data is limited, noisy, and partial. To tackle partial measurements, we leverage Takens' embedding theorem [10], which uses partial observations to produce an attractor that is diffeomorphic to the full-state attractor. We then use Singular Spectrum Analysis (SSA) [11], [12] to separate noise from the underlying signal and to extract a predictive model of obstacle behavior. Our use of time delay embedding is also the basis of the Eigensystem Realization Algorithm (ERA) in linear system identification [13]. Inspired by [14], we use a classical bootstrap to forecast a set of obstacle trajectories with statistical quantification. An MPC planner then incorporates the set of obstacle forecasts as an affine conservative approximation of a distributionally-robust chance constraint (DRCC). This constraint is then efficiently recast in a risk-aware manner, allowing an MPC optimization based on sequential convex programming [15], [16].

We demonstrate our approach on three scenarios that exhibit increasingly complicated dynamical behavior. Monte-Carlo simulations verify the planner's ability to uphold the user-chosen chance constraint. The risk-aware reformulation not only gives provable probabilistic collision avoidance guarantees, but also allows an on-line execution of the planner.
Notation: The sets of positive integers, natural numbers, real numbers, and positive reals are denoted by $\mathbb{Z}^+$, $\mathbb{N}$, $\mathbb{R}$, and $\mathbb{R}^+$, respectively. We denote the sequence of consecutive integers $\{i, \dots, i+k\}$ as $\mathbb{Z}_{i:i+k}$. The finite sequence $\{a_1, \dots, a_k\}$ of scalars or vectors $a$ is denoted as $\{a\}_1^k$. The expression $I_{n\times n}$ denotes the $n \times n$ identity matrix and $\mathbf{1} = [1, 1, 1]^T$.
II. SSA PRELIMINARIES
Consider a discrete-time multivariate stochastic process $\{o_m\}_1^N$, where $m$ denotes the $m$th observable of the process out of the total $N_o$ available observables, and $N$ is the number of available observations. Suppose that the true stochastic process model of the observables is $\hat{o}_m = o_m + \epsilon_m$, where $\epsilon_m$ denotes a random discrete-time zero-mean measurement noise and $o_m$ is the noiseless observable that captures the governing laws, which can be composed of trends, seasons, and stationary time series. Singular Spectrum Analysis [11] separates the true signal $o_m$ from the noise $\epsilon_m$ and extracts a recursive governing dynamic model of $o_m$ that can generate accurate short-term forecasts. Fig. 1 describes this method.
1) Time Delay Embedding: Takens' method of delays [10] can reconstruct qualitative features of the full-state phase space from delayed partial observations. The $m$th-state observables $\hat{o}_m$ are delay embedded into a trajectory (Hankel) matrix $H^m_{[L,N]}$; Fig. 1 gives an example of the Hankel matrix for state $x$. Parameter $L$ is the time delay length, and $N$ is the time series length.
Fig. 1: A description of the bootstrap-SSA-forecast architecture for forecasting the trajectory of a Frisbee, where the stochastic observables (corrupted by zero-mean noise) consist of $\{\hat{o}\}_1^N = [\{\hat{x}\}_1^N; \{\hat{y}\}_1^N; \{\hat{z}\}_1^N]$, the Frisbee's center positions with respect to an inertial frame. The SSA analysis and bootstrap forecast are applied to every observable state. Despite the Frisbee's 12-state governing dynamics [17], and with only center position measurements, we show example $N_{strap}$ forecasts of the Frisbee trajectory for future time steps $\{1, 2, \dots, N_h\}$ using our proposed framework.
Repeating patterns in the Hankel matrix represent underlying trends and oscillations, which can be extracted from its covariance matrix $X_m = H^m_{[L,N]}\big(H^m_{[L,N]}\big)^T$.
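As a concrete illustration of the delay embedding and covariance construction above, the following minimal Python/numpy sketch builds a trajectory matrix from a noisy scalar observable and forms $X_m = H H^T$; the toy observable, the delay length, and the $L \times (N-L+1)$ Hankel shape are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def hankel_matrix(obs, L):
    """Delay-embed a 1-D observable series into an L x K trajectory (Hankel) matrix.

    Here K = N - L + 1; each column is a length-L window of the series, so the
    entries are constant along secondary diagonals (i + j = const).
    """
    obs = np.asarray(obs, dtype=float)
    K = obs.size - L + 1
    return np.column_stack([obs[i:i + L] for i in range(K)])

# Toy noisy observable (illustrative only) and its trajectory/covariance matrices.
o_hat = np.sin(0.1 * np.arange(200)) + 0.05 * np.random.randn(200)
H = hankel_matrix(o_hat, L=50)
X = H @ H.T    # covariance X_m = H H^T; its spectrum exposes trends/oscillations
```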
2) Eigen Decomposition: To recover the true signal $o_m$, we seek the best low-rank matrix approximation of this signal by thresholding the eigenvalues of $X_m$ [18]. The symmetric covariance matrix $X_m$ has a spectral decomposition $U\Lambda U^T$, where $\Lambda$ is a diagonal matrix with real eigenvalues $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_L$. The matrix of left eigenvectors $U = [\mu_1, \dots, \mu_L]$ is orthogonal. The truncated right eigenvectors $V = [\nu_1, \dots, \nu_L]^T \in \mathbb{R}^{L\times N}$ of $X_m$ can be found as $V = \Lambda^{-1/2} U^T H^m_{[L,N]}$.
Suppose $\lambda^*$ is the optimal threshold and $\lambda_n \geq \lambda^* \geq \lambda_{n+1}$, which partitions the Hankel matrix $H^m_{[L,N]}$ as:
$$H^m_{[L,N]} = \underbrace{\sum_{p=1}^{n} \sqrt{\lambda_p}\,\mu_p \nu_p^T}_{\triangleq\, H^o_{[L,N]}} + \underbrace{\sum_{p=n+1}^{L} \sqrt{\lambda_p}\,\mu_p \nu_p^T}_{\triangleq\, H^\epsilon_{[L,N]}}. \qquad (1)$$
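To make the partition (1) concrete, the sketch below performs the eigen-decomposition of $X_m$ and keeps the top-$n$ terms; the truncation rank $n$ is an illustrative input here (Section IV estimates it via the threshold rule (7)), and the relation $\nu_p = H^T \mu_p / \sqrt{\lambda_p}$ is assumed from the decomposition $X_m = U\Lambda U^T$:

```python
import numpy as np

def truncated_signal(H, n):
    """Rank-n 'signal' part of the trajectory matrix per (1):
    H_o = sum_{p<=n} sqrt(lam_p) * mu_p nu_p^T, with nu_p = H^T mu_p / sqrt(lam_p)."""
    X = H @ H.T                                 # covariance X_m = H H^T
    lams, U = np.linalg.eigh(X)                 # eigh returns ascending eigenvalues
    order = np.argsort(lams)[::-1]              # reorder so lam_1 >= lam_2 >= ... >= lam_L
    lams, U = lams[order], U[:, order]
    H_o = np.zeros_like(H)
    for p in range(n):
        mu_p = U[:, p]                          # left eigenvector mu_p
        nu_p = (H.T @ mu_p) / np.sqrt(lams[p])  # right eigenvector nu_p
        H_o += np.sqrt(lams[p]) * np.outer(mu_p, nu_p)
    return H_o, lams, U
```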
3) Hankelization: Matrix $H^o_{[L,N]}$ in (1) should maintain a Hankel structure: minor variations in its $k$th secondary diagonals result from insufficient noise removal.¹ A Hankelization step performs secondary-diagonal averaging in order to find the matrix $\mathcal{H}H^o$ that is closest to $H^o_{[L,N]}$ with respect to the Frobenius norm among all $L\times N$ Hankel matrices [11]. The operator $\mathcal{H}$, acting on an $L\times N$ matrix $H^y_{[L,N]}$ entry-wise, is defined as follows: for the $(i,j)$th element of matrix $H^o_{[L,N]}$ with $i+j=s$, define the set $D_s \triangleq \{(l,n) : l+n=s,\ l \in \mathbb{Z}_{1:L},\ n \in \mathbb{Z}_{1:N}\}$; this set is mapped to the $(i,j)$th element of the Hankelized $\mathcal{H}H^o_{[L,N]}$ via the expression in Fig. 1 (for the case of $o_m = x$), where $|D_s|$ denotes the number of elements in set $D_s$.
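A minimal sketch of the Hankelization operator $\mathcal{H}$ (averaging over each index set $D_s$) is given below; reading the denoised series off the first column and last row of the Hankelized matrix is a common SSA convention assumed here, not a step quoted from the paper:

```python
import numpy as np

def hankelize(M):
    """Secondary-diagonal averaging: the Hankel matrix closest to M in Frobenius
    norm, obtained by averaging the entries of each set D_s = {(i, j) : i + j = s}."""
    L, K = M.shape
    out = np.empty_like(M)
    for s in range(L + K - 1):
        idx = [(i, s - i) for i in range(L) if 0 <= s - i < K]
        avg = sum(M[i, j] for i, j in idx) / len(idx)   # average over the |D_s| entries
        for i, j in idx:
            out[i, j] = avg
    return out

# Example (reusing H from the embedding sketch): keep the top-n eigen-triples,
# Hankelize, and read the denoised series off the first column and last row.
# H_o, _, _ = truncated_signal(H, n=4)
# o_rec = np.r_[hankelize(H_o)[:, 0], hankelize(H_o)[-1, 1:]]
```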
4) Forecast with Linear Recurrence Formula:

Definition 1. A time series $Y_N = \{y\}_1^N$ admits an L-decomposition of order not larger than $d$, denoted by $\mathrm{ord}_L(Y_N) \leq d$, if there exist two systems of functions $\rho_k : \mathbb{Z}_{0:L-1} \to \mathbb{R}$ and $\vartheta_k : \mathbb{Z}_{0:L-1} \to \mathbb{R}$ such that
$$y_{i+j} = \sum_{k=1}^{d} \vartheta_k(i)\,\rho_k(j), \quad \forall\, \{i,j\} \in \mathbb{Z}_{0:L-1} \times \mathbb{Z}_{0:L-1},\ \forall\, k \in \mathbb{Z}_{1:d}.$$
¹The $k$th secondary diagonals of a matrix $M$ are also the $k$th diagonals of $M$ flipped horizontally with respect to its middle column.
If $\mathrm{ord}_L(Y_N) = d$, then the series $Y_N$ admits an L-decomposition of order $d$, and both systems of functions $(\rho_1, \dots, \rho_d)$ and $(\vartheta_1, \dots, \vartheta_d)$ are linearly independent [19].
Definition 2. A time series $\{y\}_1^N$ is governed by a linear recurrence relation/formula (LRF) if there exist coefficients $\{\varphi\}_1^d$ with $\varphi_d \neq 0$ such that
$$y_{i+d} = \sum_{k=1}^{d} \varphi_k\, y_{i+d-k}, \quad \forall\, i \in \mathbb{Z}_{0:N-d},\ d < N. \qquad (2)$$
Real-valued time series governed by LRFs consist of sums of products of polynomials, exponentials, and sinusoids [11].
Theorem 1. [11] Let $\mu_i^{1:L-1}$ be the vector of the first $L-1$ components of a left eigenvector $\mu_i$ of $H^m_{[L,N]}$, and let $\pi_i$ be the $L$th component of eigenvector $\mu_i$. Let $v^2 \triangleq \sum_{i=1}^{d} \pi_i^2$. Under Assumptions 2 and 3 (see below), the LRF coefficients $\varphi_i$, $i \in [1, L-1]$, can be computed as:
$$[\varphi_{L-1}\ \varphi_{L-2}\ \cdots\ \varphi_1]^T = \frac{1}{1 - v^2} \sum_{i=1}^{d} \pi_i\, \mu_i^{1:L-1}, \qquad (3)$$
and $y$ evolves as the LRF: $y_{N+1} = \sum_{j=1}^{L-1} \varphi_j\, y_{N+1-j}$.
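The coefficient computation (3) and the resulting forecast recursion can be sketched as follows, assuming the first $d$ left eigenvectors are available as the columns of `U_d` (e.g., from the truncation sketch above); the coefficient ordering and the one-step recursion follow (2)-(3):

```python
import numpy as np

def lrf_coefficients(U_d):
    """LRF coefficients per (3). U_d holds the first d left eigenvectors as columns
    (each of length L); pi_i is the L-th component of eigenvector mu_i."""
    pi = U_d[-1, :]                          # L-th components pi_i
    v2 = np.sum(pi ** 2)                     # v^2 = sum_i pi_i^2
    phi = (U_d[:-1, :] @ pi) / (1.0 - v2)    # [phi_{L-1}, ..., phi_1]^T as in (3)
    return phi[::-1]                         # reorder to [phi_1, ..., phi_{L-1}]

def lrf_forecast(series, phi, n_steps):
    """Iterate y_{N+1} = sum_j phi_j * y_{N+1-j} to produce n_steps future samples."""
    y = list(np.asarray(series, dtype=float))
    d = len(phi)
    for _ in range(n_steps):
        window = np.array(y[-d:])[::-1]      # y_N, y_{N-1}, ... matched with phi_1, phi_2, ...
        y.append(float(phi @ window))
    return np.array(y[-n_steps:])
```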
III. PROBLEM STATEMENT
Consider the linear, discrete-time dynamical agent model:
$$x_{i+1} = A x_i + B u_i, \qquad y_{i+1} = G x_{i+1}, \qquad (4)$$
where $x_i \in \mathbb{R}^{n_x}$, $u_i \in \mathbb{R}^{n_u}$, and $y_i \in \mathbb{R}^{n_y}$, for all $i \in \mathbb{N}$, correspond to the system states, controls, and outputs at time index $i$, respectively. The state transition, actuation, and measurement matrices are $A \in \mathbb{R}^{n_x \times n_x}$, $B \in \mathbb{R}^{n_x \times n_u}$, and $G \in \mathbb{R}^{n_y \times n_x}$, respectively. Constant matrix $C \in \mathbb{R}^{3 \times n_x}$ maps the system's states (4) to the system's $x, y, z$ positions with respect to inertial frame $E$.
We model the $k$th obstacle, $k \in \mathbb{Z}_{1:N_{obs}}$, as a sphere. The obstacle occupies the point set $\mathcal{O}_k(c_k, r_k) = \{x \in \mathbb{R}^3 : \|c_k - x\|_2 \leq r_k\}$, where $c_k \in \mathbb{R}^3$ and $r_k \in \mathbb{R}^+$ are the $k$th obstacle's center and radius.
We consider the case where the agent (4) is tasked with following a reference output trajectory $y^{ref}$ which need not consider obstacle information.
While following this path, the agent may encounter $N_{obs}$ spherical, stationary or moving obstacles. The obstacle-free region is the open set:
$$\mathcal{S} \triangleq \mathbb{R}^3 \setminus \bigcup_{k=1}^{N_{obs}} \mathcal{O}_k. \qquad (5)$$
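For concreteness, the spherical obstacle sets $\mathcal{O}_k$ and the free-space membership test implied by (5) can be sketched as below, with hypothetical centers and radii used purely for illustration:

```python
import numpy as np

def in_obstacle(x, c_k, r_k):
    """True if x lies in O_k(c_k, r_k) = {x in R^3 : ||c_k - x||_2 <= r_k}."""
    return np.linalg.norm(np.asarray(c_k) - np.asarray(x)) <= r_k

def in_free_space(x, centers, radii):
    """Membership test for the obstacle-free set S of (5)."""
    return not any(in_obstacle(x, c, r) for c, r in zip(centers, radii))

# Hypothetical obstacle data, for illustration only.
centers = [np.array([1.0, 0.0, 2.0]), np.array([-2.0, 1.0, 0.5])]
radii = [0.5, 0.8]
print(in_free_space([0.0, 0.0, 0.0], centers, radii))   # True: the origin clears both spheres
```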
Assumption 1. Obstacles can be detected and localized at the same rate ($f^+$ Hz) as the planner update. Only measurements of an obstacle's geometric center with respect to frame $E$ are assumed, and they are corrupted by a zero-mean noise. We can estimate the radius $r_k$ of the $k$th obstacle as $\hat{r}_k$, and the estimate satisfies $\hat{r}_k \geq r_k$.²
Assumption 2. All obstacle measurements admit an L-decomposition of order $d$ and are governed by LRFs (2) whose coefficients can be uniquely defined.
Assumption 3. We assume that the obstacles' velocities are bounded by $v_{max}$, and that the initial distances between all obstacles and the agent are significantly greater than $d\, v_{max} / f^+$.
Problem 1. [Prediction] Consider a multivariate stochastic process where observables $\{x\}_1^N$, $\{y\}_1^N$, and $\{z\}_1^N$ denote the spherical obstacle's true center location in reference frame $E$. The measurements are corrupted by independent, zero-mean noises $\{\epsilon_1\}_1^N$, $\{\epsilon_2\}_1^N$, and $\{\epsilon_3\}_1^N$ (see Fig. 1). Under Assumptions 1-3, we seek to predict the obstacle position at times $N+1$ to $N+N_h$ using these measurements.
Due to limited and noisy partial data and the lack of explicit dynamics models, we estimate a bootstrap distribution of the obstacle predictions, denoted by the random set $\mathcal{O}^{pred}$, from time index $N+1$ to $N+N_h$ and calculate its first and second moments. We account for errors in the forecasts due to poor signal and noise separation and bandwidth limits (due to limited training data and incorrect choices of embedding length $L$) by solving a DRCC MPC problem.
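As a sketch of how the first and second moments of the bootstrap prediction set $\mathcal{O}^{pred}$ can be computed, assume an array of $N_{strap}$ forecast trajectories for one obstacle; the array layout and function name are assumptions for illustration:

```python
import numpy as np

def bootstrap_moments(forecasts):
    """Empirical mean and covariance, per horizon step, of a bootstrap forecast set.

    forecasts: assumed array of shape (N_strap, N_h, 3) holding the bootstrapped
    center-position forecasts of one obstacle over the planning horizon.
    """
    mean = forecasts.mean(axis=0)                    # (N_h, 3) first moment
    centered = forecasts - mean
    cov = np.einsum('bhi,bhj->hij', centered, centered) / (forecasts.shape[0] - 1)
    return mean, cov                                 # cov: (N_h, 3, 3) second moment
```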
Problem 2. [Planning] Consider the system (4) and free-space (5). Given a discrete-time reference trajectory $y_i^{ref}$, $\forall i \in \mathbb{Z}_{1:N_h}$, where $N_h \in \mathbb{Z}^+$ is the length of the horizon, convex state constraints $\mathcal{D}_x \subseteq \mathbb{R}^{n_x}$, convex input constraints $\mathcal{D}_u \subseteq \mathbb{R}^{n_u}$, a convex stage cost $\mathcal{L}_i : \mathbb{R}^{n_x} \times \mathbb{R}^{n_u} \to \mathbb{R}_{\geq 0}$, a total of $N_{obs}$ spherical obstacles each approximated by a set $\mathcal{O}_k^{pred}$, and a risk tolerance $\varepsilon \in (0, 1]$, we seek to compute a receding horizon controller $\{u^*\}_1^{N_h}$ that avoids the unsafe set $\mathcal{O}^{pred} \triangleq \bigcup_{k=1}^{N_{obs}} \mathcal{O}_k^{pred}$ via the following non-convex optimization:
$$\{u^*\}_1^{N_h} = \min_{\{u_k\}_1^{N_h} \in \mathbb{R}^{n_u N_h}}\ \sum_{i=1}^{N_h} \mathcal{L}_i\big(y_i^{ref} - y_i,\ u_i\big) \qquad \text{(6a)}$$
$$\text{s.t.}\quad x_{i+1} = A x_i + B u_i, \quad y_{i+1} = G x_{i+1}, \qquad \text{(6b)}$$
$$x_i \in \mathcal{D}_x, \quad u_i \in \mathcal{D}_u, \quad x_1 = x_{init}, \qquad \text{(6c)}$$
$$\mathbb{P}\big(x_i \in \mathcal{O}^{pred}\big) \leq \varepsilon, \quad \forall i \in \mathbb{Z}_{1:N_h}. \qquad \text{(6d)}$$
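The receding-horizon structure of (6a)-(6c) can be sketched with a generic convex modeling tool such as cvxpy; the quadratic stage cost, the box input set, and the matrix/variable names are placeholders, and the chance constraint (6d) is deliberately omitted here because Section V replaces it with its risk-aware (DRCC) reformulation:

```python
import cvxpy as cp
import numpy as np

def mpc_step(A, B, G, x_init, y_ref, N_h, u_max):
    """One receding-horizon solve of (6a)-(6c) with a quadratic stage cost.
    The obstacle chance constraint (6d) is not modeled here; it is handled by
    the risk-aware reformulation discussed in Section V."""
    n_x, n_u = A.shape[0], B.shape[1]
    x = cp.Variable((n_x, N_h + 1))
    u = cp.Variable((n_u, N_h))
    cost, constraints = 0, [x[:, 0] == x_init]
    for i in range(N_h):
        constraints += [x[:, i + 1] == A @ x[:, i] + B @ u[:, i],   # dynamics (6b)
                        cp.norm(u[:, i], 'inf') <= u_max]           # box stand-in for D_u
        cost += cp.sum_squares(y_ref[:, i] - G @ x[:, i + 1])       # tracking stage cost (placeholder)
        cost += 1e-2 * cp.sum_squares(u[:, i])                      # small control penalty
    cp.Problem(cp.Minimize(cost), constraints).solve()
    return u.value[:, 0]    # apply only the first control, then re-plan at the next step
```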
IV. BOOTSTRAP FORECASTING

Despite empirical successes in reconstruction and forecasting [12], the theoretical accuracy of SSA is difficult to establish; see [20]. Inspired by [14], we use bootstrapping to improve model discovery and to produce probabilistic forecasts.
²Note that Assumption 1 does not imply full state measurement.
Our real-time bootstrap forecast, Algorithm 1, assumes time series measurements corrupted by noise. The user-defined parameters $N_{train}$ and $N_{step}$ represent the number of initial training samples and the number of newly accumulated samples during an initial bootstrap. Further, one must choose the parameters $\lambda_t$ and $N_\lambda$, where the threshold $\lambda_t$ is used to separate signal from noise, and $N_\lambda$ is the number of steps of progressive relaxation of the threshold $\lambda_t$.³
In the desired signal/noise separation (1), the unknown theoretical optimal threshold $\lambda^*$ must be estimated. Let $Y_N^{\lambda_1:\lambda_d}$ be the Hankelization-reconstructed $\hat{y}$ obtained with the eigenvalues $\{\lambda\}_1^d$ and their corresponding right and left eigenvectors. Note that if $d > n$, where $\lambda_n \geq \lambda^* \geq \lambda_{n+1}$, then the norm values $\big\| Y_N^{\lambda_1:\lambda_{d+t}} - Y_N^{\lambda_1:\lambda_{d+t+1}} \big\|_2 \approx \big\| Y_N^{\lambda_1:\lambda_{d+t+1}} - Y_N^{\lambda_1:\lambda_{d+t+2}} \big\|_2$, since these reconstructions differ only by residual measurement noise. We threshold the difference between two consecutive reconstructions with $\lambda_t / N$, i.e., we find the smallest $t \in \mathbb{Z}^+$ such that:
$$\Big\| Y_N^{\lambda_1:\lambda_t} - Y_N^{\lambda_1:\lambda_{t+1}} \Big\|_2 - \Big\| Y_N^{\lambda_1:\lambda_{t+1}} - Y_N^{\lambda_1:\lambda_{t+2}} \Big\|_2 \leq \frac{\lambda_t}{N}. \qquad (7)$$
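A sketch of the rank-selection rule (7): `reconstruct(t)` stands in for the Hankelized rank-$t$ reconstruction $Y_N^{\lambda_1:\lambda_t}$ of Section II, and the function simply searches for the smallest $t$ satisfying (7); the search cap `t_max` is an added safeguard, not part of the paper:

```python
import numpy as np

def select_rank(reconstruct, lam_t, N, t_max=50):
    """Smallest t in Z+ satisfying (7): consecutive reconstructions differ by no more
    than lam_t / N once the newly added eigen-triples carry only residual noise.

    reconstruct(t) must return the rank-t Hankelized reconstruction of the series."""
    for t in range(1, t_max + 1):
        gap_t  = np.linalg.norm(reconstruct(t)     - reconstruct(t + 1))
        gap_t1 = np.linalg.norm(reconstruct(t + 1) - reconstruct(t + 2))
        if gap_t - gap_t1 <= lam_t / N:   # condition (7)
            return t
    return t_max
```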
Since the selection of the threshold $\lambda_t$ is crucial, we add an additional parameter $N_\lambda$ to ensure that no principal components are lost in $Y_N^{\lambda_1:\lambda_d}$ because of a bad choice of $\lambda_t$, i.e., to avoid $d < n$. We also include the next $N_\lambda$ largest eigenvalues after the first $t$ eigenvalues in the bootstrapping process. Most importantly, the number of bootstraps, $N_{strap}$, needs to be determined a priori, considering the computation capacity, the number of obstacles, and the expected noise level.⁴
Algorithm 1: Bootstrap Forecast Algorithm (Per Obstacle)

Data: obstacle center position measurements $\{\hat{x}\}_1^N$, $\{\hat{y}\}_1^N$, $\{\hat{z}\}_1^N$; user-defined constants $N_{train}$, $N_{step}$, $\lambda_t$, $N_\lambda$, $N_{strap}$
Result: forecasts $\{{}^j x\}_{N+1}^{N+N_h}$, $\{{}^j y\}_{N+1}^{N+N_h}$, $\{{}^j z\}_{N+1}^{N+N_h}$, $\forall j \in \mathbb{Z}_{1:N_{strap}}$

Use $\{\hat{x}_{N+1}, \hat{y}_{N+1}, \hat{z}_{N+1}\}$ to update the Hankel matrix
while istrap $\leq N_{strap}$ do
    while $N + 1 \leq N_{train}$ do
        for observable $= 1 : N_o$ do
            while (7) holds do
                $t$++
            end
            obtain ($\{\lambda_{istrap}\}_1^t$, $\{\mu_{istrap}\}_1^t$, $\varphi_{istrap}$) for each state, istrap++
            for $tt = t + 1 : t + N_\lambda$ do
                obtain ($\{\lambda_{istrap}\}_1^{tt}$, $\{\mu_{istrap}\}_1^{tt}$, $\varphi_{istrap}$) for each state, istrap++
            end
        end
        $N_{train} = N_{train} + N_{step}$
    end
    Back-up Strategy
end
Apply the tuples ($\{{}^j\lambda_{istrap}\}_1^{t_j}$, $\{{}^j\mu_{istrap}\}_1^{t_j}$, ${}^j\varphi_{istrap}$), $\forall j \in \mathbb{Z}_{1:N_{strap}}$, for $x, y, z$ to the updated Hankel matrix, where $t_j$ denotes the number of eigenvalues kept after truncation for the $j$th bootstrap. Perform an $N_h$-step forecast using ${}^j\varphi_{istrap}$.
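A heavily simplified, single-observable reading of Algorithm 1 is sketched below, reusing the helpers from the earlier sketches (`hankel_matrix`, `truncated_signal`, `hankelize`, `lrf_coefficients`, `lrf_forecast`). The residual-resampling bootstrap, the fixed truncation rank, and the omission of the $N_\lambda$ relaxation and of the back-up strategy are all simplifying assumptions, not the paper's exact procedure:

```python
import numpy as np

def bootstrap_forecast(obs, L, N_h, N_strap, rank, seed=0):
    """Simplified per-observable bootstrap forecast: split signal/noise via SSA,
    resample the residuals (assumed bootstrap mechanism), refit the LRF per (3),
    and forecast N_h steps for each of the N_strap replicates."""
    rng = np.random.default_rng(seed)
    obs = np.asarray(obs, dtype=float)
    H_o, _, _ = truncated_signal(hankel_matrix(obs, L), rank)   # signal part per (1)
    H_o = hankelize(H_o)
    signal = np.r_[H_o[:, 0], H_o[-1, 1:]]                      # denoised series (length N)
    residual = obs - signal
    forecasts = np.zeros((N_strap, N_h))
    for j in range(N_strap):
        resampled = signal + rng.choice(residual, size=residual.size, replace=True)
        _, _, Uj = truncated_signal(hankel_matrix(resampled, L), rank)
        phi = lrf_coefficients(Uj[:, :rank])                    # LRF coefficients per (3)
        forecasts[j] = lrf_forecast(resampled, phi, N_h)        # N_h-step forecast
    return forecasts
```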
V. BOOTSTRAP PLANNING

This section introduces an MPC-based path planner to solve Problem 2. First, we reformulate the obstacle avoidance
³The parameters $\lambda_t$ and $N_\lambda$ are dictated by measurement noise levels, which can be characterized off-line in a controlled experimental setting.
⁴The effectiveness of Algorithm 1 depends highly on the time delay length $L$, the number of training measurements $N_{train}$, the number of bootstraps $N_{strap}$, and the MPC horizon length $N_h$. We recommend that $N_{train}$ be at least $10 N_h$ and that $L = N_{train}/4$. $N_{strap}$ and $N_{step}$ should be as large as the computing platform allows, benchmarked offline.