CaltechAUTHORS
  A Caltech Library Service

Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

Ahmadi, Mohamadreza and Singletary, Andrew and Burdick, Joel W. and Ames, Aaron D. (2019) Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions. In: 2019 IEEE 58th Conference on Decision and Control (CDC). IEEE , Piscataway, NJ, pp. 4797-4803. ISBN 978-1-7281-1398-2. https://resolver.caltech.edu/CaltechAUTHORS:20190410-120651366

[img] PDF - Submitted Version
See Usage Policy.

2MB

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20190410-120651366

Abstract

A multi-agent partially observable Markov decision process (MPOMDP) is a modeling paradigm used for high-level planning of heterogeneous autonomous agents subject to uncertainty and partial observation. Despite their modeling efficiency, MPOMDPs have not received significant attention in safety-critical settings. In this paper, we use barrier functions to design policies for MPOMDPs that ensure safety. Notably, our method does not rely on discretizations of the belief space, or finite memory. To this end, we formulate sufficient and necessary conditions for the safety of a given set based on discrete-time barrier functions (DTBFs) and we demonstrate that our formulation also allows for Boolean compositions of DTBFs for representing more complicated safe sets. We show that the proposed method can be implemented online by a sequence of one-step greedy algorithms as a standalone safe controller or as a safety-filter given a nominal planning policy. We illustrate the efficiency of the proposed methodology based on DTBFs using a high-fidelity simulation of heterogeneous robots.


Item Type:Book Section
Related URLs:
URLURL TypeDescription
https://doi.org/10.1109/CDC40024.2019.9030241DOIArticle
https://arxiv.org/abs/1903.07823arXivDiscussion Paper
ORCID:
AuthorORCID
Ahmadi, Mohamadreza0000-0003-1447-3012
Singletary, Andrew0000-0001-6635-4256
Ames, Aaron D.0000-0003-0848-3177
Additional Information:© 2019 IEEE.
Record Number:CaltechAUTHORS:20190410-120651366
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20190410-120651366
Official Citation:M. Ahmadi, A. Singletary, J. W. Burdick and A. D. Ames, "Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions," 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 2019, pp. 4797-4803, doi: 10.1109/CDC40024.2019.9030241
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:94638
Collection:CaltechAUTHORS
Deposited By: George Porter
Deposited On:10 Apr 2019 20:00
Last Modified:10 Sep 2020 20:57

Repository Staff Only: item control page