Published November 2024 | Version Published
Journal Article

Sample-based bounds for coherent risk measures: Applications to policy synthesis and verification

Abstract

Autonomous systems are increasingly used in highly variable and uncertain environments giving rise to the pressing need to consider risk in both the synthesis and verification of policies for these systems. This paper first develops a sample-based method to upper bound the risk measure evaluation of a random variable whose distribution is unknown. These bounds permit us to generate high-confidence verification statements for a large class of robotic systems in a sample-efficient manner. Second, we develop a sample-based method to determine solutions to non-convex optimization problems that outperform a large fraction of the decision space of possible solutions. Both sample-based approaches then permit us to rapidly synthesize risk-aware policies that are guaranteed to achieve a minimum level of system performance. To showcase our approach in simulation, we verify a cooperative multi-agent system and develop a risk-aware controller that outperforms the system's baseline controller. Our approach can be extended to account for any g-entropic risk measure.

Copyright and License

© 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

Funding

This work was supported by the AFOSR, grant FA9550-19-1-0302.

Contributions

Prithvi Akella: Conceptualization, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. Anushri Dixit: Conceptualization, Formal analysis, Writing – review & editing. Mohamadreza Ahmadi: Conceptualization, Formal analysis, Supervision, Writing – review & editing. Joel W. Burdick: Formal analysis, Supervision, Validation, Writing – review & editing. Aaron D. Ames: Funding acquisition, Writing – review & editing.

Data Availability

No data was used for the research described in the article.

Additional details

Related works

Is new version of
Discussion Paper: arXiv:2204.09833 (arXiv)

Funding

United States Air Force Office of Scientific Research
FA9550-19-1-0302

Dates

Accepted
2024-07-27
Accepted
Available
2024-08-02
Published online
Available
2024-08-20
Version of Record online

Caltech Custom Metadata

Publication Status
Published