Sample-based bounds for coherent risk measures: Applications to policy synthesis and verification

Akella, Prithvi; Dixit, Anushri; Ahmadi, Mohamadreza; Burdick, Joel W.; Ames, Aaron D.

doi:10.1016/j.artint.2024.104195

Published November 2024 | Version Published

Journal Article Metadata-only

Sample-based bounds for coherent risk measures: Applications to policy synthesis and verification

1. California Institute of Technology

Autonomous systems are increasingly used in highly variable and uncertain environments giving rise to the pressing need to consider risk in both the synthesis and verification of policies for these systems. This paper first develops a sample-based method to upper bound the risk measure evaluation of a random variable whose distribution is unknown. These bounds permit us to generate high-confidence verification statements for a large class of robotic systems in a sample-efficient manner. Second, we develop a sample-based method to determine solutions to non-convex optimization problems that outperform a large fraction of the decision space of possible solutions. Both sample-based approaches then permit us to rapidly synthesize risk-aware policies that are guaranteed to achieve a minimum level of system performance. To showcase our approach in simulation, we verify a cooperative multi-agent system and develop a risk-aware controller that outperforms the system's baseline controller. Our approach can be extended to account for any g-entropic risk measure.

Copyright and License

Funding

This work was supported by the AFOSR, grant FA9550-19-1-0302.

Contributions

Prithvi Akella: Conceptualization, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. Anushri Dixit: Conceptualization, Formal analysis, Writing – review & editing. Mohamadreza Ahmadi: Conceptualization, Formal analysis, Supervision, Writing – review & editing. Joel W. Burdick: Formal analysis, Supervision, Validation, Writing – review & editing. Aaron D. Ames: Funding acquisition, Writing – review & editing.

Data Availability

No data was used for the research described in the article.

Additional details

Is new version of: Discussion Paper: arXiv:2204.09833 (arXiv)

United States Air Force Office of Scientific Research
FA9550-19-1-0302

Accepted: 2024-07-27

Accepted
Available: 2024-08-02

Published online
Available: 2024-08-20

Version of Record online

Publication Status: Published

Views

Downloads

Show more details

	All versions	This version
Views	10	10
Downloads	7	0
Data volume	8.2 MB	0 Bytes

More info on how stats are collected....

DOI

Resource type

Journal Article

Publisher

Elsevier

Published in

Artificial Intelligence, 336, 104195, ISSN: 0004-3702, 2024.

Languages

English

Technical metadata

Created: October 10, 2024
Modified: October 10, 2024

Sample-based bounds for coherent risk measures: Applications to policy synthesis and verification

Copyright and License

Funding

Contributions

Data Availability

Additional details

Related works

Funding

Dates

Caltech Custom Metadata

Sample-based bounds for coherent risk measures: Applications to policy synthesis and verification

Creators

Abstract

Copyright and License

Funding

Contributions

Data Availability

Additional details

Related works

Funding

Dates

Caltech Custom Metadata