Qu, Guannan and Wierman, Adam and Li, Na (2022) Scalable Reinforcement Learning for Multiagent Networked Systems. Operations Research . ISSN 0030-364X. doi:10.1287/opre.2021.2226. (In Press) https://resolver.caltech.edu/CaltechAUTHORS:20220914-591652300
Full text is not posted in this repository. Consult Related URLs below.
Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20220914-591652300
Abstract
We study reinforcement learning (RL) in a setting with a network of agents whose states and actions interact in a local manner where the objective is to find localized policies such that the (discounted) global reward is maximized. A fundamental challenge in this setting is that the state-action space size scales exponentially in the number of agents, rendering the problem intractable for large networks. In this paper, we propose a scalable actor critic (SAC) framework that exploits the network structure and finds a localized policy that is an O(ρκ+1)-approximation of a stationary point of the objective for some ρ∈(0,1), with complexity that scales with the local state-action space size of the largest κ-hop neighborhood of the network. We illustrate our model and approach using examples from wireless communication, epidemics, and traffic.
Item Type: | Article | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Related URLs: |
| |||||||||
ORCID: |
| |||||||||
Alternate Title: | Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems | |||||||||
DOI: | 10.1287/opre.2021.2226 | |||||||||
Record Number: | CaltechAUTHORS:20220914-591652300 | |||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20220914-591652300 | |||||||||
Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | |||||||||
ID Code: | 116912 | |||||||||
Collection: | CaltechAUTHORS | |||||||||
Deposited By: | Tony Diaz | |||||||||
Deposited On: | 22 Sep 2022 19:44 | |||||||||
Last Modified: | 22 Sep 2022 19:44 |
Repository Staff Only: item control page