RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems
Author(s)
Liu, Bai; Xie, Qiaomin; Modiano, Eytan
Download3529375.pdf (2.602Mb)
Publisher Policy
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordAbstract
<jats:p>With the rapid advance of information technology, network systems have become increasingly complex and hence the underlying system dynamics are often unknown or difficult to characterize. Finding a good network control policy is of significant importance to achieve desirable network performance (e.g., high throughput or low delay). In this work, we consider using model-based reinforcement learning (RL) to learn the optimal control policy for queueing networks so that the average job delay (or equivalently the average queue backlog) is minimized. Traditional approaches in RL, however, cannot handle the unbounded state spaces of the network control problem. To overcome this difficulty, we propose a new algorithm, called RL for Queueing Networks (RL-QN), which applies model-based RL methods over a finite subset of the state space while applying a known stabilizing policy for the rest of the states. We establish that the average queue backlog under RL-QN with an appropriately constructed subset can be arbitrarily close to the optimal result. We evaluate RL-QN in dynamic server allocation, routing, and switching problems. Simulation results show that RL-QN minimizes the average queue backlog effectively.</jats:p>
Date issued
2022Department
Massachusetts Institute of Technology. Department of Aeronautics and AstronauticsJournal
ACM Transactions on Modeling and Performance Evaluation of Computing Systems
Publisher
Association for Computing Machinery (ACM)
Citation
Liu, Bai, Xie, Qiaomin and Modiano, Eytan. 2022. "RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems." ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 7 (1).
Version: Final published version