Deep Reinforcement Learning for Continuous Power Allocation in Flexible High Throughput Satellites
Author(s)
Luis, Juan Jose Garau; Guerster, Markus; del Portillo, Inigo; Crawley, Edward; Cameron, Bruce Gregory
DownloadAccepted version (363.8Kb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
© 2019 IEEE. Many of the next generation of satellites will be equipped with numerous degrees of freedom in power and bandwidth allocation capabilities, making manual resource allocation impractical. Therefore, it is desirable to automate the operation of these highly flexible satellites. This paper presents a novel approach based on Deep Reinforcement Learning to allocate power in multibeam satellite systems. The proposed architecture represents the problem as continuous state and action spaces. We make use of the Proximal Policy Optimization algorithm to optimize the allocation policy for minimum unmet system demand and power consumption. Finally, the performance of the algorithm is analyzed through simulations of a multibeam satellite system. The analysis shows promising results for Deep Reinforcement Learning to be used as a dynamic resource allocation algorithm.
Date issued
2019-06Department
Massachusetts Institute of Technology. Department of Aeronautics and AstronauticsJournal
2019 IEEE Cognitive Communications for Aerospace Applications Workshop, CCAAW 2019
Publisher
IEEE
Citation
Luis, Juan Jose Garau, Guerster, Markus, del Portillo, Inigo, Crawley, Edward and Cameron, Bruce. 2019. "Deep Reinforcement Learning for Continuous Power Allocation in Flexible High Throughput Satellites." 2019 IEEE Cognitive Communications for Aerospace Applications Workshop, CCAAW 2019.
Version: Author's final manuscript