Learning Algorithms for Minimizing Queue Length Regret

Stahlbuhk, Thomas; Shrader, Brooke; Modiano, Eytan

dc.contributor.author	Stahlbuhk, Thomas
dc.contributor.author	Shrader, Brooke
dc.contributor.author	Modiano, Eytan
dc.date.accessioned	2021-10-27T20:30:30Z
dc.date.available	2021-10-27T20:30:30Z
dc.date.issued	2021
dc.identifier.uri	https://hdl.handle.net/1721.1/136035
dc.description.abstract	© 1963-2012 IEEE. We consider a system consisting of a single transmitter/receiver pair and N channels over which they may communicate. Packets randomly arrive to the transmitter's queue and wait to be successfully sent to the receiver. The transmitter may attempt a frame transmission on one channel at a time, where each frame includes a packet if one is in the queue. For each channel, an attempted transmission is successful with an unknown probability. The transmitter's objective is to quickly identify the best channel to minimize the number of packets in the queue over T time slots. To analyze system performance, we introduce queue length regret, which is the expected difference between the total queue length of a learning policy and a controller that knows the rates, a priori. One approach to designing a transmission policy would be to apply algorithms from the literature that solve the closely-related stochastic multi-armed bandit problem. These policies would focus on maximizing the number of successful frame transmissions over time. However, we show that these methods have Omega (log {{T}}) queue length regret. On the other hand, we show that there exists a set of queue-length based policies that can obtain order optimal {O}(1) queue length regret. We use our theoretical analysis to devise heuristic methods that are shown to perform well in simulation.
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.isversionof	10.1109/TIT.2021.3054854
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source	arXiv
dc.title	Learning Algorithms for Minimizing Queue Length Regret
dc.type	Article
dc.contributor.department	Lincoln Laboratory
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
dc.relation.journal	IEEE Transactions on Information Theory
dc.eprint.version	Original manuscript
dc.type.uri	http://purl.org/eprint/type/JournalArticle
eprint.status	http://purl.org/eprint/status/NonPeerReviewed
dc.date.updated	2021-05-03T17:27:39Z
dspace.orderedauthors	Stahlbuhk, T; Shrader, B; Modiano, E
dspace.date.submission	2021-05-03T17:27:40Z
mit.journal.volume	67
mit.journal.issue	3
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed

Files in this item

Name:: 2005.05206.pdf
Size:: 858.4Kb
Format:: PDF
Description:: Submitted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record