dc.contributor.advisor | Eytan Modiano. | en_US |
dc.contributor.author | Liu, Bai(Aerospace scientist)Massachusetts Institute of Technology. | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics. | en_US |
dc.date.accessioned | 2019-10-04T21:33:15Z | |
dc.date.available | 2019-10-04T21:33:15Z | |
dc.date.copyright | 2019 | en_US |
dc.date.issued | 2019 | en_US |
dc.identifier.uri | https://hdl.handle.net/1721.1/122414 | |
dc.description | Thesis: S.M., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2019 | en_US |
dc.description | Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (pages 59-91). | en_US |
dc.description.abstract | With the rapid growth of information technology, network systems have become increasingly complex. In particular, designing network control policies requires knowledge of underlying network dynamics, which are often unknown, and need to be learned. Existing reinforcement learning methods such as Q-Learning, Actor-Critic, etc. are heuristic and do not offer performance guarantees. In contrast, model-based learning methods offer performance guarantees, but can only be applied with bounded state spaces. In the thesis, we propose to use model-based reinforcement learning. By applying Lyapunov analysis, our algorithm can be applied to queueing networks with unbounded state spaces. We prove that under our algorithm, the average queue backlog can get arbitrarily close to the optimal result. We also implement simulations to illustrate the effectiveness of our algorithm. | en_US |
dc.description.statementofresponsibility | by Bai Liu. | en_US |
dc.format.extent | 61 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Aeronautics and Astronautics. | en_US |
dc.title | Reinforcement learning in network control | en_US |
dc.type | Thesis | en_US |
dc.description.degree | S.M. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.identifier.oclc | 1119730914 | en_US |
dc.description.collection | S.M. Massachusetts Institute of Technology, Department of Aeronautics and Astronautics | en_US |
dspace.imported | 2019-10-04T21:33:13Z | en_US |
mit.thesis.degree | Master | en_US |
mit.thesis.department | Aero | en_US |