
dc.contributor.advisor: Shah, Devavrat
dc.contributor.author: Alumootil, Varkey
dc.date.accessioned: 2022-01-14T14:52:30Z
dc.date.available: 2022-01-14T14:52:30Z
dc.date.issued: 2021-06
dc.date.submitted: 2021-06-17T20:12:49.354Z
dc.identifier.uri: https://hdl.handle.net/1721.1/139143
dc.description.abstract: Performance of state-of-the-art offline and model-based reinforcement learning (RL) algorithms deteriorates significantly under severe data scarcity and in the presence of heterogeneous agents. In this work, we propose a model-based offline RL method to approach this setting. Using all available data from the various agents, we construct personalized simulators for each individual agent, which are then used to train RL policies. We do so by modeling the transition dynamics of the agents as a low-rank tensor decomposition of latent factors associated with agents, states, and actions. We perform experiments on various benchmark environments and demonstrate improvement over existing offline approaches in the scarce data regime.
dc.publisher: Massachusetts Institute of Technology
dc.rights: In Copyright - Educational Use Permitted
dc.rights: Copyright MIT
dc.rights.uri: http://rightsstatements.org/page/InC-EDU/1.0/
dc.title: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents
dc.type: Thesis
dc.description.degree: M.Eng.
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree: Master
thesis.degree.name: Master of Engineering in Electrical Engineering and Computer Science
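
To make the modeling idea in the abstract concrete, the sketch below illustrates one way to fit a low-rank (CP) decomposition of per-agent transition dynamics over latent factors for agents, states, actions, and next states, and to use the fitted tensor as a personalized simulator. This is a minimal illustration, not the thesis's implementation: the dimensions, CP rank, squared-error objective, gradient-descent fitting, and synthetic logged data are all assumptions made for the example.

```python
# Hypothetical sketch: rank-R CP model of P(s' | agent, s, a) fitted to
# empirical transition frequencies, then used as a per-agent simulator.
import numpy as np

rng = np.random.default_rng(0)
n_agents, n_states, n_actions, rank = 5, 8, 3, 4

# Synthetic "logged data": sample transitions from a ground-truth low-rank
# tensor. In the offline setting these counts would come from each agent's
# logged trajectories.
true_factors = [rng.random((dim, rank)) for dim in
                (n_agents, n_states, n_actions, n_states)]
true_T = np.einsum('ir,jr,kr,lr->ijkl', *true_factors)
true_T /= true_T.sum(axis=-1, keepdims=True)            # normalize over s'

counts = np.zeros_like(true_T)
for _ in range(20000):
    i = rng.integers(n_agents)
    s = rng.integers(n_states)
    a = rng.integers(n_actions)
    s_next = rng.choice(n_states, p=true_T[i, s, a])
    counts[i, s, a, s_next] += 1
emp_P = counts / np.clip(counts.sum(axis=-1, keepdims=True), 1, None)

# Fit rank-R CP factors to the empirical tensor by plain gradient descent
# on a squared-error objective (an assumed, simple choice of loss/optimizer).
factors = [0.3 * rng.random((dim, rank)) for dim in
           (n_agents, n_states, n_actions, n_states)]
lr = 0.2
for step in range(3000):
    approx = np.einsum('ir,jr,kr,lr->ijkl', *factors)
    err = approx - emp_P
    grads = [
        np.einsum('ijkl,jr,kr,lr->ir', err, factors[1], factors[2], factors[3]),
        np.einsum('ijkl,ir,kr,lr->jr', err, factors[0], factors[2], factors[3]),
        np.einsum('ijkl,ir,jr,lr->kr', err, factors[0], factors[1], factors[3]),
        np.einsum('ijkl,ir,jr,kr->lr', err, factors[0], factors[1], factors[2]),
    ]
    for f, g in zip(factors, grads):
        f -= lr * g

# Personalized simulator: sample a next state for one agent from the
# reconstructed (clipped and renormalized) transition probabilities.
def simulate_step(agent, state, action):
    p = np.einsum('r,r,r,lr->l', factors[0][agent], factors[1][state],
                  factors[2][action], factors[3])
    p = np.clip(p, 1e-8, None)
    return rng.choice(n_states, p=p / p.sum())

print("simulated next state for agent 0:", simulate_step(0, 2, 1))
```

Rollouts from such per-agent simulators could then feed any standard RL training loop; the CP structure is what lets data from all agents inform each individual agent's model.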

