Compositional RL Agents That Follow Language Commands in Temporal Logic

Kuo, Yen-Ling; Katz, Boris; Barbu, Andrei

dc.contributor.author	Kuo, Yen-Ling
dc.contributor.author	Katz, Boris
dc.contributor.author	Barbu, Andrei
dc.date.accessioned	2021-10-12T13:33:19Z
dc.date.available	2021-10-12T13:33:19Z
dc.date.issued	2021-07
dc.date.submitted	2021-04
dc.identifier.issn	2296-9144
dc.identifier.uri	https://hdl.handle.net/1721.1/132922
dc.description.abstract	We demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains.	en_US
dc.description.sponsorship	NSF (Award 1231216)	en_US
dc.description.sponsorship	ONR (Award N00014-20-1-2589)	en_US
dc.publisher	Frontiers Media SA	en_US
dc.relation.isversionof	http://dx.doi.org/10.3389/frobt.2021.689550	en_US
dc.rights	Creative Commons Attribution 4.0 International license	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	Frontiers	en_US
dc.title	Compositional RL Agents That Follow Language Commands in Temporal Logic	en_US
dc.type	Article	en_US
dc.identifier.citation	Kuo, Yen-Ling et al. "Compositional RL Agents That Follow Language Commands in Temporal Logic." Frontiers in Robotics and AI 8 (July 2021): 689550. © 2021 Kuo et al.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.relation.journal	Frontiers in Robotics and AI	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.date.submission	2021-08-13T16:32:24Z
mit.journal.volume	8	en_US
mit.license	PUBLISHER_CC
mit.metadata.status	Complete	en_US

Files in this item

Name:: frobt-08-689550.pdf
Size:: 1.572Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record