Show simple item record

dc.contributor.authorKuo, Yen-Ling
dc.contributor.authorKatz, Boris
dc.contributor.authorBarbu, Andrei
dc.date.accessioned2021-10-12T13:33:19Z
dc.date.available2021-10-12T13:33:19Z
dc.date.issued2021-07
dc.date.submitted2021-04
dc.identifier.issn2296-9144
dc.identifier.urihttps://hdl.handle.net/1721.1/132922
dc.description.abstractWe demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains.en_US
dc.description.sponsorshipNSF (Award 1231216)en_US
dc.description.sponsorshipONR (Award N00014-20-1-2589)en_US
dc.publisherFrontiers Media SAen_US
dc.relation.isversionofhttp://dx.doi.org/10.3389/frobt.2021.689550en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceFrontiersen_US
dc.titleCompositional RL Agents That Follow Language Commands in Temporal Logicen_US
dc.typeArticleen_US
dc.identifier.citationKuo, Yen-Ling et al. "Compositional RL Agents That Follow Language Commands in Temporal Logic." Frontiers in Robotics and AI 8 (July 2021): 689550. © 2021 Kuo et al.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalFrontiers in Robotics and AIen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.date.submission2021-08-13T16:32:24Z
mit.journal.volume8en_US
mit.licensePUBLISHER_CC
mit.metadata.statusCompleteen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record