dc.contributor.author | Kuo, Yen-Ling | |
dc.contributor.author | Katz, Boris | |
dc.contributor.author | Barbu, Andrei | |
dc.date.accessioned | 2021-10-12T13:33:19Z | |
dc.date.available | 2021-10-12T13:33:19Z | |
dc.date.issued | 2021-07 | |
dc.date.submitted | 2021-04 | |
dc.identifier.issn | 2296-9144 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/132922 | |
dc.description.abstract | We demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains. | en_US |
dc.description.sponsorship | NSF (Award 1231216) | en_US |
dc.description.sponsorship | ONR (Award N00014-20-1-2589) | en_US |
dc.publisher | Frontiers Media SA | en_US |
dc.relation.isversionof | http://dx.doi.org/10.3389/frobt.2021.689550 | en_US |
dc.rights | Creative Commons Attribution 4.0 International license | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
dc.source | Frontiers | en_US |
dc.title | Compositional RL Agents That Follow Language Commands in Temporal Logic | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Kuo, Yen-Ling et al. "Compositional RL Agents That Follow Language Commands in Temporal Logic." Frontiers in Robotics and AI 8 (July 2021): 689550. © 2021 Kuo et al. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.relation.journal | Frontiers in Robotics and AI | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.date.submission | 2021-08-13T16:32:24Z | |
mit.journal.volume | 8 | en_US |
mit.license | PUBLISHER_CC | |
mit.metadata.status | Complete | en_US |