Show simple item record

dc.contributor.authorAraki, Brandon
dc.contributor.authorVodrahalli, Kiran
dc.contributor.authorLeech, Thomas
dc.contributor.authorVasile, Cristian-Ioan
dc.contributor.authorDonahue, Mark D.
dc.contributor.authorRus, Daniela L
dc.date.accessioned2019-12-19T20:42:45Z
dc.date.available2019-12-19T20:42:45Z
dc.date.issued2019-06
dc.identifier.isbn9780992374754
dc.identifier.urihttps://hdl.handle.net/1721.1/123310
dc.description.abstractThis paper introduces the Logic-based Value Iteration Network (LVIN) framework, which combines imitation learning and logical automata to enable agents to learn complex behaviors from demonstrations. We address two problems with learning from expert knowledge: (1) how to generalize learned policies for a task to larger classes of tasks, and (2) how to account for erroneous demonstrations. Our LVIN model solves finite gridworld environments by instantiating a recurrent, convolutional neural network as a value iteration procedure over a learned Markov Decision Process (MDP) that factors into two MDPs: a small finite state automaton (FSA) corresponding to logical rules, and a larger MDP corresponding to motions in the environment. The parameters of LVIN (value function, reward map, FSA transitions, large MDP transitions) are approximately learned from expert trajectories. Since the model represents the learned rules as an FSA, the model is interpretable; since the FSA is integrated into planning, the behavior of the agent can be manipulated by modifying the FSA transitions. We demonstrate these abilities in several domains of interest, including a lunchboxpacking manipulation task and a driving domain.en_US
dc.description.sponsorshipNational Science Foundation (Grant 1723943)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N000141812830)en_US
dc.description.sponsorshipAir Force Office of Scientific Research (Contract FA8702-15-D-0001)en_US
dc.publisherRobotics: Science and Systems Foundationen_US
dc.relation.isversionofhttp://dx.doi.org/10.15607/rss.2019.xv.064en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceProf. Daniella Rusen_US
dc.titleLearning to Plan with Logical Automataen_US
dc.typeArticleen_US
dc.identifier.citationAraki, Brandon et al. "Learning to Plan with Logical Automata." Robotics: Science and Systems 2019, June 2019, Freiburg, Germany, Robotics: Science and Systems Foundation, June 2019en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentLincoln Laboratoryen_US
dc.relation.journalRobotics: Science and Systems 2019en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.date.submission2019-12-19T18:15:34Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record