Deep Bayesian Nonparametric Learning of Rules and Plans from Demonstrations with a Learned Automaton Prior

Araki, Brandon; Vodrahalli, Kiran; Leech, Thomas; Vasile, Cristian-Ioan; Donahue, Mark; Rus, Daniela

Terms of use

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (http://creativecommons.org/licenses/by-nc-sa/4.0/)

Abstract

We introduce a method for learning, from expert demonstrations, imitative policies that are interpretable and manipulable. We achieve interpretability by modeling the interactions between high-level actions as an automaton with connections to formal logic. We achieve manipulability by integrating this automaton into planning, so that changes to the automaton have predictable effects on the learned behavior. These qualities allow a human user to first understand what the model has learned, and then either correct the learned behavior or generalize zero-shot to new, similar tasks. We build upon previous work by removing its requirement for additional supervised information, which is hard to collect in practice. We achieve this by using a deep Bayesian nonparametric hierarchical model. We test our model on several domains and also show results for a real-world implementation on a mobile robotic arm platform.

Date issued

2020

URI

https://hdl.handle.net/1721.1/135230

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Lincoln Laboratory

Journal

Proceedings of the AAAI Conference on Artificial Intelligence

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Collections

MIT Open Access Articles