| dc.contributor.author | Xia, Victoria F. | |
| dc.contributor.author | Wang, Zi | |
| dc.contributor.author | Allen, Kelsey Rebecca | |
| dc.contributor.author | Silver, Tom | |
| dc.contributor.author | Kaelbling, Leslie P | |
| dc.date.accessioned | 2022-07-18T17:21:48Z | |
| dc.date.available | 2021-09-20T18:21:48Z | |
| dc.date.available | 2022-07-18T17:21:48Z | |
| dc.date.issued | 2019 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/132315.2 | |
| dc.description.abstract | © 7th International Conference on Learning Representations, ICLR 2019. All Rights Reserved. We present a representation for describing transition models in complex uncertain domains using relational rules. For any action, a rule selects a set of relevant objects and computes a distribution over properties of just those objects in the resulting state given their properties in the previous state. An iterative greedy algorithm is used to construct a set of deictic references that determine which objects are relevant in any given state. Feed-forward neural networks are used to learn the transition distribution on the relevant objects' properties. This strategy is demonstrated to be both more versatile and more sample efficient than learning a monolithic transition model in a simulated domain in which a robot pushes stacks of objects on a cluttered table. | en_US |
| dc.language.iso | en | |
| dc.relation.isversionof | https://openreview.net/forum?id=SJxsV2R5FQ | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | Learning sparse relational transition models | en_US |
| dc.type | Article | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.relation.journal | 7th International Conference on Learning Representations, ICLR 2019 | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2020-12-22T16:20:59Z | |
| dspace.orderedauthors | Xia, V; Wang, Z; Allen, K; Silver, T; Kaelbling, LP | en_US |
| dspace.date.submission | 2020-12-22T16:21:04Z | |
| mit.license | OPEN_ACCESS_POLICY | |
| mit.metadata.status | Publication Information Needed | en_US |