Show simple item record

dc.contributor.authorWang, Yunbo
dc.contributor.authorLiu, Bo
dc.contributor.authorWu, Jiajun
dc.contributor.authorZhu, Yuke
dc.contributor.authorDu, Simon S
dc.contributor.authorFei-Fei, Li
dc.contributor.authorTenenbaum, Joshua B
dc.date.accessioned2021-12-07T19:14:34Z
dc.date.available2021-12-07T19:14:34Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/138359
dc.description.abstract© 2020 Inst. Sci. inf., Univ. Defence in Belgrade. All rights reserved. A major difficulty of solving continuous POMDPs is to infer the multi-modal distribution of the unobserved true states and to make the planning algorithm dependent on the perceived uncertainty. We cast POMDP filtering and planning problems as two closely related Sequential Monte Carlo (SMC) processes, one over the real states and the other over the future optimal trajectories, and combine the merits of these two parts in a new model named the DualSMC network. In particular, we first introduce an adversarial particle filter that leverages the adversarial relationship between its internal components. Based on the filtering results, we then propose a planning algorithm that extends the previous SMC planning approach [Piche et al., 2018] to continuous POMDPs with an uncertainty-dependent policy. Crucially, not only can DualSMC handle complex observations such as image input but also it remains highly interpretable. It is shown to be effective in three continuous POMDP domains: the floor positioning domain, the 3D light-dark navigation domain, and a modified Reacher domain.en_US
dc.language.isoen
dc.publisherInternational Joint Conferences on Artificial Intelligence Organizationen_US
dc.relation.isversionof10.24963/IJCAI.2020/579en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleDualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPsen_US
dc.typeArticleen_US
dc.identifier.citationWang, Yunbo, Liu, Bo, Wu, Jiajun, Zhu, Yuke, Du, Simon S et al. 2020. "DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs." IJCAI International Joint Conference on Artificial Intelligence, 2021-January.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journalIJCAI International Joint Conference on Artificial Intelligenceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-12-07T19:08:50Z
dspace.orderedauthorsWang, Y; Liu, B; Wu, J; Zhu, Y; Du, SS; Fei-Fei, L; Tenenbaum, JBen_US
dspace.date.submission2021-12-07T19:08:52Z
mit.journal.volume2021-Januaryen_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record