Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution

Roy, Nicholas; He, Ruijie

The MIT Libraries is completing a major upgrade to DSpace@MIT. Starting May 5 2026, DSpace will remain functional, viewable, searchable, and downloadable, however, you will not be able to edit existing collections or add new material. We are aiming to have full functionality restored by May 18, 2026, but intermittent service interruptions may occur. Please email dspace-lib@mit.edu with any questions. Thank you for your patience as we implement this important upgrade.

Show simple item record

dc.contributor.advisor	Nicholas Roy
dc.contributor.author	Roy, Nicholas	en_US
dc.contributor.author	He, Ruijie	en_US
dc.contributor.other	Robotics, Vision & Sensor Networks	en_US
dc.date.accessioned	2009-09-28T21:00:15Z
dc.date.available	2009-09-28T21:00:15Z
dc.date.issued	2009-09-23
dc.identifier.uri	http://hdl.handle.net/1721.1/46820
dc.description.abstract	Online, forward-search techniques have demonstrated promising results for solving problems in partially observable environments. These techniques depend on the ability to efficiently search and evaluate the set of beliefs reachable from the current belief. However, enumerating or sampling action-observation sequences to compute the reachable beliefs is computationally demanding; coupled with the need to satisfy real-time constraints, existing online solvers can only search to a limited depth. In this paper, we propose that policies can be generated directly from the distribution of the agent's posterior belief. When the underlying state distribution is Gaussian, and the observation function is an exponential family distribution, we can calculate this distribution of beliefs without enumerating the possible observations. This property not only enables us to plan in problems with large observation spaces, but also allows us to search deeper by considering policies composed of multi-step action sequences. We present the Posterior Belief Distribution (PBD) algorithm, an efficient forward-search POMDP planner for continuous domains, demonstrating that better policies are generated when we can perform deeper forward search.	en_US
dc.format.extent	12 p.	en_US
dc.relation.ispartofseries	MIT-CSAIL-TR-2009-044
dc.rights	Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 Unported	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/
dc.title	Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution	en_US

Files in this item

Name:: MIT-CSAIL-TR-2009-044.pdf
Size:: 321.8Kb
Format:: PDF

View/Open

Name:: MIT-CSAIL-TR-2009-044.ps
Size:: 1.722Mb
Format:: Postscript

View/Open

Name:: license_rdf
Size:: 597bytes
Format:: Unknown

View/Open

This item appears in the following Collection(s)

CSAIL Technical Reports (July 1, 2003 - present)

Show simple item record