dc.contributor.author | Chaudhari, Pratik Anil | |
dc.contributor.author | Karaman, Sertac | |
dc.contributor.author | Hsu, David | |
dc.contributor.author | Frazzoli, Emilio | |
dc.date.accessioned | 2013-10-29T13:48:44Z | |
dc.date.available | 2013-10-29T13:48:44Z | |
dc.date.issued | 2013-06 | |
dc.identifier.isbn | 978-1-4799-0177-7 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/81825 | |
dc.description.abstract | This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem. | en_US |
dc.description.sponsorship | United States. Army Research Office. Multidisciplinary University Research Initiative (Grant W911NF-11-1-0046) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6580549 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Sampling-based algorithms for continuous-time POMDPs | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Chaudhari, Pratik Anil et al. "Sampling-based algorithms for continuous-time POMDPs." IEEE American Control Conference (ACC), 2013. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.contributor.mitauthor | Chaudhari, Pratik Anil | en_US |
dc.contributor.mitauthor | Karaman, Sertac | en_US |
dc.contributor.mitauthor | Frazzoli, Emilio | en_US |
dc.relation.journal | Proceedings of the 2013 American Control Conference (ACC) | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Chaudhari, Pratik Anil; Karaman, Sertac; Hsu, David; Frazzoli, Emilio | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-0505-1400 | |
dc.identifier.orcid | https://orcid.org/0000-0002-2225-7275 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |