dc.contributor.author | Undurti, Aditya | |
dc.contributor.author | How, Jonathan P. | |
dc.date.accessioned | 2011-08-31T15:29:46Z | |
dc.date.available | 2011-08-31T15:29:46Z | |
dc.date.issued | 2010-07 | |
dc.date.submitted | 2010-05 | |
dc.identifier.isbn | 978-1-4244-5038-1 | |
dc.identifier.issn | 1050-4729 | |
dc.identifier.other | INSPEC Accession Number: 11431048 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/65564 | |
dc.description.abstract | This work seeks to address the problem of planning in the presence of uncertainty and constraints. Such problems arise in many situations, including the basis of this work, which involves planning for a team of first responders (both humans and robots) operating in an urban environment. The problem is framed as a Partially-Observable Markov Decision Process (POMDP) with constraints, and it is shown that even in a relatively simple planning problem, modeling constraints as large penalties does not lead to good solutions. The main contribution of the work is a new online algorithm that explicitly ensures constraint feasibility while remaining computationally tractable. Its performance is demonstrated on an example problem and it is demonstrated that our online algorithm generates policies comparable to an offline constrained POMDP algorithm. | en_US |
dc.description.sponsorship | United States. Office of Naval Research (grant N00014-07-1-0749) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/ROBOT.2010.5509743 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | IEEE | en_US |
dc.title | An online algorithm for constrained POMDPs | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Undurti, Aditya, and Jonathan P How. “An Online Algorithm for Constrained POMDPs.” 2010 IEEE International Conference on Robotics and Automation. Anchorage, AK, 2010. 3966-3973. © Copyright 2010 IEEE | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.contributor.approver | How, Jonathan P. | |
dc.contributor.mitauthor | Undurti, Aditya | |
dc.contributor.mitauthor | How, Jonathan P. | |
dc.relation.journal | 2010 IEEE International Conference on Robotics and Automation (ICRA) | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
dspace.orderedauthors | Undurti, Aditya; How, Jonathan P | en |
dc.identifier.orcid | https://orcid.org/0000-0001-8576-1930 | |
mit.license | PUBLISHER_POLICY | en_US |
mit.metadata.status | Complete | |