Show simple item record

dc.contributor.advisorBrian C. Williams.en_US
dc.contributor.authorRodrigues Quemel e Assis Santana, Pedro Henrique deen_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Aeronautics and Astronautics.en_US
dc.date.accessioned2017-02-22T19:01:09Z
dc.date.available2017-02-22T19:01:09Z
dc.date.copyright2016en_US
dc.date.issued2016en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/107049
dc.descriptionThesis: Ph. D. in Aerospace Engineering, Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2016.en_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 289-306).en_US
dc.description.abstractA special report on the cover of the June 2016 issue of the IEEE Spectrum magazine reads: "can we trust robots?" In a world that has been experiencing a seemingly irreversible process by which autonomous systems have been given increasingly more space in strategic areas such as transportation, manufacturing, energy supply, planetary exploration, and even medical surgeries, it is natural that we start asking ourselves if these systems could be held at the same or even higher levels of safety than we expect from humans. In an effort to make a contribution towards a world of autonomy that we can trust, this thesis argues that one necessary step in this direction is the endowment of autonomous agents with the ability to dynamically adapt to their environment while meeting strict safety guarantees. From a technical standpoint, we propose that autonomous agents in safety-critical applications be able to execute conditional plans (or policies) within risk bounds (also referred to as chance constraints). By being conditional, the plan allows the autonomous agent to adapt to its environment in real-time by conditioning the choice of activity to be executed on the agent's current level of knowledge, or belief, about the true state of world. This belief state is, in turn, a function of the history of potentially noisy sensor observations gathered by the agent from the environment. With respect to bounded risk, it refers to the fact that executing such conditional plans should guarantee to keep the agent "safe" - as defined by sets of state constraints - with high probability, while moving away from the conservatism of minimum risk approaches. In this thesis, we propose Chance-Constrained Partially Observable Markov Decision Processes (CC-POMDP's) as a formalism for conditional risk-bounded planning under uncertainty. Moreover, we present Risk-bounded AO* (RAO*), a heuristic forward search-based algorithm that searches for solutions to a CC-POMDP by leveraging admissible utility and risk heuristics to simultaneously guide the search and perform early pruning of overly-risky policy branches. In an effort to facilitate the specification of risk-bounded behavior by human modelers, we also present the Chance-constrained Reactive Model-based Programming Language (cRMPL), a novel variant of RMPL that incorporates chance constraints as part of its syntax. Finally, in support of the temporal planning applications with duration uncertainty that this thesis is concerned about, we present the Polynomial-time Algorithm for Risk-aware Scheduling (PARIS) and its extension to conditional scheduling of Probabilistic Temporal Plan Networks (PTPN's). The different tools and algorithms developed in the context of this thesis are combined to form the Conditional Planning for Autonomy with Risk (CLARK) system, a risk-aware conditional planning system that can generate chance-constrained, dynamic temporal plans for autonomous agents that must operate under uncertainty. With respect to our empirical validation, each component of CLARK is benchmarked against the relevant state of the art throughout the chapters, followed by several demonstrations of the whole CLARK system working in tandem with other building blocks of an architecture for autonomy.en_US
dc.description.statementofresponsibilityby Pedro Henrique de Rodrigues Quemel e Assis Santana.en_US
dc.format.extent306 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectAeronautics and Astronautics.en_US
dc.titleDynamic execution of temporal plans with sensing actions and bounded risken_US
dc.typeThesisen_US
dc.description.degreePh. D. in Aerospace Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.identifier.oclc971021121en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record