Show simple item record

dc.contributor.authorGaudet, Brian
dc.contributor.authorLinares, Richard
dc.contributor.authorFurfaro, Roberto
dc.date.accessioned2021-11-08T18:19:24Z
dc.date.available2021-11-08T18:19:24Z
dc.date.issued2018-03
dc.identifier.urihttps://hdl.handle.net/1721.1/137757
dc.description.abstract© 2020 COSPAR This work develops a deep reinforcement learning based approach for Six Degree-of-Freedom (DOF) planetary powered descent and landing. Future Mars missions will require advanced guidance, navigation, and control algorithms for the powered descent phase to target specific surface locations and achieve pinpoint accuracy (landing error ellipse <5 m radius). This requires both a navigation system capable of estimating the lander's state in real-time and a guidance and control system that can map the estimated lander state to a commanded thrust for each lander engine. In this paper, we present a novel integrated guidance and control algorithm designed by applying the principles of reinforcement learning theory. The latter is used to learn a policy mapping the lander's estimated state directly to a commanded thrust for each engine, resulting in accurate and almost fuel-optimal trajectories over a realistic deployment ellipse. Specifically, we use proximal policy optimization, a policy gradient method, to learn the policy. Another contribution of this paper is the use of different discount rates for terminal and shaping rewards, which significantly enhances optimization performance. We present simulation results demonstrating the guidance and control system's performance in a 6-DOF simulation environment and demonstrate robustness to noise and system parameter uncertainty.en_US
dc.language.isoen
dc.publisherElsevier BVen_US
dc.relation.isversionof10.1016/J.ASR.2019.12.030en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleDeep reinforcement learning for six degree-of-freedom planetary landingen_US
dc.typeArticleen_US
dc.identifier.citationGaudet, Brian, Linares, Richard and Furfaro, Roberto. 2018. "Deep reinforcement learning for six degree-of-freedom planetary landing." Advances in Space Research, 65 (7).
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.relation.journalAdvances in Space Researchen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-05-06T14:39:57Z
dspace.orderedauthorsGaudet, B; Linares, R; Furfaro, Ren_US
dspace.date.submission2021-05-06T14:39:58Z
mit.journal.volume65en_US
mit.journal.issue7en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record