Show simple item record

dc.contributor.advisorKim, Sangbae
dc.contributor.authorZhang, Jenny L.
dc.date.accessioned2024-03-21T19:08:36Z
dc.date.available2024-03-21T19:08:36Z
dc.date.issued2024-02
dc.date.submitted2024-03-04T16:38:15.687Z
dc.identifier.urihttps://hdl.handle.net/1721.1/153828
dc.description.abstractWe present a minimal phase oscillator model for learning quadrupedal locomotion. Each of the four oscillators is coupled only to itself and its corresponding leg through local feedback of the ground reaction force, which we interpret as an observer feedback gain. The oscillator itself is interpreted as a latent contact state-estimator. Through a systematic ablation study, we show that the combination of phase observations, simple phase-based rewards, and the local feedback dynamics induces policies that exhibit emergent gait preferences, while using a reduced set of simple rewards, and without prescribing a specific gait.
dc.publisherMassachusetts Institute of Technology
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleLearning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback
dc.typeThesis
dc.description.degreeM.Eng.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.orcidhttps://orcid.org/0009-0004-3290-1118
mit.thesis.degreeMaster
thesis.degree.nameMaster of Engineering in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record