Show simple item record

dc.contributor.advisorLeslie Kaelbling
dc.contributor.authorZewdie, Dawit H.en_US
dc.contributor.authorKonidaris, Georgeen_US
dc.contributor.otherLearning and Intelligent Systemsen
dc.date.accessioned2015-11-30T19:30:04Z
dc.date.available2015-11-30T19:30:04Z
dc.date.issued2015-11-24
dc.identifier.urihttp://hdl.handle.net/1721.1/100053
dc.description.abstractRecent years have seen increased interest in non-parametric reinforcement learning. There are now practical kernel-based algorithms for approximating value functions; however, kernel regression requires that the underlying function being approximated be smooth on its domain. Few problems of interest satisfy this requirement in their natural representation. In this paper we define Value-Consistent Pseudometric (VCPM), the distance function corresponding to a transformation of the domain into a space where the target function is maximally smooth and thus well-approximated by kernel regression. We then present DKBRL, an iterative batch RL algorithm interleaving steps of Kernel-Based Reinforcement Learning and distance metric adjustment. We evaluate its performance on Acrobot and PinBall, continuous-space reinforcement learning domains with discontinuous value functions.en_US
dc.format.extent16 p.en_US
dc.relation.ispartofseriesMIT-CSAIL-TR-2015-032
dc.rightsCreative Commons Attribution-ShareAlike 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by-sa/4.0/
dc.subjectMetric learningen_US
dc.titleRepresentation Discovery for Kernel-Based Reinforcement Learningen_US
dc.date.updated2015-11-30T19:30:04Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record