Browsing MIT Open Access Articles by Subject "value iteration"

Universal Reinforcement Learning

Farias, Vivek F.; Moallemi, Ciamac C.; Van Roy, Benjamin; Weissman, Tsachy (Institute of Electrical and Electronics Engineers, 2010-04)

We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence future observations and costs. The goal is to ...