A Monte-Carlo AIXI Approximation

Veness, Joel; Ng, Kee Siong; Hutter, Marcus; Uther, William; Silver, David

dc.contributor.author	Veness, Joel
dc.contributor.author	Ng, Kee Siong
dc.contributor.author	Hutter, Marcus
dc.contributor.author	Uther, William
dc.contributor.author	Silver, David
dc.date.accessioned	2011-10-19T18:11:58Z
dc.date.available	2011-10-19T18:11:58Z
dc.date.issued	2011-01
dc.date.submitted	2010-07
dc.identifier.issn	1943-5037
dc.identifier.issn	1076-9757
dc.identifier.uri	http://hdl.handle.net/1721.1/66495
dc.description.abstract	This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a new Monte-Carlo Tree Search algorithm along with an agent-specific extension to the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a variety of stochastic and partially observable domains. We conclude by proposing a number of directions for future research.	en_US
dc.description.sponsorship	Australian Research Council (grant DP0988049)	en_US
dc.language.iso	en_US
dc.publisher	AI Access Foundation	en_US
dc.relation.isversionof	http://dx.doi.org/10.1613/jair.3125	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	JAIR	en_US
dc.title	A Monte-Carlo AIXI Approximation	en_US
dc.type	Article	en_US
dc.identifier.citation	Veness, Joel et al. (2011) "A Monte-Carlo AIXI Approximation", Journal of Artificial Intelligence Research (2011) Volume 40, pages 95-142. © 2011 AI Access Foundation	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.approver	Silver, David
dc.contributor.mitauthor	Silver, David
dc.relation.journal	Journal of Artificial Intelligence Research	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Veness, Joel; Ng, Kee Siong; Hutter, Marcus; Uther, William; Silver, David
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Veness-2011-A Monte-Carlo AIXI ...
Size:: 668.2Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record