Show simple item record

dc.contributor.advisorLeslie Kaelbling
dc.contributor.authorKawaguchi, Kenjien_US
dc.contributor.otherLearning and Intelligent Systemsen
dc.date.accessioned2016-06-01T22:00:14Z
dc.date.available2016-06-01T22:00:14Z
dc.date.issued2016-05-26
dc.identifier.urihttp://hdl.handle.net/1721.1/102796
dc.descriptionSM thesisen_US
dc.description.abstractThis thesis discusses novel principles to improve the theoretical analyses of a class of methods, aiming to provide theoretically driven yet practically useful methods. The thesis focuses on a class of methods, called bound-based search, which includes several planning algorithms (e.g., the A* algorithm and the UCT algorithm), several optimization methods (e.g., Bayesian optimization and Lipschitz optimization), and some learning algorithms (e.g., PAC-MDP algorithms). For Bayesian optimization, this work solves an open problem and achieves an exponential convergence rate. For learning algorithms, this thesis proposes a new analysis framework, called PAC-RMDP, and improves the previous theoretical bounds. The PAC-RMDP framework also provides a unifying view of some previous near-Bayes optimal and PAC-MDP algorithms. All proposed algorithms derived on the basis of the new principles produced competitive results in our numerical experiments with standard benchmark tests.en_US
dc.format.extent87 p.en_US
dc.relation.ispartofseriesMIT-CSAIL-TR-2016-006
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectPAC-MDPen_US
dc.subjectAI planningen_US
dc.subjectGlobal optimizationen_US
dc.titleTowards Practical Theory: Bayesian Optimization and Optimal Explorationen_US
dc.date.updated2016-06-01T22:00:15Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record