Show simple item record

dc.contributor.authorMaruyama, Yu
dc.contributor.authorZheng, Xiaoyu
dc.contributor.authorKawaguchi, Kenji
dc.date.accessioned2017-03-28T17:19:32Z
dc.date.available2017-03-28T17:19:32Z
dc.date.issued2016-06
dc.date.submitted2015-03
dc.identifier.issn1943-5037
dc.identifier.issn1076-9757
dc.identifier.urihttp://hdl.handle.net/1721.1/107756
dc.description.abstractThis paper considers global optimization with a black-box unknown objective function that can be non-convex and non-differentiable. Such a difficult optimization problem arises in many real-world applications, such as parameter tuning in machine learning, engineering design problem, and planning with a complex physics simulator. This paper proposes a new global optimization algorithm, called Locally Oriented Global Optimization (LOGO), to aim for both fast convergence in practice and finite-time error bound in theory. The advantage and usage of the new algorithm are illustrated via theoretical analysis and an experiment conducted with 11 benchmark test functions. Further, we modify the LOGO algorithm to specifically solve a planning problem via policy search with continuous state/action space and long time horizon while maintaining its finite-time error bound. We apply the proposed planning method to accident management of a nuclear power plant. The result of the application study demonstrates the practical utility of our method.en_US
dc.language.isoen_US
dc.publisherAssociation for the Advancement of Artificial Intelligenceen_US
dc.relation.isversionofhttp://dx.doi.org/10.1613/jair.4742en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAAAIen_US
dc.titleGlobal Continuous Optimization with Error Bound and Fast Convergenceen_US
dc.typeArticleen_US
dc.identifier.citationKawaguchi, Kenji, Yu Maruyama and Xiaoyu Zheng. "Global Continuous Optimization with Error Bound and Fast Convergence." Journal of Articial Intelligence Research 56 (2016): 153-195.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorKawaguchi, Kenji
dc.relation.journalJournal of Artificial Intelligence Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsKawaguchi, Kenji, Maruyama, Yu & Zheng, Xiaoyuen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-1839-7504
mit.licensePUBLISHER_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record