Show simple item record

dc.contributor.authorKim, Beomjoon
dc.contributor.authorLee, Kyungjae
dc.contributor.authorLim, Sungbin
dc.contributor.authorKaelbling, Leslie P
dc.contributor.authorLozano-Perez, Tomas
dc.date.accessioned2022-07-12T12:30:34Z
dc.date.available2021-09-20T18:21:48Z
dc.date.available2022-07-12T12:30:34Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/132316.2
dc.description.abstract<jats:p>Many important applications, including robotics, data-center management, and process control, require planning action sequences in domains with continuous state and action spaces and discontinuous objective functions. Monte Carlo tree search (MCTS) is an effective strategy for planning in discrete action spaces. We provide a novel MCTS algorithm (voot) for deterministic environments with continuous action spaces, which, in turn, is based on a novel black-box function-optimization algorithm (voo) to efficiently sample actions. The voo algorithm uses Voronoi partitioning to guide sampling, and is particularly efficient in high-dimensional spaces. The voot algorithm has an instance of voo at each node in the tree. We provide regret bounds for both algorithms and demonstrate their empirical effectiveness in several high-dimensional problems including two difficult robotics planning problems.</jats:p>en_US
dc.language.isoen
dc.publisherAssociation for the Advancement of Artificial Intelligence (AAAI)en_US
dc.relation.isversionof10.1609/AAAI.V34I06.6546en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceOther repositoryen_US
dc.titleMonte Carlo Tree Search in Continuous Spaces Using Voronoi Optimistic Optimization with Regret Boundsen_US
dc.typeArticleen_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalProceedings of the AAAI Conference on Artificial Intelligenceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-12-22T18:54:11Z
dspace.orderedauthorsKim, B; Lee, K; Lim, S; Kaelbling, L; Lozano-Perez, Ten_US
dspace.date.submission2020-12-22T18:54:16Z
mit.journal.volume34en_US
mit.journal.issue06en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version