Show simple item record

dc.contributor.authorKleiman-Weiner, Max
dc.contributor.authorTenenbaum, Joshua B
dc.date.accessioned2020-08-17T14:19:24Z
dc.date.available2020-08-17T14:19:24Z
dc.date.issued2018-12
dc.identifier.urihttps://hdl.handle.net/1721.1/126610
dc.description.abstractLearning to cooperate with friends and compete with foes is a key component of multi-agent reinforcement learning. Typically to do so, one requires access to either a model of or interaction with the other agent(s). Here we show how to learn effective strategies for cooperation and competition in an asymmetric information game with no such model or interaction. Our approach is to encourage an agent to reveal or hide their intentions using an information-theoretic regularizer. We consider both the mutual information between goal and action given state, as well as the mutual information between goal and state. We show how to optimize these regularizers in a way that is easy to integrate with policy gradient reinforcement learning. Finally, we demonstrate that cooperative (competitive) policies learned with our approach lead to more (less) reward for a second agent in two simple asymmetric information games.en_US
dc.description.sponsorshipNational Science Foundation (U.S.). (Grant 1231216)en_US
dc.language.isoen
dc.publisherCurran Associatesen_US
dc.relation.isversionofhttps://papers.nips.cc/paper/8227-learning-to-share-and-hide-intentions-using-information-regularizationen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceNeural Information Processing Systems (NIPS)en_US
dc.titleLearning to share and hide intentions using information regularizationen_US
dc.typeArticleen_US
dc.identifier.citationStrouse, D. J. et al. “.” Paper presented at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Dec 3-8 2018, Curran Associates © 2018 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.relation.journal32nd Conference on Neural Information Processing Systems (NeurIPS 2018)en_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-10-08T14:52:29Z
dspace.date.submission2019-10-08T14:52:34Z
mit.journal.volume2018en_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record