Learning to share and hide intentions using information regularization

Kleiman-Weiner, Max; Tenenbaum, Joshua B

dc.contributor.author	Kleiman-Weiner, Max
dc.contributor.author	Tenenbaum, Joshua B
dc.date.accessioned	2020-08-17T14:19:24Z
dc.date.available	2020-08-17T14:19:24Z
dc.date.issued	2018-12
dc.identifier.uri	https://hdl.handle.net/1721.1/126610
dc.description.abstract	Learning to cooperate with friends and compete with foes is a key component of multi-agent reinforcement learning. Typically to do so, one requires access to either a model of or interaction with the other agent(s). Here we show how to learn effective strategies for cooperation and competition in an asymmetric information game with no such model or interaction. Our approach is to encourage an agent to reveal or hide their intentions using an information-theoretic regularizer. We consider both the mutual information between goal and action given state, as well as the mutual information between goal and state. We show how to optimize these regularizers in a way that is easy to integrate with policy gradient reinforcement learning. Finally, we demonstrate that cooperative (competitive) policies learned with our approach lead to more (less) reward for a second agent in two simple asymmetric information games.	en_US
dc.description.sponsorship	National Science Foundation (U.S.). (Grant 1231216)	en_US
dc.language.iso	en
dc.publisher	Curran Associates	en_US
dc.relation.isversionof	https://papers.nips.cc/paper/8227-learning-to-share-and-hide-intentions-using-information-regularization	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	Learning to share and hide intentions using information regularization	en_US
dc.type	Article	en_US
dc.identifier.citation	Strouse, D. J. et al. “Learning to share and hide intentions using information regularization.” Paper presented at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Dec 3-8 2018, Curran Associates © 2018 The Author(s)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences	en_US
dc.relation.journal	32nd Conference on Neural Information Processing Systems (NeurIPS 2018)	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2019-10-08T14:52:29Z
dspace.date.submission	2019-10-08T14:52:34Z
mit.journal.volume	2018	en_US
mit.metadata.status	Complete

Files in this item

Name: 8227-learning-to-share-and-hid ...
Size: 4.595Mb
Format: PDF
Description: Published version


This item appears in the following Collection(s)

MIT Open Access Articles
