dc.contributor.author | Serrino, Jack | |
dc.contributor.author | Parkes, DC | |
dc.contributor.author | Kleiman-Weiner, Max | |
dc.contributor.author | Tenenbaum, Joshua B | |
dc.date.accessioned | 2021-12-07T18:33:03Z | |
dc.date.available | 2021-12-07T14:03:48Z | |
dc.date.available | 2021-12-07T18:33:03Z | |
dc.date.issued | 2019-01 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/138343.2 | |
dc.description.abstract | © 2019 Neural information processing systems foundation. All rights reserved. Recent breakthroughs in AI for multi-agent games like Go, Poker, and Dota, have seen great strides in recent years. Yet none of these games address the real-life challenge of cooperation in the presence of unknown and uncertain teammates. This challenge is a key game mechanism in hidden role games. Here we develop the DeepRole algorithm, a multi-agent reinforcement learning agent that we test on The Resistance: Avalon, the most popular hidden role game. DeepRole combines counterfactual regret minimization (CFR) with deep value networks trained through self-play. Our algorithm integrates deductive reasoning into vector-form CFR to reason about joint beliefs and deduce partially observable actions. We augment deep value networks with constraints that yield interpretable representations of win probabilities. These innovations enable DeepRole to scale to the full Avalon game. Empirical game-theoretic methods show that DeepRole outperforms other hand-crafted and learned agents in five-player Avalon. DeepRole played with and against human players on the web in hybrid human-agent teams. We find that DeepRole outperforms human players as both a cooperator and a competitor. | en_US |
dc.description.sponsorship | NSF (Award CCF-1231216) | en_US |
dc.language.iso | en | |
dc.relation.isversionof | https://papers.nips.cc/paper/2019/hash/912d2b1c7b2826caf99687388d2e8f7c-Abstract.html | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Neural Information Processing Systems (NIPS) | en_US |
dc.title | Finding friend and foe in multi-agent games | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Serrino, J, Parkes, DC, Kleiman-Weiner, M and Tenenbaum, JB. 2019. "Finding friend and foe in multi-agent games." Advances in Neural Information Processing Systems, 32. | en_US |
dc.contributor.department | Center for Brains, Minds, and Machines | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
dc.relation.journal | Advances in Neural Information Processing Systems | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2021-12-07T13:58:31Z | |
dspace.orderedauthors | Serrino, J; Parkes, DC; Kleiman-Weiner, M; Tenenbaum, JB | en_US |
dspace.date.submission | 2021-12-07T13:58:32Z | |
mit.journal.volume | 32 | en_US |
mit.license | PUBLISHER_POLICY | |
mit.metadata.status | Publication Information Needed | en_US |