MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Computing Equilibria in Colonel Blotto by Applying Counterfactual Regret Minimization Using a Layered Graph Representation

Author(s)
Zhang, Isaac S.
Thumbnail
DownloadThesis PDF (1.173Mb)
Advisor
Farina, Gabriele
Terms of use
Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/
Metadata
Show full item record
Abstract
Equilibrium computation of games is one of the fundamental problems at the intersection of computer science and economics. Many popular games have been solved to superhuman levels with a variety of learning techniques, such as diplomacy, many different variants of poker, and most notably, chess. In this paper, we will be focusing on the game of Colonel Blotto which is a classic problem that was first introduced by Emile Borel[1] in his seminal 1921 talk on the theory of games. Colonel Blotto is a game played between two colonels who must distribute their troops among different battlefields, and it is scored by a winner-take-all rule. Colonel Blotto has historically been difficult to solve due to its immense action space. [2] was the first to formulate the game in a linear program and later, [3] was able to greatly improve their formulation in practice by representing the action space using layered graphs. Recently, the multiplicative weights update (MWU) algorithm was implemented in Colonel Blotto by [4] that took advantage of sampling from the action space to learn in larger game settings. We take advantage of the layered graph representation from [3] and use it to run counterfactual regret minimization (CFR) on Colonel Blotto for the first time. CFR is a state-of-the-art learning algorithm that permits parameter free learning and has practical performance that is much better than its theoretical bounds.
Date issued
2024-05
URI
https://hdl.handle.net/1721.1/156784
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.