World-class interpretable poker

Bertsimas, Dimitris; Paskov, Alex

Author(s)

Bertsimas, Dimitris; Paskov, Alex

Download10994_2022_Article_6179.pdf (1.547Mb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution https://creativecommons.org/licenses/by/4.0

Metadata

Show full item record

Abstract

Abstract We address the problem of interpretability in iterative game solving for imperfect-information games such as poker. This lack of interpretability has two main sources: first, the use of an uninterpretable feature representation, and second, the use of black box methods such as neural networks, for the fitting procedure. In this paper, we present advances on both fronts. Namely, first we propose a novel, compact, and easy-to-understand game-state feature representation for Heads-up No-limit (HUNL) Poker. Second, we make use of globally optimal decision trees, paired with a counterfactual regret minimization (CFR) self-play algorithm, to train our poker bot which produces an entirely interpretable agent. Through experiments against Slumbot, the winner of the most recent Annual Computer Poker Competition, we demonstrate that our approach yields a HUNL Poker agent that is capable of beating the Slumbot. Most exciting of all, the resulting poker bot is highly interpretable, allowing humans to learn from the novel strategies it discovers.

Date issued

2022-06

URI

https://hdl.handle.net/1721.1/142962.2

Department

Sloan School of Management; Massachusetts Institute of Technology. Operations Research Center

Journal

Machine Learning

Publisher

Springer Science and Business Media LLC

Citation

Bertsimas, Dimitris and Paskov, Alex. 2022. "World-class interpretable poker."

Version: Final published version

ISSN

0885-6125

1573-0565

Collections

MIT Open Access Articles

Version	Item	Date	Summary
2	1721.1/142962.2*	2022-06-13T18:18:23Z	Publication information verified/added.
1	1721.1/142962	2022-06-13T12:56:43Z

DSpace@MIT