No-Regret Learning in General Games

Fishelson, Maxwell K.

dc.contributor.advisor	Daskalakis, Constantinos
dc.contributor.author	Fishelson, Maxwell K.
dc.date.accessioned	2023-03-31T14:41:01Z
dc.date.available	2023-03-31T14:41:01Z
dc.date.issued	2023-02
dc.date.submitted	2023-02-28T14:36:07.020Z
dc.identifier.uri	https://hdl.handle.net/1721.1/150227
dc.description.abstract	This thesis investigates the regret performance of no-regret learning algorithms in the competitive, though not fully-adversarial, environment of games. We establish exponential improvements on previously best-known external and internal regret bounds for these settings. We show that Optimistic Hedge – a common variant of multiplicative-weights-updates with recency bias – attains poly(log T) regret in multi-player general-sum games. In particular, when every player of the game uses Optimistic Hedge to iteratively update her strategy in response to the history of play so far, then after T rounds of interaction, each player experiences total regret that is poly(log T). Our bound improves, exponentially, the O(T¹ᐟ²) regret attainable by standard no-regret learners in games, the O(T¹ᐟ⁴) regret attainable by no-regret learners with recency bias [Syr+15], and the O(T¹ᐟ⁶) bound that was recently shown for Optimistic Hedge in the special case of two-player games [CP20]. A corollary of our bound is that Optimistic Hedge converges to coarse correlated equilibrium in general games at a rate of [formula]. We then extend this result from external regret to internal and swap regret, thereby establishing uncoupled learning dynamics that converge to an approximate correlated equilibrium at the rate of [formula]. This substantially improves over the prior best rate of convergence for correlated equilibria of O(T⁻³ᐟ⁴) due to Chen and Peng (NeurIPS ‘20), and it is optimal up to polylogarithmic factors in T. The results presented here originate from my works [DFG21] and [Ana+22].
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright MIT
dc.rights.uri	http://rightsstatements.org/page/InC-EDU/1.0/
dc.title	No-Regret Learning in General Games
dc.type	Thesis
dc.description.degree	S.M.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Science in Electrical Engineering and Computer Science

Files in this item

Name:: Fishelson-maxfish-SM-EECS-2023 ...
Size:: 868.1Kb
Format:: PDF
Description:: Thesis PDF

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record