Online learning in repeated auctions

Rigolette, Philippe; Weed, Jonathan

Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/137801.2

Author(s)

Rigolette, Philippe; Weed, Jonathan

DownloadSubmitted version (521.5Kb)

Open Access Policy

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

© 2016 J. Weed, V. Perchet & P. Rigollet. Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good's value once it is purchased. We adopt an online learning approach with bandit feedback to model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical. Comparing our performance against that of the best fixed bid in hindsight, we show that sublinear regret is also achievable in this case. For both the stochastic and adversarial models, we prove matching minimax lower bounds showing our strategies to be optimal up to lower-order terms. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.

URI

https://hdl.handle.net/1721.1/137801

Citation

Rigolette, Philippe and Weed, Jonathan. "Online learning in repeated auctions."

Version: Original manuscript

Collections

MIT Open Access Articles

Version	Item	Date	Summary
2	1721.1/137801.2	2022-08-05T17:30:05Z	Metadata changed: Verified or entered author name and department authority metadata.
1	1721.1/137801*	2021-11-08T19:41:47Z

*Selected version

DSpace@MIT

Notice