dc.contributor.author | Bastani, Hamsa | |
dc.contributor.author | Simchi-Levi, David | |
dc.contributor.author | Zhu, Ruihao | |
dc.date.accessioned | 2023-03-21T17:11:27Z | |
dc.date.available | 2023-03-21T17:11:27Z | |
dc.date.issued | 2022 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/148654 | |
dc.description.abstract | <jats:p> We study the problem of learning shared structure across a sequence of dynamic pricing experiments for related products. We consider a practical formulation in which the unknown demand parameters for each product come from an unknown distribution (prior) that is shared across products. We then propose a meta dynamic pricing algorithm that learns this prior online while solving a sequence of Thompson sampling pricing experiments (each with horizon T) for N different products. Our algorithm addresses two challenges: (i) balancing the need to learn the prior (meta-exploration) with the need to leverage the estimated prior to achieve good performance (meta-exploitation) and (ii) accounting for uncertainty in the estimated prior by appropriately “widening” the estimated prior as a function of its estimation error. We introduce a novel prior alignment technique to analyze the regret of Thompson sampling with a misspecified prior, which may be of independent interest. Unlike prior-independent approaches, our algorithm’s meta regret grows sublinearly in N, demonstrating that the price of an unknown prior in Thompson sampling can be negligible in experiment-rich environments (large N). Numerical experiments on synthetic and real auto loan data demonstrate that our algorithm significantly speeds up learning compared with prior-independent algorithms. </jats:p><jats:p> This paper was accepted by George J. Shanthikumar, Management Science Special Section on Data-Driven Prescriptive Analytics. </jats:p> | en_US |
dc.language.iso | en | |
dc.publisher | Institute for Operations Research and the Management Sciences (INFORMS) | en_US |
dc.relation.isversionof | 10.1287/MNSC.2021.4071 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | SSRN | en_US |
dc.title | Meta Dynamic Pricing: Transfer Learning Across Experiments | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Bastani, Hamsa, Simchi-Levi, David and Zhu, Ruihao. 2022. "Meta Dynamic Pricing: Transfer Learning Across Experiments." Management Science, 68 (3). | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Civil and Environmental Engineering | en_US |
dc.relation.journal | Management Science | en_US |
dc.eprint.version | Original manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2023-03-21T17:08:49Z | |
dspace.orderedauthors | Bastani, H; Simchi-Levi, D; Zhu, R | en_US |
dspace.date.submission | 2023-03-21T17:08:51Z | |
mit.journal.volume | 68 | en_US |
mit.journal.issue | 3 | en_US |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |