Show simple item record

dc.contributor.authorBastani, Hamsa
dc.contributor.authorSimchi-Levi, David
dc.contributor.authorZhu, Ruihao
dc.date.accessioned2023-03-21T17:11:27Z
dc.date.available2023-03-21T17:11:27Z
dc.date.issued2022
dc.identifier.urihttps://hdl.handle.net/1721.1/148654
dc.description.abstract<jats:p> We study the problem of learning shared structure across a sequence of dynamic pricing experiments for related products. We consider a practical formulation in which the unknown demand parameters for each product come from an unknown distribution (prior) that is shared across products. We then propose a meta dynamic pricing algorithm that learns this prior online while solving a sequence of Thompson sampling pricing experiments (each with horizon T) for N different products. Our algorithm addresses two challenges: (i) balancing the need to learn the prior (meta-exploration) with the need to leverage the estimated prior to achieve good performance (meta-exploitation) and (ii) accounting for uncertainty in the estimated prior by appropriately “widening” the estimated prior as a function of its estimation error. We introduce a novel prior alignment technique to analyze the regret of Thompson sampling with a misspecified prior, which may be of independent interest. Unlike prior-independent approaches, our algorithm’s meta regret grows sublinearly in N, demonstrating that the price of an unknown prior in Thompson sampling can be negligible in experiment-rich environments (large N). Numerical experiments on synthetic and real auto loan data demonstrate that our algorithm significantly speeds up learning compared with prior-independent algorithms. </jats:p><jats:p> This paper was accepted by George J. Shanthikumar, Management Science Special Section on Data-Driven Prescriptive Analytics. </jats:p>en_US
dc.language.isoen
dc.publisherInstitute for Operations Research and the Management Sciences (INFORMS)en_US
dc.relation.isversionof10.1287/MNSC.2021.4071en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceSSRNen_US
dc.titleMeta Dynamic Pricing: Transfer Learning Across Experimentsen_US
dc.typeArticleen_US
dc.identifier.citationBastani, Hamsa, Simchi-Levi, David and Zhu, Ruihao. 2022. "Meta Dynamic Pricing: Transfer Learning Across Experiments." Management Science, 68 (3).
dc.contributor.departmentMassachusetts Institute of Technology. Department of Civil and Environmental Engineeringen_US
dc.relation.journalManagement Scienceen_US
dc.eprint.versionOriginal manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2023-03-21T17:08:49Z
dspace.orderedauthorsBastani, H; Simchi-Levi, D; Zhu, Ren_US
dspace.date.submission2023-03-21T17:08:51Z
mit.journal.volume68en_US
mit.journal.issue3en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record