Show simple item record

dc.contributor.authorFerreira, Kris Johnson
dc.contributor.authorSimchi-Levi, David
dc.contributor.authorWang, He
dc.date.accessioned2020-06-09T21:28:51Z
dc.date.available2020-06-09T21:28:51Z
dc.date.issued2018-11
dc.date.submitted2015-04
dc.identifier.issn0030-364X
dc.identifier.issn1526-5463
dc.identifier.urihttps://hdl.handle.net/1721.1/125757
dc.description.abstractWe consider a price-based network revenue management problem in which a retailer aims to maximize revenue from multiple products with limited inventory over a finite selling season. As is common in practice, we assume the demand function contains unknown parameters that must be learned from sales data. In the presence of these unknown demand parameters, the retailer faces a trade-off commonly referred to as the “exploration-exploitation trade-off.” Toward the beginning of the selling season, the retailer may offer several different prices to try to learn demand at each price (“exploration” objective). Over time, the retailer can use this knowledge to set a price that maximizes revenue throughout the remainder of the selling season (“exploitation” objective). We propose a class of dynamic pricing algorithms that builds on the simple, yet powerful, machine learning technique known as “Thompson sampling” to address the challenge of balancing the exploration-exploitation trade-off under the presence of inventory constraints. Our algorithms have both strong theoretical performance guarantees and promising numerical performance results when compared with other algorithms developed for similar settings. Moreover, we show how our algorithms can be extended for use in general multiarmed bandit problems with resource constraints as well as in applications in other revenue management settings and beyond.en_US
dc.language.isoen
dc.publisherInstitute for Operations Research and the Management Sciences (INFORMS)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1287/opre.2018.1755en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceSSRNen_US
dc.titleOnline Network Revenue Management Using Thompson Samplingen_US
dc.typeArticleen_US
dc.identifier.citation© 2018 INFORMS.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Institute for Data, Systems, and Societyen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Civil and Environmental Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Operations Research Centeren_US
dc.relation.journalOperations Researchen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-06-02T15:38:55Z
dspace.date.submission2020-06-02T15:38:58Z
mit.journal.volume66en_US
mit.journal.issue6en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record