Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
Author(s)
Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng
DownloadPublished version (330.1Kb)
Publisher Policy
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordDate issued
2020Department
Massachusetts Institute of Technology. Institute for Data, Systems, and Society; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119
Citation
Jin, Chi, Jin, Tiancheng, Luo, Haipeng, Sra, Suvrit and Yu, Tiancheng. 2020. "Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition." INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 119.
Version: Final published version