Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition

Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng

Publisher Policy

Terms of use

Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.

Date issued

2020

URI

https://hdl.handle.net/1721.1/143895

Department

Massachusetts Institute of Technology. Institute for Data, Systems, and Society; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Journal

International Conference on Machine Learning, Vol. 119

Citation

Jin, Chi, Jin, Tiancheng, Luo, Haipeng, Sra, Suvrit and Yu, Tiancheng. 2020. "Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition." International Conference on Machine Learning, Vol. 119.

Version: Final published version

Collections

MIT Open Access Articles