Human-centric dialog training via offline reinforcement learning
Author(s)
Jaques, Natasha; Shen, Judy Hanwen; Ghandeharioun, Asma; Ferguson, Craig; Lapedriza, Agata; Jones, Noah; Gu, Shixiang; Picard, Rosalind W.; ... Show more Show less
DownloadPublished version (1.700Mb)
Publisher Policy
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordDate issued
2020Department
Program in Media Arts and Sciences (Massachusetts Institute of Technology)Journal
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Publisher
Association for Computational Linguistics (ACL)
Citation
Jaques, Natasha, Shen, Judy Hanwen, Ghandeharioun, Asma, Ferguson, Craig, Lapedriza, Agata et al. 2020. "Human-centric dialog training via offline reinforcement learning." Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).
Version: Final published version