Human-centric dialog training via offline reinforcement learning

Jaques, Natasha; Shen, Judy Hanwen; Ghandeharioun, Asma; Ferguson, Craig; Lapedriza, Agata; Jones, Noah; Gu, Shixiang; Picard, Rosalind W.

Author(s)

Jaques, Natasha; Shen, Judy Hanwen; Ghandeharioun, Asma; Ferguson, Craig; Lapedriza, Agata; ... Show more

DownloadPublished version (1.700Mb)

Publisher Policy

Terms of use

Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.

Metadata

Show full item record

Date issued

2020

URI

https://hdl.handle.net/1721.1/146608

Department

Program in Media Arts and Sciences (Massachusetts Institute of Technology)

Journal

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Publisher

Association for Computational Linguistics (ACL)

Citation

Jaques, Natasha, Shen, Judy Hanwen, Ghandeharioun, Asma, Ferguson, Craig, Lapedriza, Agata et al. 2020. "Human-centric dialog training via offline reinforcement learning." Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).

Version: Final published version

Collections

MIT Open Access Articles