dc.contributor.author | Narasimhan, Karthik Rajagopal | |
dc.contributor.author | Kulkarni, Tejas Dattatraya | |
dc.contributor.author | Barzilay, Regina | |
dc.date.accessioned | 2015-09-24T17:33:12Z | |
dc.date.available | 2015-09-24T17:33:12Z | |
dc.date.issued | 2015-09 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/98900 | |
dc.description.abstract | In this paper, we consider the task of learning control policies for text-based games. In these games, all interactions in the virtual world are through text and the underlying state is not observed. The resulting language barrier makes such environments challenging for automatic game players. We employ a deep reinforcement learning framework to jointly learn state representations and action policies using game rewards as feedback. This framework enables us to map text descriptions into vector representations that capture the semantics of the game states. We evaluate our approach on two game worlds, comparing against baselines using bag-of-words and bag-of-bigrams for state representations. Our algorithm outperforms the baselines on both worlds, demonstrating the importance of learning expressive representations. | en_US |
dc.description.sponsorship | Leventhal Fellowship | en_US |
dc.description.sponsorship | MIT Center for Brains, Minds and Machines | en_US |
dc.language.iso | en_US | |
dc.publisher | Association for Computational Linguistics | en_US |
dc.relation.isversionof | https://aclweb.org/anthology/D/D15/D15-1001.pdf | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Narasimhan | en_US |
dc.title | Language Understanding for Text-based Games using Deep Reinforcement Learning | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Narasimhan, Karthik, Tejas D. Kulkarni, and Regina Barzilay. "Language Understanding for Text-based Games using Deep Reinforcement Learning." 2015 Conference on Empirical Methods in Natural Language Processing (September 2015). | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.approver | Narasimhan, Karthik Rajagopal | en_US |
dc.contributor.mitauthor | Narasimhan, Karthik Rajagopal | en_US |
dc.contributor.mitauthor | Kulkarni, Tejas Dattatraya | en_US |
dc.contributor.mitauthor | Barzilay, Regina | en_US |
dc.relation.journal | Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Narasimhan, Karthik; Kulkarni, Tejas D.; Barzilay, Regina | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-7077-2765 | |
dc.identifier.orcid | https://orcid.org/0000-0002-2921-8201 | |
dc.identifier.orcid | https://orcid.org/0000-0001-9894-9983 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |