Show simple item record

dc.contributor.authorDann, Christoph
dc.contributor.authorDabney, William
dc.contributor.authorGeramifard, Alborz
dc.contributor.authorKlein, Robert Henry
dc.contributor.authorHow, Jonathan P.
dc.date.accessioned2016-12-07T19:45:44Z
dc.date.available2016-12-07T19:45:44Z
dc.date.issued2015-08
dc.date.submitted2014-11
dc.identifier.issn1532-4435
dc.identifier.issn1533-7928
dc.identifier.urihttp://hdl.handle.net/1721.1/105742
dc.description.abstractRLPy is an object-oriented reinforcement learning software package with a focus on valuefunction-based methods using linear function approximation and discrete actions. The framework was designed for both educational and research purposes. It provides a rich library of fine-grained, easily exchangeable components for learning agents (e.g., policies or representations of value functions), facilitating recently increased specialization in reinforcement learning. RLPy is written in Python to allow fast prototyping, but is also suitable for large-scale experiments through its built-in support for optimized numerical libraries and parallelization. Code profiling, domain visualizations, and data analysis are integrated in a self-contained package available under the Modified BSD License at http://github.com/rlpy/rlpy. All of these properties allow users to compare various reinforcement learning algorithms with little effort.en_US
dc.language.isoen_US
dc.publisherMIT Pressen_US
dc.relation.isversionofhttp://jmlr.org/papers/v16/geramifard15a.htmlen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceMIT Pressen_US
dc.titleRlpy: A Value-Function-Based Reinforcement Learning Framework for Education and Researchen_US
dc.typeArticleen_US
dc.identifier.citationGeramifard, Alborz et al. "RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research." Journal of Machine Learning Research 16 (2015):1573−1578.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.contributor.mitauthorGeramifard, Alborz
dc.contributor.mitauthorKlein, Robert Henry
dc.contributor.mitauthorHow, Jonathan P.
dc.relation.journalJournal of Machine Learning Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsGeramifard, Alborz; Dann, Christoph; Klein, Robert H.; Dabney, William; How, Jonathan P.en_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-2508-1957
mit.licensePUBLISHER_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record