| dc.contributor.author | Geramifard, Alborz | |
| dc.contributor.author | Walsh, Thomas J. | |
| dc.contributor.author | Roy, Nicholas | |
| dc.contributor.author | How, Jonathan P. | |
| dc.date.accessioned | 2015-05-19T18:41:05Z | |
| dc.date.available | 2015-05-19T18:41:05Z | |
| dc.date.issued | 2013-07 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/97035 | |
| dc.description.abstract | Matching pursuit (MP) methods are a promising class of feature construction algorithms for value function approximation. Yet existing MP methods require creating a pool of potential features, mandating expert knowledge or enumeration of a large feature pool, both of which hinder scalability. This paper introduces batch incremental feature dependency discovery (Batch-iFDD) as an MP method that inherits a provable convergence property. Additionally, Batch-iFDD does not require a large pool of features, leading to lower computational complexity. Empirical policy evaluation results across three domains with up to one million states highlight the scalability of Batch-iFDD over the previous state of the art MP algorithm. | en_US |
| dc.description.sponsorship | United States. Office of Naval Research (Grant N00014-07-1-0749) | en_US |
| dc.description.sponsorship | United States. Office of Naval Research (Grant N00014-11-1-0688) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Uncertainty in Artificial Intelligence (AUAI) | en_US |
| dc.relation.isversionof | http://auai.org/uai2013/prints/papers/139.pdf | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | arXiv | en_US |
| dc.title | Batch-iFDD for representation expansion in large MDPs | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Geramifard, Alborz, Thomas J. Walsh, Nicholas Roy, and Jonathan P. How. "Batch-iFDD for representation expansion in large MDPs." 2013 Conference on Uncertainty in Artificial Intelligence (July 2013). | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Laboratory for Information and Decision Systems | en_US |
| dc.contributor.mitauthor | Geramifard, Alborz | en_US |
| dc.contributor.mitauthor | Walsh, Thomas J. | en_US |
| dc.contributor.mitauthor | Roy, Nicholas | en_US |
| dc.contributor.mitauthor | How, Jonathan P. | en_US |
| dc.relation.journal | Proceedings of the 2013 Conference on Uncertainty in Artificial Intelligence | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Geramifard, Alborz; Walsh, Thomas J.; Roy, Nicholas; How, Jonathan P. | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0001-8576-1930 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-2508-1957 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-8293-0492 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |