Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery

Ure, N. Kemal; Geramifard, Alborz; Chowdhary, Girish; How, Jonathan P.

dc.contributor.author	Geramifard, Alborz
dc.contributor.author	Chowdhary, Girish
dc.contributor.author	How, Jonathan P.
dc.contributor.author	Ure, Nazim Kemal
dc.date.accessioned	2013-10-25T13:18:47Z
dc.date.available	2013-10-25T13:18:47Z
dc.date.issued	2012-09
dc.identifier.isbn	978-3-642-33485-6
dc.identifier.isbn	978-3-642-33486-3
dc.identifier.issn	0302-9743
dc.identifier.issn	1611-3349
dc.identifier.uri	http://hdl.handle.net/1721.1/81767
dc.description.abstract	Solving large scale sequential decision making problems without prior knowledge of the state transition model is a key problem in the planning literature. One approach to tackle this problem is to learn the state transition model online using limited observed measurements. We present an adaptive function approximator (incremental Feature Dependency Discovery (iFDD)) that grows the set of features online to approximately represent the transition model. The approach leverages existing feature-dependencies to build a sparse representation of the state transition model. Theoretical analysis and numerical simulations in domains with state space sizes varying from thousands to millions are used to illustrate the benefit of using iFDD for incrementally building transition models in a planning framework.	en_US
dc.language.iso	en_US
dc.publisher	Springer-Verlag	en_US
dc.relation.isversionof	http://dx.doi.org/10.1007/978-3-642-33486-3_7	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	Other University Web Domain	en_US
dc.title	Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery	en_US
dc.type	Article	en_US
dc.identifier.citation	Ure, N.Kemal et al. “Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery.” Machine Learning and Knowledge Discovery in Databases. Ed. PeterA. Flach, Tijl Bie, and Nello Cristianini. Vol. 7524. Springer Berlin Heidelberg, 2012. 99–115. Lecture Notes in Computer Science.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems	en_US
dc.contributor.mitauthor	Ure, Nazim Kemal	en_US
dc.contributor.mitauthor	Geramifard, Alborz	en_US
dc.contributor.mitauthor	Chowdhary, Girish	en_US
dc.contributor.mitauthor	How, Jonathan P.	en_US
dc.relation.journal	Machine Learning and Knowledge Discovery in Databases	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Ure, N. Kemal; Geramifard, Alborz; Chowdhary, Girish; How, Jonathan P.	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-2508-1957
dc.identifier.orcid	https://orcid.org/0000-0001-8576-1930
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: How_Adaptive planning.pdf
Size:: 623.4Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record