Show simple item record

dc.contributor.advisor [en_US]: Wallace E. Vander Velde.
dc.contributor.author [en_US]: Rogers, Keith Eric
dc.contributor.other [en_US]: Massachusetts Institute of Technology. Dept. of Aeronautics and Astronautics.
dc.date.accessioned: 2005-09-26T19:10:51Z
dc.date.available: 2005-09-26T19:10:51Z
dc.date.copyright [en_US]: 1999
dc.date.issued [en_US]: 1999
dc.identifier.uri: http://hdl.handle.net/1721.1/28216
dc.description [en_US]: Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Aeronautics and Astronautics, 1999.
dc.description [en_US]: Includes bibliographical references (p. 257-263).
dc.description.abstract [en_US]:

There has long been a significant gap between the theory and practice of measurement scheduling for state estimation problems. Theoretical papers tend to treat small-scale, linear problems rigorously, with methods well grounded in optimization theory; practical applications treat high-dimensional, nonlinear problems with heuristic policies. The work in this thesis attempts to bridge that gap by using reinforcement learning (RL) on real-world problems, and in doing so it makes contributions to the fields of both measurement scheduling and RL.

On the measurement scheduling side, a unified formulation is presented which encompasses the wide variety of problems found in the literature as well as more complex variations. This formulation is used with RL to handle a series of problems of increasing difficulty. Both continuous and discrete action spaces are treated, and RL is shown to be effective with both. The RL-based methods beat alternative methods from the literature in one case, and consistently match or beat heuristics on both high-dimensional linear problems and simple nonlinear problems. Finally, RL is applied to a high-dimensional nonlinear problem in radar tracking and outperforms the best available heuristic by as much as 35%. In treating these problems, a useful synergy is shown to exist between learned and heuristic policies, with each helping to verify and improve the performance of the other.

On the reinforcement learning side, the contribution comes mainly from applying the algorithms in an extremely adverse environment. The measurement scheduling problems treated involve high-dimensional, continuous input spaces and continuous action spaces. The nonlinear cases must use sub-optimal nonlinear filters and are hence non-Markovian, and cost feedback comes in terms of internally propagated states with a sometimes tenuous connection to the environment. In a field where typical applications have both finite state spaces and finite action spaces, these problems test the limits of RL's usability. Some advances are also made in the treatment of problems where the cost differential is much smaller in the action direction than in the state direction: learning algorithms are presented for a class of transformations to Bellman's equation, of which Advantage Learning represents a special case; conditions under which Advantage Learning may diverge are described; and an alternative algorithm, called G-Learning, is given which fixes the problem for a sample case.
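The thesis text itself is not part of this record, but the abstract's closing reference can be illustrated. Below is a minimal sketch of a tabular Advantage Learning backup based on the standard published formulation (Harmon and Baird), not on the thesis's own algorithms; the function name, parameter values, and the choice of a tabular representation are illustrative assumptions.

    import numpy as np

    def advantage_learning_update(A, s, a, r, s_next,
                                  alpha=0.1, gamma=0.99, k=0.1):
        """One tabular Advantage Learning backup on table A[state, action].

        A sketch of the standard Harmon & Baird formulation, not the
        thesis's implementation; all parameter choices are illustrative.
        """
        v_s = np.max(A[s])          # max_a' A(s, a') plays the role of V(s)
        v_next = np.max(A[s_next])  # V(s') from the successor state
        # The Q-learning target is rescaled about V(s) by 1/k, which
        # magnifies the value differences between actions.
        target = v_s + (r + gamma * v_next - v_s) / k
        A[s, a] += alpha * (target - A[s, a])
        return A

Setting k = 1 recovers the ordinary Q-learning target, while k < 1 expands the cost differential in the action direction, which is the regime the abstract describes. The divergence conditions for Advantage Learning and the G-Learning fix are specific to the thesis and are not reproduced here.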
dc.description.statementofresponsibility [en_US]: by Keith Eric Rogers.
dc.format.extent [en_US]: 263 p.
dc.format.extent: 13117662 bytes
dc.format.extent: 13152680 bytes
dc.format.mimetype: application/pdf
dc.format.mimetype: application/pdf
dc.language.iso: en_US
dc.publisher [en_US]: Massachusetts Institute of Technology
dc.rights [en_US]: M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.
dc.rights.uri: http://dspace.mit.edu/handle/1721.1/7582
dc.subject [en_US]: Aeronautics and Astronautics.
dc.title [en_US]: Scheduling of costly measurements for state estimation using reinforcement learning
dc.type [en_US]: Thesis
dc.description.degree [en_US]: Ph.D.
dc.contributor.department: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.identifier.oclc [en_US]: 44599094

