Asynchronous stochastic approximation and Q-learning
Author(s)
Tsitsiklis, John N.; Massachusetts Institute of Technology. Laboratory for Information and Decision Systems.
DownloadP-2172-28187066.pdf (1.167Mb)
Metadata
Show full item recordDescription
Includes bibliographical references (p. 18-20).
Date issued
1993Publisher
Massachusetts Institute of Technology, Laboratory for Information and Decision Systems]
Series/Report no.
LIDS-P ; 2172