Browsing AI Technical Reports (1964 - 2004) by Subject "policy search"

Reinforcement Learning by Policy Search

Peshkin, Leonid (2003-02-14)

One objective of artificial intelligence is to model the behavior of an intelligent agent interacting with its environment. The environment's transformations can be modeled as a Markov chain, whose state is partially ...