Now showing items 1-1 of 1

    • Reinforcement Learning by Policy Search 

      Peshkin, Leonid (2003-02-14)
      One objective of artificial intelligence is to model the behavior of an intelligent agent interacting with its environment. The environment's transformations can be modeled as a Markov chain, whose state is partially ...