Chapter 9 On-policy Prediction with Approximation

时间 2021-01-02

原文原文链接

本文为《Reinforcement Learning: An Introduction》读书笔记 9.1 Value-function Approximation 9.2 The Prediction Objective ( VE¯¯¯¯¯¯¯¯ V E ¯ ) 9.3 Stochastic-gradient and Semi-gradient Methods 9.4 Linear Methods