Chapter 9 On-policy Prediction with Approximation

These are reading notes on "Reinforcement Learning: An Introduction".
9.1 Value-function Approximation
9.2 The Prediction Objective ($\overline{\mathrm{VE}}$)
9.3 Stochastic-gradient and Semi-gradient Methods
9.4 Linear Methods