Chapter 1 Introduction

The main components of reinforcement learning: an agent, an environment, a policy, a reward signal, a value function, and (optionally) a model of the environment.

Reinforcement learning is a computational approach to understanding and automating goal-directed
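
To show how the components listed above fit together, here is a minimal sketch in Python. Everything in it is an assumption made for illustration and none of it comes from the original note: the five-cell `Corridor` environment, the helper names `model`, `lookahead`, `policy`, and `train`, the parameter values, and the choice of a one-step temporal-difference update for the value function. The model is also assumed to match the environment exactly, which a real problem rarely offers.

```python
import random

ACTIONS = (-1, +1)  # hypothetical action set: step left or step right


def model(state, action, length):
    """Model of the environment: predicts (next_state, reward, done).

    Assumption: the model is exact, so the environment reuses it below.
    """
    next_state = min(max(state + action, 0), length - 1)
    done = next_state == length - 1
    reward = 1.0 if done else 0.0  # reward signal: 1 only on reaching the goal
    return next_state, reward, done


class Corridor:
    """Environment: cells 0..length-1, start in cell 0, goal in the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        self.state, reward, done = model(self.state, action, self.length)
        return self.state, reward, done


def lookahead(values, state, action, length, gamma):
    """One-step return predicted by the model plus the value function."""
    next_state, reward, done = model(state, action, length)
    return reward + (0.0 if done else gamma * values[next_state])


def policy(values, state, length, gamma, epsilon, rng):
    """Policy: epsilon-greedy over one-step lookahead, ties broken at random."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    scores = {a: lookahead(values, state, a, length, gamma) for a in ACTIONS}
    best = max(scores.values())
    return rng.choice([a for a in ACTIONS if scores[a] == best])


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Agent: interacts with the environment and updates its value function."""
    rng = random.Random(seed)
    env = Corridor()
    values = [0.0] * env.length  # value function: one estimate per state
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            action = policy(values, state, env.length, gamma, epsilon, rng)
            next_state, reward, done = env.step(action)
            # One-step temporal-difference update toward the observed target.
            target = reward + (0.0 if done else gamma * values[next_state])
            values[state] += alpha * (target - values[state])
            state = next_state
    return values


if __name__ == "__main__":
    print(train())
```

In this sketch the reward signal says only what is good immediately (reaching the last cell), while the value function estimates what is good in the long run, and the policy consults the model to plan one step ahead. Dropping `model` and learning action values directly would give a model-free variant of the same loop.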