【论文】强化学习必读经典论文 | 如何学习强化学习 | 强化学习入门

Christopher JCH Watkins and Peter Dayan. Q-learning. Machine learning, 8(3-4):279–292, 1992. Gerald Tesauro. Temporal difference learning and TD-gammon. Communications of the ACM, 38(3):58–68, 1995. K
相关文章
相关标签/搜索