AlphaGo Zero 与深度强化学习(一) 概述

时间 2021-01-12

原文原文链接

AlphaGo Zero 与深度强化学习(一) 概述原文: Mastering the Game of Go without Human Knowledge(2017) AlphaGo Zero 与深度强化学习一概述概览做的什么提到的的技术优势不足老式机器学习方法强化学习前身AlphaGo Fan Lee 两个深度网络训练时规则网一个决策网训练后 AlphaZero 中