Implementing cart-pole with Asynchronous Advantage Actor-Critic (A3C)

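TensorFlow implementation. The code is on GitHub: https://github.com/wweichn/A3C.git

1 Introduction to Asynchronous Advantage Actor-Critic (A3C)

A3C uses two networks, an actor network and a critic network:
1 The actor observes the state and produces an action.
2 The critic scores the state and the action.
3 The actor, based on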
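The actor/critic interplay described above can be sketched in a few lines of plain Python. This is a minimal single-worker illustration, not the code from the linked repository: the class names `LinearActor` and `LinearCritic`, the function `one_step`, the linear models, and the learning rate are all assumptions made for the example. Because the source text is cut off at step 3, the advantage-weighted actor update shown here is simply the standard actor-critic form, and the critic is assumed to estimate a state value that is combined with the reward to score the chosen action. The asynchronous, multi-worker part of A3C is left out.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class LinearActor:
    """Hypothetical actor: a softmax policy with linear logits."""

    def __init__(self, state_dim, n_actions):
        self.w = [[0.0] * state_dim for _ in range(n_actions)]

    def probs(self, state):
        logits = [sum(wi * si for wi, si in zip(row, state)) for row in self.w]
        return softmax(logits)

    def act(self, state, rng):
        # Step 1: the actor observes the state and samples an action.
        p = self.probs(state)
        return rng.choices(range(len(p)), weights=p)[0]

    def update(self, state, action, advantage, lr):
        # Gradient of log pi(a|s) for a softmax-linear policy:
        # (1[k == a] - p_k) * state, scaled by the advantage.
        p = self.probs(state)
        for k, row in enumerate(self.w):
            coeff = (1.0 if k == action else 0.0) - p[k]
            for i, si in enumerate(state):
                row[i] += lr * advantage * coeff * si


class LinearCritic:
    """Hypothetical critic: a linear state-value estimate V(s)."""

    def __init__(self, state_dim):
        self.w = [0.0] * state_dim

    def value(self, state):
        return sum(wi * si for wi, si in zip(self.w, state))

    def update(self, state, target, lr):
        # Move V(s) toward the bootstrapped target.
        err = target - self.value(state)
        for i, si in enumerate(state):
            self.w[i] += lr * err * si


def one_step(actor, critic, state, action, reward, next_state, done,
             gamma=0.99, lr=0.01):
    """One actor-critic update; returns the advantage that was used."""
    # Step 2: the critic scores the outcome of taking `action` in `state`.
    bootstrap = 0.0 if done else critic.value(next_state)
    target = reward + gamma * bootstrap
    advantage = target - critic.value(state)
    # Assumed step 3: the actor shifts probability toward actions
    # the critic scored above its baseline.
    actor.update(state, action, advantage, lr)
    critic.update(state, target, lr)
    return advantage


if __name__ == "__main__":
    rng = random.Random(0)
    actor, critic = LinearActor(4, 2), LinearCritic(4)
    state = [0.02, -0.01, 0.03, 0.04]  # made-up cart-pole-like observation
    action = actor.act(state, rng)
    adv = one_step(actor, critic, state, action, 1.0, state, False)
    print("action:", action, "advantage:", adv)
```

The four-element state and two actions mirror the usual cart-pole setup (cart position and velocity, pole angle and angular velocity; push left or right), but nothing in the sketch depends on a particular environment library.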