【5分钟 Paper】Continuous Control With Deep Reinforcement Learning

论文题目:Continuous Control With Deep Reinforcement Learning 所解决的问题?   这篇文章将Deep Q-Learning运用到Deterministic Policy Gradient算法中。如果了解DPG的话,那这篇文章就是引入DQN改进了一下DPG的state value function。解决了DQN需要寻找maximizes actio
相关文章
相关标签/搜索