Reinforcement Learning in Continuous State and Action Spaces: A Brief Note

时间 2021-01-02

原文原文链接

Thanks Hado van Hasselt for the great work. Introduction In the problems of sequential decision making in continuous domains with delayed reward signals, the main purpose for the algorithms is to lear