Planning by Dynamic Programming

Dynamic Programming (DP) refers to a collection of algorithms that can be used to compute optimal policies given a perfect model of the environment as an MDP. Dynamic: sequential or temporal component to
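
To make "computing an optimal policy from a perfect model" concrete, here is a minimal sketch of value iteration, one of the algorithms in the DP family. The two-state MDP, the action names `stay` and `go`, the discount factor, and the helper names `q_value` and `value_iteration` are all assumptions made for illustration; none of them come from the original text.

```python
# Hypothetical toy MDP, fully known in advance (the "perfect model").
# P[state][action] = list of (probability, next_state, reward)
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 1.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "go": [(1.0, 0, 0.0)]},
}


def q_value(P, V, s, a, gamma):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])


def value_iteration(P, gamma=0.9, tol=1e-8):
    """Apply the Bellman optimality backup until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            # Back up the best one-step lookahead value (in-place update).
            best = max(q_value(P, V, s, a, gamma) for a in P[s])
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    # Read off a policy that is greedy with respect to the converged values.
    policy = {s: max(P[s], key=lambda a: q_value(P, V, s, a, gamma)) for s in P}
    return V, policy


V, policy = value_iteration(P)
print(V, policy)
```

In this toy model the greedy policy moves from state 0 to state 1 and then stays there, since staying in state 1 yields the largest reward at every step. The sketch relies on the transition probabilities and rewards being known exactly, which is the setting DP assumes.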