2.6 Gradient Descent with Momentum

Gradient descent with momentum: in one sentence, the basic idea is to compute an exponentially weighted average of your gradients, and then use that averaged gradient, instead of the raw gradient, to update your weights. As an example…
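The idea above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `momentum_step` and `minimize_quadratic`, the hyperparameter values `beta=0.9` and `lr=0.1`, and the quadratic objective are all assumptions chosen for the example and do not come from the text.

```python
def momentum_step(w, grad, v, beta=0.9, lr=0.1):
    """One momentum update on a list of weights (hypothetical helper)."""
    # Exponentially weighted average of the gradients seen so far.
    v_new = [beta * vi + (1 - beta) * gi for vi, gi in zip(v, grad)]
    # Update the weights with the averaged gradient, not the raw one.
    w_new = [wi - lr * vi for wi, vi in zip(w, v_new)]
    return w_new, v_new


def minimize_quadratic(steps=500):
    """Run momentum on an assumed toy objective f(w) = 0.5 * (w0^2 + 10 * w1^2)."""
    w = [5.0, 5.0]   # arbitrary starting point
    v = [0.0, 0.0]   # the running average starts at zero
    for _ in range(steps):
        grad = [w[0], 10.0 * w[1]]  # analytic gradient of the toy objective
        w, v = momentum_step(w, grad, v)
    return w
```

Calling `minimize_quadratic()` drives both weights toward the minimum at the origin. A larger `beta` averages over more past gradients, which smooths the update direction but makes it respond more slowly to a change in the gradient.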