cs231n-notes-Lecture-7: Introduction to and Comparison of Optimization Methods

Lecture-7 Training Neural Networks

Optimization

SGD

Cons
1. Very slow progress along the shallow dimension, jitter along the steep direction.
2. Local minima or saddle points. Saddle points are much more common i
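The two SGD drawbacks noted above can be reproduced on toy problems. The sketch below is only an illustration under stated assumptions: it uses plain full-batch gradient descent (no minibatch noise), and the loss functions, the helper `gradient_descent`, the learning rates, and the starting points are all hypothetical choices that do not come from the lecture.

```python
import numpy as np

def gradient_descent(grad_fn, start, lr, steps):
    """Plain gradient descent; returns the full trajectory as an array."""
    w = np.asarray(start, dtype=float)
    path = [w.copy()]
    for _ in range(steps):
        w = w - lr * grad_fn(w)  # vanilla update: w <- w - lr * grad
        path.append(w.copy())
    return np.array(path)

# Assumed toy loss 1: f(w) = 0.5 * (1 * w0^2 + 100 * w1^2).
# w0 is the shallow dimension, w1 is the steep one.
curvature = np.array([1.0, 100.0])

def quad_grad(w):
    return curvature * w

quad_path = gradient_descent(quad_grad, start=[1.0, 1.0], lr=0.019, steps=50)
shallow, steep = quad_path[:, 0], quad_path[:, 1]

# Count how often the steep coordinate changes sign between consecutive steps.
sign_flips = int(np.sum(np.sign(steep[1:]) != np.sign(steep[:-1])))

# Assumed toy loss 2: f(w) = w0^2 - w1^2, which has a saddle point at the origin.
def saddle_grad(w):
    return np.array([2.0 * w[0], -2.0 * w[1]])

# Starting exactly on the w0 axis, the w1 gradient is zero at every step.
saddle_path = gradient_descent(saddle_grad, start=[1.0, 0.0], lr=0.1, steps=100)
final_grad_norm = float(np.linalg.norm(saddle_grad(saddle_path[-1])))

if __name__ == "__main__":
    print("shallow coordinate after 50 steps:", shallow[-1])
    print("sign flips along the steep coordinate:", sign_flips)
    print("gradient norm near the saddle:", final_grad_norm)
```

With a step size close to the stability limit of the steep direction, the steep coordinate overshoots and changes sign on every update, while the shallow coordinate shrinks by only about 2% per step. In the second problem the iterate drifts into the saddle point, where the gradient vanishes even though the origin is not a minimum.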