pytorch 显示网络结构，显示优化器的设置代码，

时间 2020-12-30

原文原文链接

最近注意到在一些caffe模型中，偏置项的学习率通常设置为普通层的两倍。具体原因可以参考(https://datascience.stackexchange.com/questions/23549/why-is-the-learning-rate-for-the-bias-usually-twice-as-large-as-the-the-lr-for-t)，貌似没有文章提到这个。 Pytorch