AdamW Optimization Algorithm: Notes

https://www.jiqizhixin.com/articles/2018-07-03-14

Example: https://github.com/ShikamaruZhang/AdamW

optim_adam = torch.optim.Adam(net_Adam.parameters(), lr=LR, betas=(0.9, 0.99), weight_decay=WD)
optim_W
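
Passing weight_decay to torch.optim.Adam, as in the example, applies the decay as L2 regularization: the term is added to the gradient before the adaptive scaling. AdamW instead applies the decay directly to the weights, separately from the adaptive step. The following is a minimal pure-Python sketch of that difference for a single scalar parameter. The function name adam_step and the decoupled flag are hypothetical names chosen for illustration and are not taken from the linked repository.

```python
import math


def adam_step(w, grad, m, v, t, lr=1e-3, betas=(0.9, 0.99), eps=1e-8,
              weight_decay=0.0, decoupled=False):
    """One Adam update on a scalar weight (illustrative sketch only).

    decoupled=False: weight decay is folded into the gradient (L2 style).
    decoupled=True:  weight decay is applied directly to the weight (AdamW style).
    """
    b1, b2 = betas
    if not decoupled:
        # L2 regularization: the decay term passes through the moment
        # estimates and is rescaled by the adaptive denominator.
        grad = grad + weight_decay * w

    # Exponential moving averages of the gradient and its square.
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad * grad

    # Bias correction for the zero-initialized moments (t starts at 1).
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)

    w_new = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    if decoupled:
        # Decoupled decay: shrink the weight by lr * weight_decay * w,
        # independent of the gradient statistics.
        w_new -= lr * weight_decay * w
    return w_new, m, v


# Zero loss gradient, so only the weight decay moves the weight.
w_l2, _, _ = adam_step(1.0, 0.0, 0.0, 0.0, t=1, lr=0.01,
                       weight_decay=0.1, decoupled=False)
w_adamw, _, _ = adam_step(1.0, 0.0, 0.0, 0.0, t=1, lr=0.01,
                          weight_decay=0.1, decoupled=True)
print(w_l2, w_adamw)
```

With a zero loss gradient, the L2 variant moves the weight by roughly lr on the first step regardless of the decay coefficient, because the adaptive normalization cancels the coefficient's magnitude, while the decoupled variant moves it by lr * weight_decay * w. Recent PyTorch releases also provide torch.optim.AdamW, which applies the decoupled form.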