ResNet

Paper: Deep Residual Learning for Image Recognition

Q: Is learning better networks as easy as stacking more layers?
A: An obstacle is vanishing/exploding gradients, which has been largely addressed by normalized initialization and intermediate normalization layers.
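To make the point about normalized initialization concrete, here is a minimal sketch in pure Python. It pushes a random vector through a stack of linear + ReLU layers and compares the output magnitude under two weight scales. The helper `final_rms`, the width/depth values, and the choice of He-style scaling (`std = sqrt(2 / fan_in)`) as the "normalized" scheme are assumptions made for illustration; the note itself does not specify a particular scheme. The forward signal is used here as a simple proxy, since gradients in the backward pass scale in a similar way.

```python
import math
import random


def final_rms(depth, width, weight_std, seed=0):
    """Push a random vector through `depth` linear+ReLU layers
    and return the root-mean-square of the final activations."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(width)]
    for _ in range(depth):
        y = []
        for _ in range(width):
            # One output unit: dot product with freshly sampled Gaussian weights.
            s = sum(rng.gauss(0.0, weight_std) * xi for xi in x)
            y.append(max(0.0, s))  # ReLU
        x = y
    return math.sqrt(sum(v * v for v in x) / width)


WIDTH, DEPTH = 64, 20  # illustrative sizes, not taken from the paper

# Fixed small std: the signal shrinks by a constant factor at every layer.
naive = final_rms(DEPTH, WIDTH, weight_std=0.01)

# He-style scaling (assumed stand-in for "normalized initialization"):
# keeps the expected squared activation roughly constant across layers.
he = final_rms(DEPTH, WIDTH, weight_std=math.sqrt(2.0 / WIDTH))

print(f"naive init RMS after {DEPTH} layers: {naive:.3e}")
print(f"He-style init RMS after {DEPTH} layers: {he:.3e}")
```

With the fixed small scale the activations collapse toward zero as depth grows, while the fan-in-scaled weights keep them at a usable magnitude. This only illustrates why initialization scale matters for deep stacks; it is not a reproduction of anything in the paper.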