论文笔记——Rethinking the Inception Architecture for Computer Vision

时间 2019-12-13

标签论文笔记 rethinking inception architecture vision 繁體版

原文原文链接

用5G的计算量和25M的参数。With an ensemble of 4 models and multi-crop evaluation, we report 3.5% top-5 error and 17.3% top-1 error.

Avoid representational bottlenecks, especially early in the network.(简单说就是feature map的大小要慢慢的减少。)网络
Higher dimensional representations are easier to process locally within a network. Increasing the activations per tile in a convolutional network allows for more disentangled features. The resulting networks will train faster.(在网络较深层应该利用更多的feature map，有利于容纳更多的分解特征。这样能够加速训练)性能
Spatial aggregation can be done over lower dimensional embeddings without much or any loss in representational power.(也就是bottleneck layer的设计)lua
Balance the width and depth of the network.（Increasing both the width and the depth of the network can contribute to higher quality networks.同时增长网络的深度和宽度）spa

左边引入了 representational bottleneck,右边的会增长大量的计算量，最佳的作法就是减小feature map大小的同时增大channel的数目。