将3D卷积分解为spatial convolution in each channel and linear projection across channels.
(spatial convolution + linear projection.)spa
简单归纳就是spatial conv + linear projection,可是在spatial conv的时候用了一个residual connection,感受颇有道理,例如是一个vertical edge detector,那么horizontal information将丢失。这个和后来的MobileNet中的depthwise conv + pointwise conv很是的像。orm