DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices

DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices 作者将CNN中的层分为两种:(1)Tensor Layer,如conv,fc;(2)Non-tensor Layer,如pooling,norm。作者发现网络进行前向推理时,Non-tensor layer也会占用不少时间和内存,作者提出用单独的Ten
相关文章
相关标签/搜索