深度神经网络加速和压缩

模型加速与压缩方法分类总结 • Low-Rank • Pruning • Quantization • Knowledge Distillation • Compact Network Design   Low-Rank Previous low-rank based methods: • SVD - Zhang et al., “Accelerating Very Deep Convolutio
相关文章
相关标签/搜索