模型压缩:Deep Compression

第一步weight pruning 第二步trained quantization and weight sharing 第三步 Huffman coding 实验分析之压缩几十倍从何而来 实验分析之极致量化 《Deep Compression Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffm
相关文章
相关标签/搜索