文章目录
- Network Pruning
- Knowledge Distillation(知识蒸馏)
- Parameter Quantization
- Architecture Design
NETWORK COMPRESSION
Network Pruning
修剪方法:
修剪neurons
Knowledge Distillation(知识蒸馏)
Parameter Quantization
Architecture Design
Low rank approximation
Dynamic Computation