Tag: Model Compression

Comprehensive Online Network Pruning via Learnable Scaling Factors

Comprehensive Online Network Pruning via Learnable Scaling Factors

| May 20, 2021 | 0 Comments

Muhammad Umair Haider and Murtaza Taj Abstract: One of the major challenges in deploying deep neural network architectures is their size which has an adverse effect on their inference time and memory requirements. Deep CNNs can either be pruned width-wise by removing filters or depth-wise by removing layers and blocks. Width wise pruning (filter pruning) […]

Continue Reading