Model Compression – Computer Vision & Graphics Lab

ICIP2021 – Comprehensive Online Network Pruning via Learnable Scaling Factors

By May 20, 2021November 18, 2023

Muhammad Umair Haider and Murtaza Taj Abstract: One of the major challenges in deploying deep neural network architectures is their size which has an adverse effect on their inference time and memory requirements. Deep CNNs can either be pruned width-wise by removing filters or depth-wise by removing layers and blocks. Width wise pruning (filter pruning)…