Sorry, you need to enable JavaScript to visit this website.

CORRELATION-AWARE JOINT PRUNING-QUANTIZATION USING GRAPH NEURAL NETWORKS

DOI:
10.60864/07kr-kd97
Citation Author(s):
Usman Ullah Sheikh, Mohammed Sultan Mohammed, Jeevan Sirkunan, Muhammad Nadzir Marsono
Submitted by:
Muhammad Nor Az...
Last updated:
12 November 2024 - 10:09pm
Document Type:
Poster
Document Year:
2024
Event:
Presenters:
Muhammad Nor Azzafri Nor-Azman
Categories:
Keywords:
 

Deep learning in image classification has achieved remarkable success but at the cost of high resource demands. Model compression through automatic joint pruning-quantization addresses this issue, yet most existing techniques overlook a critical aspect: layer correlations. These correlations are essential as they expose redundant computations across layers, and leveraging them facilitates efficient design space exploration. This study employs Graph Neural Networks (GNN) to learn these inter-layer relationships, thereby optimizing the pruning-quantization strategy for the targeted model. This approach has yielded a 99.36% reduction in complexity for ResNet20 on CIFAR-10, with only a minimal 0.11% drop in accuracy. Furthermore, the integration of GNN sped up the convergence process, reducing iterations by 2.46 times on average, compared to methods without GNN.

up
0 users have voted: