2024 | Original Paper | Book Chapter

LPSD: Low-Rank Plus Sparse Decomposition for Highly Compressed CNN Models

Authors: Kuei-Hsiang Huang, Cheng-Yu Sie, Jhong-En Lin, Che-Rung Lee

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer Nature Singapore

Abstract

Low-rank decomposition, which exploits and eliminates linear dependency within a tensor, is often used as a structured pruning method for deep convolutional neural networks. However, model accuracy declines rapidly once the compression ratio exceeds a threshold. We observed that a small number of sparse elements can significantly recover the accuracy of highly compressed CNN models. Based on this observation, we developed a novel method, called LPSD (Low-rank Plus Sparse Decomposition), that decomposes a CNN weight tensor into the sum of a low-rank component and a sparse component, which better maintains accuracy at high compression ratios. For a pretrained model, the network structure of each layer is split into two branches: one for the low-rank part and one for the sparse part. LPSD adapts an alternating approximation algorithm that minimizes the global error and the local error in turn. An exhaustive search method with pruning is designed to find the optimal group number, ranks, and sparsity. Experimental results demonstrate that in most scenarios, LPSD achieves better accuracy than state-of-the-art methods when the model is highly compressed.
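The core idea described in the abstract, approximating a weight tensor W by a low-rank component L plus a sparse residual S, can be sketched in a few lines of NumPy. This is a simplified, single-pass illustration (truncated SVD for L, magnitude thresholding for the residual to obtain S), not the paper's alternating approximation algorithm or its search over group numbers, ranks, and sparsity; the function name and shapes are illustrative.

```python
import numpy as np

def low_rank_plus_sparse(W, rank, sparsity):
    """Approximate W ~ L + S, where L has the given rank and S keeps
    roughly `sparsity` largest-magnitude residual entries (ties may add a few more)."""
    # Low-rank part via truncated SVD.
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    L = (U[:, :rank] * s[:rank]) @ Vt[:rank, :]
    # Sparse part: keep the largest-magnitude entries of the residual.
    R = W - L
    if sparsity > 0:
        thresh = np.partition(np.abs(R).ravel(), -sparsity)[-sparsity]
        S = np.where(np.abs(R) >= thresh, R, 0.0)
    else:
        S = np.zeros_like(R)
    return L, S

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64))
L, S = low_rank_plus_sparse(W, rank=8, sparsity=128)
# The sparse correction can only reduce the approximation error,
# since S cancels the largest residual entries exactly.
assert np.linalg.norm(W - (L + S)) <= np.linalg.norm(W - L)
```

This mirrors the motivation in the abstract: the low-rank branch alone (`L`) loses accuracy at high compression, and a small sparse correction (`S`) recovers the largest errors at little extra storage cost.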


Metadata
Title
LPSD: Low-Rank Plus Sparse Decomposition for Highly Compressed CNN Models
Authors
Kuei-Hsiang Huang
Cheng-Yu Sie
Jhong-En Lin
Che-Rung Lee
Copyright year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2242-6_28
