Skip to main content
Top

2024 | OriginalPaper | Chapter

Improving Unbalanced Security X-Ray Image Classification Using VGG16 and AlexNet with Z-Score Normalization and Augmentation

Authors : Diao Qi, Apri Junaidi, Chan Weng Howe, Azlan Mohd Zain

Published in: Proceedings of the 4th International Conference on Electronics, Biomedical Engineering, and Health Informatics

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Addressing the challenge of unbalanced data sets in convolutional neural network (CNN) models for image recognition, this study aims to investigate the impact of data augmentation and normalization. The problem lies in the limited generalization of the model due to class-based data differences, hence the need for several techniques in data preprocessing such as data augmentation and normalization. The main contribution of this research is a comprehensive analysis of the effectiveness of data augmentation and normalization techniques in improving model performance. The research utilized the AlexNet and VGG16 architectures and conducted extensive experiments on data sets with varying degrees of imbalance. Data augmentation generates additional examples, while normalization reduces convergence issues. The results show that training the AlexNet model without augmentation results in a low accuracy of 0.24, underscoring the challenges posed by skewed data distributions. In contrast, augmented data substantially improves performance, with AlexNet achieving an accuracy of 0.91 and VGG16 achieving 0.84. In addition, normalized data also made a positive contribution, showing an accuracy of 0.74 for AlexNet and 0.67 for VGG16. In conclusion, data augmentation and normalization techniques are essential in reducing the effects of data imbalance, thereby improving the generalizability of the models. The improved accuracy of the data using normalization techniques indicates the ability of the model to read the data after normalization. This study underscores the importance of preprocessing strategies in optimizing model performance and advancing the field of deep learning in image recognition.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
4.
go back to reference Gusmão G, Raposo A, Oliveira R, de, Barbosa C (2022) Treating dataset imbalance in fetal echocardiography classification. In: Communication papers of the 17th conference on computer science and intelligence systems, pp 3–9. PTI. https://doi.org/10.15439/2022f56 Gusmão G, Raposo A, Oliveira R, de, Barbosa C (2022) Treating dataset imbalance in fetal echocardiography classification. In: Communication papers of the 17th conference on computer science and intelligence systems, pp 3–9. PTI. https://​doi.​org/​10.​15439/​2022f56
6.
10.
go back to reference Sadhukhan P, Pakrashi A, Palit S, Mac Namee B (2021) Integrating unsupervised clustering and label-specific oversampling to tackle imbalanced multi-label data Sadhukhan P, Pakrashi A, Palit S, Mac Namee B (2021) Integrating unsupervised clustering and label-specific oversampling to tackle imbalanced multi-label data
12.
go back to reference Guo Y, Fang Z, Zhang S, Dong H (2021) Classification of potato early blight with unbalanced data based on GhostNet. In: 2021 3rd International academic exchange conference on science and technology innovation, IAECST 2021, pp 559–563. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IAECST54258.2021.9695532 Guo Y, Fang Z, Zhang S, Dong H (2021) Classification of potato early blight with unbalanced data based on GhostNet. In: 2021 3rd International academic exchange conference on science and technology innovation, IAECST 2021, pp 559–563. Institute of Electrical and Electronics Engineers Inc. https://​doi.​org/​10.​1109/​IAECST54258.​2021.​9695532
13.
14.
go back to reference Shafay M, Hassan T, Ahmed A, Velayudhan D, Dias J, Werghi N (2022) Programmable broad learning system to detect concealed and imbalanced baggage threats. In: 2022 2nd international conference on digital futures and transformative technologies, ICoDT2 2022. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICoDT255437.2022.9787420 Shafay M, Hassan T, Ahmed A, Velayudhan D, Dias J, Werghi N (2022) Programmable broad learning system to detect concealed and imbalanced baggage threats. In: 2022 2nd international conference on digital futures and transformative technologies, ICoDT2 2022. Institute of Electrical and Electronics Engineers Inc. https://​doi.​org/​10.​1109/​ICoDT255437.​2022.​9787420
18.
go back to reference Qi S, He M, Hoang R, Zhou Y, Namadi P, Tom B, Sandhu P, Bai Z, Chung F, Ding Z, Anderson J, Roh DM, Huynh V (2023) Salinity modeling using deep learning with data augmentation and transfer learning. Water (Switzerland), 15. https://doi.org/10.3390/w15132482 Qi S, He M, Hoang R, Zhou Y, Namadi P, Tom B, Sandhu P, Bai Z, Chung F, Ding Z, Anderson J, Roh DM, Huynh V (2023) Salinity modeling using deep learning with data augmentation and transfer learning. Water (Switzerland), 15. https://​doi.​org/​10.​3390/​w15132482
19.
go back to reference Junaidi A, Tanjung NAF, Wijayanto S, Lasama J, Iskandar AR (2021) Overfitting problem in images classification for egg incubator using convolutional neural network. In: 2021 9th international conference on cyber and IT service management, CITSM 2021. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CITSM52892.2021.9588815 Junaidi A, Tanjung NAF, Wijayanto S, Lasama J, Iskandar AR (2021) Overfitting problem in images classification for egg incubator using convolutional neural network. In: 2021 9th international conference on cyber and IT service management, CITSM 2021. Institute of Electrical and Electronics Engineers Inc. https://​doi.​org/​10.​1109/​CITSM52892.​2021.​9588815
20.
go back to reference Zhang F, Zhang Z (2023) Information transfer in multitask learning. Data Augment Beyond 37:16220–16221 Zhang F, Zhang Z (2023) Information transfer in multitask learning. Data Augment Beyond 37:16220–16221
25.
go back to reference Krizhevsky A, Sutskever I, Hinton GE (2021) Imagenet classification with deep convolutional neural networks Krizhevsky A, Sutskever I, Hinton GE (2021) Imagenet classification with deep convolutional neural networks
27.
go back to reference Smith M, Li Z, Landry L, Merz KM, Li P (2023) Consequences of overfitting the van der Waals Radii of ions Smith M, Li Z, Landry L, Merz KM, Li P (2023) Consequences of overfitting the van der Waals Radii of ions
29.
go back to reference Junaidi A, Lasama J, Adhinata FD, Iskandar AR (2021) Image classification for egg incubator using transfer learning of VGG16 and VGG19. In: 10th IEEE international conference on communication, networks and satellite, comnetsat 2021—Proceedings. pp. 324–328. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/COMNETSAT53002.2021.9530826 Junaidi A, Lasama J, Adhinata FD, Iskandar AR (2021) Image classification for egg incubator using transfer learning of VGG16 and VGG19. In: 10th IEEE international conference on communication, networks and satellite, comnetsat 2021—Proceedings. pp. 324–328. Institute of Electrical and Electronics Engineers Inc. https://​doi.​org/​10.​1109/​COMNETSAT53002.​2021.​9530826
30.
go back to reference Ruvinga C, Malathi D, Dorathi Jayaseeli JD (2020) Human concentration level recognition based on vgg16 cnn architecture. Int J Adv Sci Technol 29:1364–1373 Ruvinga C, Malathi D, Dorathi Jayaseeli JD (2020) Human concentration level recognition based on vgg16 cnn architecture. Int J Adv Sci Technol 29:1364–1373
31.
go back to reference Mungloo-Dilmohamud Z, Khan MHM, Jhumka K, Beedassy BN, Mungloo NZ, Peña-Reyes C (2022) Balancing data through data augmentation improves the generality of transfer learning for diabetic retinopathy classification. Appl Sci (Switzerland) 12. https://doi.org/10.3390/app12115363 Mungloo-Dilmohamud Z, Khan MHM, Jhumka K, Beedassy BN, Mungloo NZ, Peña-Reyes C (2022) Balancing data through data augmentation improves the generality of transfer learning for diabetic retinopathy classification. Appl Sci (Switzerland) 12. https://​doi.​org/​10.​3390/​app12115363
Metadata
Title
Improving Unbalanced Security X-Ray Image Classification Using VGG16 and AlexNet with Z-Score Normalization and Augmentation
Authors
Diao Qi
Apri Junaidi
Chan Weng Howe
Azlan Mohd Zain
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-1463-6_14