TY - GEN
T1 - Robustness of Image-Based Malware Classification Models trained with Generative Adversarial Networks
AU - Reilly, Ciaran
AU - O'Shaughnessy, Stephen
AU - Thorpe, Christina
N1 - Publisher Copyright:
© 2023 Owner/Author.
PY - 2023/6/14
Y1 - 2023/6/14
N2 - As malware continues to evolve, deep learning models are increasingly used for malware detection and classification, including image-based classification. However, adversarial attacks can be used to perturb images so as to evade detection by these models. This study investigates the effectiveness of training deep learning models with Generative Adversarial Network-generated data to improve their robustness against such attacks. Two image conversion methods, byteplot and space-filling curves, were used to represent the malware samples, and a ResNet-50 architecture was used to train models on the image datasets. The models were then tested against a projected gradient descent attack. It was found that without GAN-generated data, the models' prediction performance drastically decreased from 93-95% to 4.5% accuracy. However, the addition of adversarial images to the training data almost doubled the accuracy of the models. This study highlights the potential benefits of incorporating GAN-generated data in the training of deep learning models to improve their robustness against adversarial attacks.
AB - As malware continues to evolve, deep learning models are increasingly used for malware detection and classification, including image-based classification. However, adversarial attacks can be used to perturb images so as to evade detection by these models. This study investigates the effectiveness of training deep learning models with Generative Adversarial Network-generated data to improve their robustness against such attacks. Two image conversion methods, byteplot and space-filling curves, were used to represent the malware samples, and a ResNet-50 architecture was used to train models on the image datasets. The models were then tested against a projected gradient descent attack. It was found that without GAN-generated data, the models' prediction performance drastically decreased from 93-95% to 4.5% accuracy. However, the addition of adversarial images to the training data almost doubled the accuracy of the models. This study highlights the potential benefits of incorporating GAN-generated data in the training of deep learning models to improve their robustness against adversarial attacks.
KW - Adversarial AI
KW - Cybersecurity
KW - Generative Adversarial Networks
KW - Incident Response
KW - Malware Analysis
UR - http://www.scopus.com/inward/record.url?scp=85161441117&partnerID=8YFLogxK
U2 - 10.1145/3590777.3590792
DO - 10.1145/3590777.3590792
M3 - Conference contribution
AN - SCOPUS:85161441117
T3 - ACM International Conference Proceeding Series
SP - 92
EP - 99
BT - Proceedings of the 2023 European Interdisciplinary Cybersecurity Conference, EICC 2023
PB - Association for Computing Machinery
T2 - 2023 European Interdisciplinary Cybersecurity Conference, EICC 2023
Y2 - 14 June 2023 through 15 June 2023
ER -