Transfer Learning for Image-Based Malware Classification
read the original abstract
In this paper, we consider the problem of malware detection and classification based on image analysis. We convert executable files to images and apply image recognition using deep learning (DL) models. To train these models, we employ transfer learning based on existing DL models that have been pre-trained on massive image datasets. We carry out various experiments with this technique and compare its performance to that of an extremely simple machine learning technique, namely, k-nearest neighbors (\kNN). For our k-NN experiments, we use features extracted directly from executables, rather than image analysis. While our image-based DL technique performs well in the experiments, surprisingly, it is outperformed by k-NN. We show that DL models are better able to generalize the data, in the sense that they outperform k-NN in simulated zero-day experiments.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
MalwarePT: A Binary-Level Foundation Model for Malware Analysis
MalwarePT is a ModernBERT-style binary foundation model pretrained with masked language modeling on PE code bytes using BPE tokenization that transfers to API call prediction, functionality classification, and tempora...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.