pith. sign in

arxiv: 2605.24294 · v1 · pith:Q337LXGBnew · submitted 2026-05-22 · 💻 cs.CR · cs.AI· cs.LG

Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection

classification 💻 cs.CR cs.AIcs.LG
keywords maintenancedriftadaptationandroidframeworklatentmalwareconcept
0
0 comments X
read the original abstract

Android malware detectors often degrade after deployment because of concept drift, while full retraining at each maintenance step is costly. We propose a chronological adaptive maintenance framework that models deployment-time maintenance as a sequential decision problem. The framework learns a stable latent representation through self-supervised learning during initialization, freezes the encoder, measures latent drift in the fixed representation space, and performs lightweight downstream adaptation using a trainable adapter and classification head. A proximal policy optimization controller selects low-cost maintenance actions based on the detector state, including current utility, retention on a fixed memory set, latent drift indicators, and update cost. We evaluate the framework under a causal deployment-style protocol on emulator and real Android malware datasets with static and dynamic features. Results show that the RL controller provides a strong cost-aware adaptation strategy, consistently remaining among the top-performing policies while achieving a favorable balance between temporal performance, memory retention, and maintenance cost under non-stationary deployment conditions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.