Most plasticity interventions in DRL reduce backdoor attack success rates while SAM increases them via gradient amplification; the work introduces an SCC framework and loss-sharpness detection indicator.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
AEM adaptively modulates response-level entropy in agentic RL to improve credit assignment and exploration-exploitation balance, yielding gains on ALFWorld, WebShop, and SWE-bench.
citing papers explorer
-
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
Most plasticity interventions in DRL reduce backdoor attack success rates while SAM increases them via gradient amplification; the work introduces an SCC framework and loss-sharpness detection indicator.
-
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning
AEM adaptively modulates response-level entropy in agentic RL to improve credit assignment and exploration-exploitation balance, yielding gains on ALFWorld, WebShop, and SWE-bench.