Title resolution pending

Schulman, John, Wolski, Filip, Dhariwal, Prafulla, Radford, Alec, Klimov, Oleg , journal=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Most plasticity interventions in DRL reduce backdoor attack success rates while SAM increases them via gradient amplification; the work introduces an SCC framework and loss-sharpness detection indicator.

AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning

cs.AI · 2026-05-01 · unverdicted · novelty 4.0 · 2 refs

AEM adaptively modulates response-level entropy in agentic RL to improve credit assignment and exploration-exploitation balance, yielding gains on ALFWorld, WebShop, and SWE-bench.

citing papers explorer

Showing 2 of 2 citing papers.

Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning cs.LG · 2026-05-14 · unverdicted · none · ref 17
Most plasticity interventions in DRL reduce backdoor attack success rates while SAM increases them via gradient amplification; the work introduces an SCC framework and loss-sharpness detection indicator.
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning cs.AI · 2026-05-01 · unverdicted · none · ref 5 · 2 links
AEM adaptively modulates response-level entropy in agentic RL to improve credit assignment and exploration-exploitation balance, yielding gains on ALFWorld, WebShop, and SWE-bench.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer