Let r_sum denote the negated cumulative reward of the replayed candidate sequence, and let r_ref be a reference distribution obtained from 500 random action sequences

· 2026

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Plan2Cleanse: Test-Time Backdoor Defense via Monte-Carlo Planning in Deep Reinforcement Learning

cs.LG · 2026-05-10 · unverdicted · novelty 6.0

Plan2Cleanse frames RL backdoor detection as a Monte Carlo planning problem to achieve over 61 percentage point gains in trigger detection and improved win rates in competitive environments.

citing papers explorer

Plan2Cleanse: Test-Time Backdoor Defense via Monte-Carlo Planning in Deep Reinforcement Learning

cs.LG · 2026-05-10 · unverdicted · none · ref 17
