arXiv preprint arXiv:2505.22271 (2025)

Yu, Y · 2025 · arXiv 2505.22271

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning

cs.LG · 2026-04-23 · unverdicted · novelty 6.0

DDRL reduces spurious reward noise in test-time RL for math by excluding ambiguous samples, using fixed advantages, and adding consensus-based updates, outperforming prior TTRL methods on math benchmarks.

On the Vulnerability of Parameter-Level Defenses to Model Merging

cs.LG · 2026-06-29 · unverdicted · novelty 5.0

Parameter-level defenses for model merging are vulnerable to Anchor-Guided Attack because protected weights are dominated by the pretrained model, and a new defense ARF is introduced to counter it.

citing papers explorer


Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning · cs.LG · 2026-04-23 · unverdicted · none · ref 14
On the Vulnerability of Parameter-Level Defenses to Model Merging · cs.LG · 2026-06-29 · unverdicted · none · ref 47
