Cherry Li, Mary Phuong, and Max Siegel

arXiv:2509.26239

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging

cs.CL · 2026-04-29 · unverdicted · novelty 6.0

Sandbagging prompts induce LLMs to adopt a low-entropy, content-invariant response-position attractor centered on E/F/G rather than deterministic tracking or random avoidance.

