pith. sign in

← back to paper

Review history

arxiv: 2509.21979 · 2 revisions

Benchmarking and Mitigating Sycophancy in Medical Vision Language Models

  1. 2026-05-21 UNVERDICTED LOW v0.9.0 novelty 6.0
    56307 ms 5712 in 1266 out 2026-05-21T21:21:30.010020+00:00
  2. 2026-05-18 UNVERDICTED LOW v0.9.0 novelty 6.0
    36890 ms 5703 in 1382 out 2026-05-18T13:58:53.537348+00:00