pith. sign in

Victor Gillioz

Identifiers

  • name variant Victor Gillioz 0.60 · backfill

Papers (2)

  1. Training Deliberative Monitors for Black-Box Scheming Detection cs.CL · 2026 · author #3
  2. Shifting the Gradient: Understanding How Defensive Training Methods Protect Language Model Integrity cs.LG · 2026 · author #2

Mentions

  • 2605.29601 #3 · arxiv_oai · confidence 0.70 Victor Gillioz

Frequent Coauthors