Jilong Liu

Identifiers

  • name variant Jilong Liu 0.60 · backfill

Papers (4)

  1. Suppressing Forgery-Specific Shortcuts for Generalizable Deepfake Detection · cs.CV · 2026 · author #3
  2. CompassDPO: Dynamics-Controlled Direct Preference Optimization for Robust Safety Alignment · cs.LG · 2026 · author #1
  3. Controllable Value Alignment in Large Language Models through Neuron-Level Editing · cs.LG · 2026 · author #4
  4. Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control · cs.LG · 2026 · author #3

Mentions

  • 2606.01843 #3 · arxiv_oai · confidence 0.70 Jilong Liu
  • 2602.07356 #4 · arxiv_oai · confidence 0.70 Jilong Liu
  • 2603.07211 #1 · arxiv_oai · confidence 0.70 Jilong Liu
  • 2602.07340 #3 · arxiv_oai · confidence 0.70 Jilong Liu

Frequent Coauthors