pith. machine review for the scientific record. sign in

Qiaozhi He

Identifiers

No identifiers captured yet.

Papers (2)

  1. SPS: Steering Probability Squeezing for Better Exploration in Reinforcement Learning for Large Language Models cs.CL · 2026 · author #7
  2. SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams cs.CL · 2026 · author #8

Mentions

No mention provenance yet.

Frequent Coauthors