pith. sign in

Jiazheng Zhang

Identifiers

  • name variant Jiazheng Zhang 0.60 · backfill

Papers (10)

  1. Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection cs.AI · 2026 · author #7
  2. Prefix-Adaptive Block Diffusion for Efficient Document Recognition cs.CV · 2026 · author #5
  3. Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control cs.LG · 2026 · author #1
  4. CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #11
  5. EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training cs.LG · 2026 · author #5
  6. AgentV-RL: Scaling Reward Modeling with Agentic Verifier cs.CL · 2026 · author #1
  7. SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #6
  8. DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training cs.LG · 2026 · author #12
  9. DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training cs.LG · 2025 · author #14
  10. Correlated diffusion of colloidal particles near a liquid-liquid interface cond-mat.soft · 2013 · author #4

Mentions

  • 2606.03937 #7 · arxiv_oai · confidence 0.70 Jiazheng Zhang
  • 2602.12984 #6 · arxiv_oai · confidence 0.70 Jiazheng Zhang
  • 1304.3916 #4 · backfill · confidence 0.70 Jiazheng Zhang
  • 2605.16861 #5 · arxiv_oai · confidence 0.70 Jiazheng Zhang
  • 2605.11775 #1 · backfill · confidence 0.70 Jiazheng Zhang

Frequent Coauthors