pith. sign in

Lingzhe Zhang

Identifiers

  • name variant Lingzhe Zhang 0.60 · backfill

Papers (7)

  1. RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation cs.LG · 2026 · author #4
  2. How Much Parallelism Is "Free"? A Principle of Near-Free Parallelism for Parallel Decoding cs.PF · 2026 · author #2
  3. From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery cs.CE · 2026 · author #1
  4. Towards In-Depth Root Cause Localization for Microservices with Multi-Agent Recursion-of-Thought cs.SE · 2026 · author #1
  5. Towards Robust LLM Post-Training: Automatic Failure Management for Reinforcement Fine-Tuning cs.SE · 2026 · author #1
  6. E2E-REME: Towards End-to-End Microservices Auto-Remediation via Experience-Simulation Reinforcement Fine-Tuning cs.SE · 2026 · author #1
  7. d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models cs.CL · 2025 · author #7

Mentions

  • 2606.11709 #4 · arxiv_oai · confidence 0.70 Lingzhe Zhang
  • 2605.30851 #2 · arxiv_oai · confidence 0.70 Lingzhe Zhang
  • 2605.15412 #1 · arxiv_oai · confidence 0.70 Lingzhe Zhang

Frequent Coauthors