pith. sign in

Chenggang Zhao

Identifiers

  • name variant Chenggang Zhao 0.60 · backfill

Papers (9)

  1. mHC: Manifold-Constrained Hyper-Connections cs.CL · 2025 · author #4
  2. DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models cs.CL · 2025 · author #13
  3. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning cs.CL · 2025 · author #26
  4. DeepSeek-V3 Technical Report cs.CL · 2024 · author #8
  5. Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts cs.LG · 2024 · author #3
  6. DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence cs.SE · 2024 · author #37
  7. DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model cs.CL · 2024 · author #7
  8. DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models cs.CL · 2024 · author #3
  9. DeepSeek LLM: Scaling Open-Source Language Models with Longtermism cs.CL · 2024 · author #81

Mentions

  • 2406.11931 #37 · arxiv_oai · confidence 0.70 Chenggang Zhao

Frequent Coauthors