pith. sign in

Rongxiang Weng

Identifiers

  • name variant Rongxiang Weng 0.60 · backfill

Papers (9)

  1. Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training cs.CL · 2026 · author #5
  2. LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance cs.CL · 2026 · author #8
  3. Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation cs.CL · 2026 · author #5
  4. Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization cs.LG · 2026 · author #5
  5. The Rise and Potential of Large Language Model Based Agents: A Survey cs.AI · 2023 · author #22
  6. Secrets of RLHF in Large Language Models Part I: PPO cs.CL · 2023 · author #19
  7. Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering cs.CL · 2019 · author #3
  8. Learning to Discriminate Noises for Incorporating External Information in Neural Machine Translation cs.CL · 2018 · author #4
  9. Neural Machine Translation with Word Predictions cs.CL · 2017 · author #1

Mentions

  • 2606.05610 #5 · arxiv_oai · confidence 0.70 Rongxiang Weng
  • 2605.13643 #5 · arxiv_oai · confidence 0.70 Rongxiang Weng
  • 2605.22567 #8 · arxiv_oai · confidence 0.70 Rongxiang Weng
  • 2307.04964 #19 · arxiv_oai · confidence 0.70 Rongxiang Weng

Frequent Coauthors