Yaojie Lu

Identifiers

  • name variant Yaojie Lu 0.60 · backfill

Papers (23)

  1. Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination cs.CL · 2026 · author #6
  2. Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation cs.CL · 2026 · author #10
  3. LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents cs.CL · 2026 · author #5
  4. MetaphorVU: Towards Metaphorical Video Understanding cs.CV · 2026 · author #9
  5. Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation cs.CV · 2026 · author #7
  6. Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards cs.CL · 2026 · author #9
  7. LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation cs.SE · 2026 · author #12
  8. ScaleBox: Enabling High-Fidelity and Scalable Code Verification for Large Language Models cs.SE · 2026 · author #8
  9. All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG cs.CL · 2026 · author #8
  10. Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models cs.AI · 2026 · author #3
  11. Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces cs.CL · 2026 · author #9
  12. P^2O: Joint Policy and Prompt Optimization cs.LG · 2026 · author #5
  13. Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards cs.LG · 2026 · author #4
  14. DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation cs.AI · 2026 · author #7
  15. Coupled Variational Reinforcement Learning for Language Model General Reasoning cs.CL · 2025 · author #8
  16. MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning cs.CL · 2025 · author #5
  17. When Models Outthink Their Safety: Unveiling and Mitigating Self-Jailbreak in Large Reasoning Models cs.AI · 2025 · author #6
  18. Across Programming Language Silos: A Study on Cross-Lingual Retrieval-augmented Code Generation cs.SE · 2025 · author #5
  19. Cost-sensitive Regularization for Label Confusion-aware Event Detection cs.CL · 2019 · author #2
  20. Sequence-to-Nuggets: Nested Entity Mention Detection via Anchor-Region Networks cs.CL · 2019 · author #2
  21. Adaptive Scaling for Sparse Detection in Information Extraction cs.CL · 2018 · author #2
  22. Nugget Proposal Networks for Chinese Event Detection cs.CL · 2018 · author #2
  23. Variational Recurrent Neural Machine Translation cs.CL · 2018 · author #4

Mentions

  • 2605.31058 #6 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2605.30833 #10 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2605.29559 #5 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2603.09117 #4 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2605.25461 #9 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2512.12576 #8 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2604.08362 #9 · arxiv_oai · confidence 0.70 Yaojie Lu
  • 2605.18740 #7 · arxiv_oai · confidence 0.70 Yaojie Lu

Frequent Coauthors