Yelong Shen

Identifiers

  • name variant Yelong Shen 0.60 · backfill

Papers (27)

  1. Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior cs.LG · 2026 · author #11
  2. Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection cs.CR · 2026 · author #4
  3. Orchard: An Open-Source Agentic Modeling Framework cs.AI · 2026 · author #10
  4. Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation cs.CL · 2026 · author #5
  5. Rethinking Language Model Scaling under Transferable Hypersphere Optimization cs.LG · 2026 · author #3
  6. ThetaEvolve: Test-time Learning on Open Problems cs.LG · 2025 · author #16
  7. Reinforcement Learning for Reasoning in Large Language Models with One Training Example cs.LG · 2025 · author #14
  8. Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs cs.CL · 2025 · author #55
  9. Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone cs.CL · 2024 · author #92
  10. ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving cs.CL · 2023 · author #4
  11. CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing cs.CL · 2023 · author #4
  12. LoRA: Low-Rank Adaptation of Large Language Models cs.CL · 2021 · author #2
  13. Unsupervised Deep Structured Semantic Models for Commonsense Reasoning cs.CL · 2019 · author #3
  14. StoryGAN: A Sequential Conditional GAN for Story Visualization cs.CV · 2018 · author #3
  15. Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension cs.CL · 2018 · author #3
  16. M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search cs.AI · 2018 · author #1
  17. Stochastic Answer Networks for Machine Reading Comprehension cs.CL · 2017 · author #2
  18. Language-Based Image Editing with Recurrent Attentive Models cs.CV · 2017 · author #2
  19. FusionNet: Fusing via Fully-Aware Attention with Application to Machine Comprehension cs.CL · 2017 · author #3
  20. Dynamic Fusion Networks for Machine Reading Comprehension cs.CL · 2017 · author #4
  21. An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks cs.CL · 2017 · author #1
  22. Link Prediction using Embedded Knowledge Graphs cs.AI · 2016 · author #1
  23. ReasoNet: Learning to Stop Reading in Machine Comprehension cs.LG · 2016 · author #1
  24. End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture cs.LG · 2015 · author #3
  25. A Deep Embedding Model for Co-occurrence Learning cs.LG · 2015 · author #1
  26. Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval cs.CL · 2015 · author #3
  27. Limiting the Neighborhood: De-Small-World Network for Outbreak Prevention cs.SI · 2013 · author #2

Mentions

  • 1508.03398 #3 · backfill · confidence 0.70 Yelong Shen
  • 1504.02824 #1 · backfill · confidence 0.70 Yelong Shen
  • 1502.06922 #3 · backfill · confidence 0.70 Yelong Shen
  • 2605.26797 #11 · arxiv_oai · confidence 0.70 Yelong Shen
  • 1305.0513 #2 · backfill · confidence 0.70 Yelong Shen
  • 2605.23175 #4 · arxiv_oai · confidence 0.70 Yelong Shen
  • 2605.15040 #10 · arxiv_oai · confidence 0.70 Yelong Shen
  • 2309.17452 #4 · arxiv_oai · confidence 0.70 Yelong Shen
  • 2511.23473 #16 · arxiv_oai · confidence 0.70 Yelong Shen
  • 2504.20571 #14 · arxiv_oai · confidence 0.70 Yelong Shen

Frequent Coauthors