Yuandong Tian

Identifiers

  • name variant Yuandong Tian 0.60 · backfill

Papers (30)

  1. Neural Computers cs.LG · 2026 · author #16
  2. AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications cs.AI · 2026 · author #11
  3. SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models cs.CL · 2025 · author #11
  4. Positional Encoding via Token-Aware Phase Attention cs.CL · 2025 · author #5
  5. Deep Think with Confidence cs.LG · 2025 · author #3
  6. Training Large Language Models to Reason in a Continuous Latent Space cs.CL · 2024 · author #7
  7. SpinQuant: LLM quantization with learned rotations cs.LG · 2024 · author #8
  8. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection cs.LG · 2024 · author #6
  9. Efficient Streaming Language Models with Attention Sinks cs.CL · 2023 · author #2
  10. Extending Context Window of Large Language Models via Positional Interpolation cs.CL · 2023 · author #4
  11. H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models cs.LG · 2023 · author #8
  12. A Neural-based Program Decompiler cs.PL · 2019 · author #5
  13. Luck Matters: Understanding Training Dynamics of Deep ReLU Networks cs.LG · 2019 · author #1
  14. FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search cs.CV · 2018 · author #7
  15. Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search cs.CV · 2018 · author #4
  16. M$^3$RL: Mind-aware Multi-agent Management Reinforcement Learning cs.AI · 2018 · author #2
  17. Learning and Planning with a Semantic Model cs.LG · 2018 · author #6
  18. A theoretical framework for deep locally connected ReLU network cs.LG · 2018 · author #1
  19. Building Generalizable Agents with a Realistic and Rich 3D Environment cs.LG · 2018 · author #4
  20. CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication cs.CV · 2017 · author #6
  21. Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima cs.LG · 2017 · author #3
  22. When is a Convolutional Filter Easy To Learn? cs.LG · 2017 · author #3
  23. ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games cs.AI · 2017 · author #1
  24. Channel-Recurrent Autoencoding for Image Modeling cs.LG · 2017 · author #3
  25. An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis cs.LG · 2017 · author #1
  26. Single Image 3D Interpreter Network cs.CV · 2016 · author #4
  27. Simple Baseline for Visual Question Answering cs.CV · 2015 · author #2
  28. Better Computer Go Player with Neural Network and Long-term Prediction cs.LG · 2015 · author #1
  29. Semantic Amodal Segmentation cs.CV · 2015 · author #2
  30. Convolutional networks and learning invariant to homogeneous multiplicative scalings cs.LG · 2015 · author #5

Mentions

  • 2602.22769 #11 · arxiv_oai · confidence 0.70 Yuandong Tian
  • 2306.14048 #8 · arxiv_oai · confidence 0.70 Yuandong Tian
  • 2403.03507 #6 · arxiv_oai · confidence 0.70 Yuandong Tian
  • 2508.15260 #3 · arxiv_oai · confidence 0.70 Yuandong Tian
  • 2405.16406 #8 · arxiv_oai · confidence 0.70 Yuandong Tian

Frequent Coauthors