Yuexiang Zhai
Identifiers
- name variant Yuexiang Zhai 0.60 · backfill
Papers (19)
- DexHoldem: Playing Texas Hold'em with Dexterous Embodied System cs.RO · 2026 · author #7
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2684
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #2
- Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning cs.AI · 2024 · author #1
- Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement cs.LG · 2024 · author #2
- Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs cs.CV · 2024 · author #3
- LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models cs.CL · 2023 · author #6
- White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? cs.LG · 2023 · author #8
- RLIF: Interactive Imitation Learning as Reinforcement Learning cs.AI · 2023 · author #3
- Investigating the Catastrophic Forgetting in Multimodal Large Language Models cs.CL · 2023 · author #1
- Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning cs.LG · 2023 · author #2
- Closed-Loop Transcription via Convolutional Sparse Coding cs.CV · 2023 · author #8
- Understanding the Complexity Gains of Single-Task RL with a Curriculum cs.LG · 2022 · author #2
- Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity cs.LG · 2022 · author #3
- Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning cs.AI · 2021 · author #1
- Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training cs.CV · 2021 · author #3
- Analysis of the Optimization Landscapes for Overcomplete Representation Learning cs.LG · 2019 · author #2
- Complete Dictionary Learning via $\ell^4$-Norm Maximization over the Orthogonal Group cs.LG · 2019 · author #1
- Learning to Reconstruct 3D Manhattan Wireframes from a Single Image cs.CV · 2019 · author #3
Mentions
- 2405.10292 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2311.13110 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2401.06209 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2311.12996 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2402.15703 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2303.05479 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2309.10313 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2311.18232 #6 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2212.12809 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2302.09347 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2210.09579 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2107.03961 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2103.00673 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 1905.07482 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 1906.02435 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 1912.02427 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
- 2605.18727 #7 · arxiv_oai · confidence 0.70 Yuexiang Zhai
Frequent Coauthors
- Yi Ma 13 shared papers
- Sergey Levine 7 shared papers
- Shengbang Tong 6 shared papers
- Qing Qu 3 shared papers
- Saining Xie 3 shared papers
- Tianzhe Chu 3 shared papers
- Xiao Li 3 shared papers
- Aviral Kumar 2 shared papers
- Chong You 2 shared papers
- Druv Pai 2 shared papers
- Hao Bai 2 shared papers
- Ke Chen 2 shared papers
- Kelvin Xu 2 shared papers
- Mu Cai 2 shared papers
- Yann LeCun 2 shared papers
- Yichao Zhou 2 shared papers
- Zhihui Zhu 2 shared papers
- Aahil Mehta 1 shared papers
- Aaron Archer 1 shared papers
- Aaron Cohen 1 shared papers