Yuexiang Zhai — Pith Author Registry

Identifiers

name variant Yuexiang Zhai 0.60 · backfill

Papers (19)

DexHoldem: Playing Texas Hold'em with Dexterous Embodied System cs.RO · 2026 · author #7
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2684
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #2
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning cs.AI · 2024 · author #1
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement cs.LG · 2024 · author #2
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs cs.CV · 2024 · author #3
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models cs.CL · 2023 · author #6
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? cs.LG · 2023 · author #8
RLIF: Interactive Imitation Learning as Reinforcement Learning cs.AI · 2023 · author #3
Investigating the Catastrophic Forgetting in Multimodal Large Language Models cs.CL · 2023 · author #1
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning cs.LG · 2023 · author #2
Closed-Loop Transcription via Convolutional Sparse Coding cs.CV · 2023 · author #8
Understanding the Complexity Gains of Single-Task RL with a Curriculum cs.LG · 2022 · author #2
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity cs.LG · 2022 · author #3
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning cs.AI · 2021 · author #1
Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training cs.CV · 2021 · author #3
Analysis of the Optimization Landscapes for Overcomplete Representation Learning cs.LG · 2019 · author #2
Complete Dictionary Learning via $\ell^4$-Norm Maximization over the Orthogonal Group cs.LG · 2019 · author #1
Learning to Reconstruct 3D Manhattan Wireframes from a Single Image cs.CV · 2019 · author #3

Mentions

2405.10292 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2311.13110 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2401.06209 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2311.12996 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2402.15703 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2303.05479 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2309.10313 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2311.18232 #6 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2212.12809 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2302.09347 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2210.09579 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2107.03961 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2103.00673 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
1905.07482 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
1906.02435 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
1912.02427 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
2605.18727 #7 · arxiv_oai · confidence 0.70 Yuexiang Zhai

Frequent Coauthors

Yi Ma 13 shared papers
Sergey Levine 7 shared papers
Shengbang Tong 6 shared papers
Qing Qu 3 shared papers
Saining Xie 3 shared papers
Tianzhe Chu 3 shared papers
Xiao Li 3 shared papers
Aviral Kumar 2 shared papers
Chong You 2 shared papers
Druv Pai 2 shared papers
Hao Bai 2 shared papers
Ke Chen 2 shared papers
Kelvin Xu 2 shared papers
Mu Cai 2 shared papers
Yann LeCun 2 shared papers
Yichao Zhou 2 shared papers
Zhihui Zhu 2 shared papers
Aahil Mehta 1 shared paper
Aaron Archer 1 shared paper
Aaron Cohen 1 shared paper