pith. sign in

Yuexiang Zhai

Identifiers

  • name variant Yuexiang Zhai 0.60 · backfill

Papers (19)

  1. DexHoldem: Playing Texas Hold'em with Dexterous Embodied System cs.RO · 2026 · author #7
  2. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2684
  3. SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #2
  4. Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning cs.AI · 2024 · author #1
  5. Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement cs.LG · 2024 · author #2
  6. Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs cs.CV · 2024 · author #3
  7. LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models cs.CL · 2023 · author #6
  8. White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? cs.LG · 2023 · author #8
  9. RLIF: Interactive Imitation Learning as Reinforcement Learning cs.AI · 2023 · author #3
  10. Investigating the Catastrophic Forgetting in Multimodal Large Language Models cs.CL · 2023 · author #1
  11. Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning cs.LG · 2023 · author #2
  12. Closed-Loop Transcription via Convolutional Sparse Coding cs.CV · 2023 · author #8
  13. Understanding the Complexity Gains of Single-Task RL with a Curriculum cs.LG · 2022 · author #2
  14. Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity cs.LG · 2022 · author #3
  15. Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning cs.AI · 2021 · author #1
  16. Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training cs.CV · 2021 · author #3
  17. Analysis of the Optimization Landscapes for Overcomplete Representation Learning cs.LG · 2019 · author #2
  18. Complete Dictionary Learning via $\ell^4$-Norm Maximization over the Orthogonal Group cs.LG · 2019 · author #1
  19. Learning to Reconstruct 3D Manhattan Wireframes from a Single Image cs.CV · 2019 · author #3

Mentions

  • 2405.10292 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2311.13110 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2401.06209 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2311.12996 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2402.15703 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2303.05479 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2309.10313 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2311.18232 #6 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2212.12809 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2302.09347 #8 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2210.09579 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2107.03961 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2103.00673 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 1905.07482 #3 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 1906.02435 #1 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 1912.02427 #2 · arxiv_oai · confidence 0.70 Yuexiang Zhai
  • 2605.18727 #7 · arxiv_oai · confidence 0.70 Yuexiang Zhai

Frequent Coauthors