Yuhang Zang
Identifiers
- name variant Yuhang Zang 0.60 · backfill
Papers (21)
- CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning cs.CV · 2026 · author #4
- AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO cs.CV · 2026 · author #5
- OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs cs.CV · 2026 · author #3
- Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition cs.CV · 2026 · author #9
- Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents cs.PL · 2026 · author #2
- ETCHR: Editing To Clarify and Harness Reasoning cs.CV · 2026 · author #4
- SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction cs.CV · 2026 · author #4
- WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation cs.CL · 2026 · author #17
- Visual-ERM: Reward Modeling for Visual Equivalence cs.CV · 2026 · author #10
- GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking cs.CV · 2026 · author #4
- MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing cs.CV · 2025 · author #43
- Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning cs.CV · 2025 · author #3
- Unified Reward Model for Multimodal Understanding and Generation cs.CV · 2025 · author #2
- Visual-RFT: Visual Reinforcement Fine-Tuning cs.CV · 2025 · author #3
- PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction cs.CV · 2024 · author #6
- InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #3
- Are We on the Right Way for Evaluating Large Vision-Language Models? cs.CV · 2024 · author #5
- InternLM2 Technical Report cs.CL · 2024 · author #79
- RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition cs.CV · 2024 · author #3
- InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model cs.CV · 2024 · author #3
- Scene Text Detection with Supervised Pyramid Context Network cs.CV · 2018 · author #2
Mentions
- 2606.09393 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2606.06828 #5 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2606.03890 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2606.01636 #9 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2605.27955 #2 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2605.23897 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2605.20110 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2403.13805 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2509.22186 #43 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2407.03320 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
- 2401.16420 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
Frequent Coauthors
- Jiaqi Wang 16 shared papers
- Dahua Lin 13 shared papers
- Xiaoyi Dong 9 shared papers
- Haodong Duan 6 shared papers
- Kai Chen 6 shared papers
- Pan Zhang 6 shared papers
- Wei Li 6 shared papers
- Yibin Wang 6 shared papers
- Conghui He 5 shared papers
- Yuhang Cao 5 shared papers
- Yu Qiao 5 shared papers
- Bin Wang 4 shared papers
- Jiazi Bu 4 shared papers
- Linke Ouyang 4 shared papers
- Long Xing 4 shared papers
- Yujie Zhou 4 shared papers
- Ziyu Liu 4 shared papers
- Hang Yan 3 shared papers
- Jingwen Li 3 shared papers
- Penghui Yang 3 shared papers