Yuhang Zang — Pith Author Registry

Identifiers

name variant Yuhang Zang 0.60 · backfill

Papers (21)

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning cs.CV · 2026 · author #4
AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO cs.CV · 2026 · author #5
OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs cs.CV · 2026 · author #3
Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition cs.CV · 2026 · author #9
Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents cs.PL · 2026 · author #2
ETCHR: Editing To Clarify and Harness Reasoning cs.CV · 2026 · author #4
SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction cs.CV · 2026 · author #4
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation cs.CL · 2026 · author #17
Visual-ERM: Reward Modeling for Visual Equivalence cs.CV · 2026 · author #10
GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking cs.CV · 2026 · author #4
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing cs.CV · 2025 · author #43
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning cs.CV · 2025 · author #3
Unified Reward Model for Multimodal Understanding and Generation cs.CV · 2025 · author #2
Visual-RFT: Visual Reinforcement Fine-Tuning cs.CV · 2025 · author #3
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction cs.CV · 2024 · author #6
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #3
Are We on the Right Way for Evaluating Large Vision-Language Models? cs.CV · 2024 · author #5
InternLM2 Technical Report cs.CL · 2024 · author #79
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition cs.CV · 2024 · author #3
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model cs.CV · 2024 · author #3
Scene Text Detection with Supervised Pyramid Context Network cs.CV · 2018 · author #2

Mentions

2606.09393 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
2606.06828 #5 · arxiv_oai · confidence 0.70 Yuhang Zang
2606.03890 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
2606.01636 #9 · arxiv_oai · confidence 0.70 Yuhang Zang
2605.27955 #2 · arxiv_oai · confidence 0.70 Yuhang Zang
2605.23897 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
2605.20110 #4 · arxiv_oai · confidence 0.70 Yuhang Zang
2403.13805 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
2509.22186 #43 · arxiv_oai · confidence 0.70 Yuhang Zang
2407.03320 #3 · arxiv_oai · confidence 0.70 Yuhang Zang
2401.16420 #3 · arxiv_oai · confidence 0.70 Yuhang Zang

Frequent Coauthors

Jiaqi Wang 16 shared papers
Dahua Lin 13 shared papers
Xiaoyi Dong 9 shared papers
Haodong Duan 6 shared papers
Kai Chen 6 shared papers
Pan Zhang 6 shared papers
Wei Li 6 shared papers
Yibin Wang 6 shared papers
Conghui He 5 shared papers
Yuhang Cao 5 shared papers
Yu Qiao 5 shared papers
Bin Wang 4 shared papers
Jiazi Bu 4 shared papers
Linke Ouyang 4 shared papers
Long Xing 4 shared papers
Yujie Zhou 4 shared papers
Ziyu Liu 4 shared papers
Hang Yan 3 shared papers
Jingwen Li 3 shared papers
Penghui Yang 3 shared papers