Wenhai Wang — Pith Author Registry

Identifiers

name variant Wenhai Wang 0.60 · backfill

Papers (22)

In-situ operation of amorphous circuits under heavy-ion irradiation cond-mat.mtrl-sci · 2026 · author #7
Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning cs.AI · 2026 · author #9
HSD: Training-Free Acceleration for Document Parsing Vision-Language Models with Hierarchical Speculative Decoding cs.CV · 2026 · author #13
LLM-VA: Resolving the Jailbreak-Overrefusal Trade-off via Vector Alignment cs.LG · 2026 · author #5
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling cs.CL · 2025 · author #35
GenExam: A Multidisciplinary Text-to-Image Exam cs.CV · 2025 · author #6
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #74
ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal cs.SE · 2025 · author #8
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #18
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models cs.CV · 2025 · author #51
MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning cs.CV · 2025 · author #9
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling cs.CV · 2025 · author #13
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #42
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization cs.CL · 2024 · author #3
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #19
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites cs.CV · 2024 · author #35
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks cs.CV · 2023 · author #3
VideoChat: Chat-Centric Video Understanding cs.CV · 2023 · author #5
Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #1
Selective Kernel Networks cs.CV · 2019 · author #2
Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2018 · author #2
Mixed Link Networks cs.LG · 2018 · author #1

Mentions

2602.12957 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
2605.31206 #7 · arxiv_oai · confidence 0.70 Wenhai Wang
2605.30039 #9 · arxiv_oai · confidence 0.70 Wenhai Wang
2407.03320 #19 · arxiv_oai · confidence 0.70 Wenhai Wang
2501.12386 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
2411.10442 #3 · arxiv_oai · confidence 0.70 Wenhai Wang

Frequent Coauthors

Yu Qiao 12 shared papers
Jifeng Dai 9 shared papers
Tong Lu 8 shared papers
Xizhou Zhu 7 shared papers
Zhe Chen 7 shared papers
Botian Shi 6 shared papers
Min Dou 6 shared papers
Conghui He 5 shared papers
Dahua Lin 5 shared papers
Kai Chen 5 shared papers
Lewei Lu 5 shared papers
Limin Wang 5 shared papers
Shenglong Ye 5 shared papers
Weiyun Wang 5 shared papers
Xiang Li 5 shared papers
Yi Wang 5 shared papers
Zhangwei Gao 5 shared papers
Erfei Cui 4 shared papers
Junjun He 4 shared papers
Kaipeng Zhang 4 shared papers