pith. sign in

Wenhai Wang

Identifiers

  • name variant Wenhai Wang 0.60 · backfill

Papers (22)

  1. In-situ operation of amorphous circuits under heavy-ion irradiation cond-mat.mtrl-sci · 2026 · author #7
  2. Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning cs.AI · 2026 · author #9
  3. HSD: Training-Free Acceleration for Document Parsing Vision-Language Models with Hierarchical Speculative Decoding cs.CV · 2026 · author #13
  4. LLM-VA: Resolving the Jailbreak-Overrefusal Trade-off via Vector Alignment cs.LG · 2026 · author #5
  5. MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling cs.CL · 2025 · author #35
  6. GenExam: A Multidisciplinary Text-to-Image Exam cs.CV · 2025 · author #6
  7. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #74
  8. ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal cs.SE · 2025 · author #8
  9. ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #18
  10. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models cs.CV · 2025 · author #51
  11. MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning cs.CV · 2025 · author #9
  12. InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling cs.CV · 2025 · author #13
  13. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #42
  14. Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization cs.CL · 2024 · author #3
  15. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #19
  16. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites cs.CV · 2024 · author #35
  17. InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks cs.CV · 2023 · author #3
  18. VideoChat: Chat-Centric Video Understanding cs.CV · 2023 · author #5
  19. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #1
  20. Selective Kernel Networks cs.CV · 2019 · author #2
  21. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2018 · author #2
  22. Mixed Link Networks cs.LG · 2018 · author #1

Mentions

  • 2602.12957 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2605.31206 #7 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2605.30039 #9 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2407.03320 #19 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2501.12386 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2411.10442 #3 · arxiv_oai · confidence 0.70 Wenhai Wang

Frequent Coauthors