pith. machine review for the scientific record.

Wenhai Wang

Identifiers

  • name variant Wenhai Wang 0.60 · backfill

Papers (19)

  1. LLM-VA: Resolving the Jailbreak-Overrefusal Trade-off via Vector Alignment cs.LG · 2026 · author #5
  2. MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling cs.CL · 2025 · author #35
  3. GenExam: A Multidisciplinary Text-to-Image Exam cs.CV · 2025 · author #6
  4. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #74
  5. ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal cs.SE · 2025 · author #8
  6. ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #18
  7. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models cs.CV · 2025 · author #51
  8. MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning cs.CV · 2025 · author #9
  9. InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling cs.CV · 2025 · author #13
  10. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #42
  11. Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization cs.CL · 2024 · author #3
  12. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #19
  13. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites cs.CV · 2024 · author #35
  14. InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks cs.CV · 2023 · author #3
  15. VideoChat: Chat-Centric Video Understanding cs.CV · 2023 · author #5
  16. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #1
  17. Selective Kernel Networks cs.CV · 2019 · author #2
  18. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2018 · author #2
  19. Mixed Link Networks cs.LG · 2018 · author #1

Mentions

  • 2407.03320 #19 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2501.12386 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2411.10442 #3 · arxiv_oai · confidence 0.70 Wenhai Wang

Frequent Coauthors