pith. sign in

Yunhang Shen

Identifiers

  • name variant Yunhang Shen 0.60 · backfill

Papers (6)

  1. Toward Native Multimodal Modeling: A Roadmap cs.CV · 2026 · author #13
  2. Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding cs.CV · 2026 · author #5
  3. GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant cs.CL · 2026 · author #4
  4. VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction cs.CV · 2025 · author #5
  5. Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis cs.CV · 2024 · author #9
  6. MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models cs.CV · 2023 · author #3

Mentions

  • 2605.25343 #13 · arxiv_oai · confidence 0.70 Yunhang Shen
  • 2501.01957 #5 · arxiv_oai · confidence 0.70 Yunhang Shen

Frequent Coauthors