pith. machine review for the scientific record. sign in

Shuai Bai

Identifiers

  • name variant Shuai Bai 0.60 · backfill

Papers (17)

  1. Qwen-Image-2.0 Technical Report cs.CV · 2026 · author #56
  2. Qwen3-VL-Seg: Unlocking Open-World Referring Segmentation with Vision-Language Grounding cs.CV · 2026 · author #6
  3. CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing cs.CL · 2026 · author #12
  4. Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking cs.CL · 2026 · author #6
  5. Qwen3-VL Technical Report cs.CV · 2025 · author #1
  6. Soft Adaptive Policy Optimization cs.LG · 2025 · author #8
  7. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #10
  8. Revisiting Multimodal Positional Encoding in Vision-Language Models cs.CV · 2025 · author #7
  9. Qwen3-Omni Technical Report cs.CL · 2025 · author #24
  10. Qwen-Image Technical Report cs.CV · 2025 · author #8
  11. Qwen2.5-Omni Technical Report cs.CL · 2025 · author #6
  12. Qwen2.5-VL Technical Report cs.CV · 2025 · author #1
  13. Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution cs.CV · 2024 · author #2
  14. Qwen2 Technical Report cs.CL · 2024 · author #40
  15. Qwen Technical Report cs.CL · 2023 · author #2
  16. Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond cs.CV · 2023 · author #2
  17. Multi-hierarchical Independent Correlation Filters for Visual Tracking cs.CV · 2018 · author #1

Mentions

  • 2407.10671 #40 · backfill · confidence 0.70 Shuai Bai
  • 2511.21631 #1 · backfill · confidence 0.70 Shuai Bai

Frequent Coauthors