Botian Shi

Identifiers

  • name variant Botian Shi 0.60 · backfill

Papers (9)

  1. SPIRAL: Self-Evolving Action-Conditioned Video Generation via Reflective Planning Agents · cs.CV · 2026 · author #11
  2. MGA: Memory-Driven GUI Agent for Observation-Centric Interaction · cs.AI · 2025 · author #4
  3. EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle · cs.CL · 2025 · author #11
  4. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency · cs.CV · 2025 · author #40
  5. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models · cs.CV · 2025 · author #29
  6. MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning · cs.CV · 2025 · author #8
  7. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling · cs.CV · 2024 · author #21
  8. MinerU: An Open-Source Solution for Precise Document Content Extraction · cs.CV · 2024 · author #15
  9. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites · cs.CV · 2024 · author #17

Mentions

  • 2603.08403 #11 · arxiv_oai · confidence 0.70 Botian Shi
  • 2510.16079 #11 · arxiv_oai · confidence 0.70 Botian Shi
  • 2409.18839 #15 · arxiv_oai · confidence 0.70 Botian Shi

Frequent Coauthors