pith. sign in

Sicong Leng

Identifiers

  • name variant Sicong Leng 0.60 · backfill

Papers (7)

  1. LDDR: Linear-DPP-Based Dynamic-Resolution Frame Sampling for Video MLLMs cs.CV · 2026 · author #6
  2. InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search cs.CV · 2026 · author #5
  3. Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling cs.CV · 2026 · author #6
  4. World Model for Robot Learning: A Comprehensive Survey cs.RO · 2026 · author #6
  5. LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling cs.CV · 2025 · author #5
  6. VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding cs.CV · 2025 · author #7
  7. VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs cs.CV · 2024 · author #2

Mentions

  • 2511.20785 #5 · arxiv_oai · confidence 0.70 Sicong Leng

Frequent Coauthors