pith. sign in

Linjie Li

Identifiers

  • name variant Linjie Li 0.60 · backfill

Papers (19)

  1. AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration cs.AI · 2026 · author #27
  2. SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects cs.AI · 2026 · author #3
  3. TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering cs.CV · 2026 · author #3
  4. Quantum-Gated Task-interaction Knowledge Distillation for Pre-trained Model-based Class-Incremental Learning cs.LG · 2026 · author #1
  5. LDEPrompt: Layer-importance guided Dual Expandable Prompt Pool for Pre-trained Model-based Class-Incremental Learning cs.CV · 2026 · author #1
  6. FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching cs.CV · 2026 · author #5
  7. RAGEN-2: Reasoning Collapse in Agentic RL cs.LG · 2026 · author #8
  8. Gym-V: A Unified Vision Environment System for Agentic Vision Research cs.CV · 2026 · author #5
  9. Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers cs.CV · 2025 · author #11
  10. OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning cs.CV · 2025 · author #2
  11. RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning cs.LG · 2025 · author #5
  12. V-MAGE: A Game Evaluation Framework for Assessing Vision-Centric Capabilities in Multimodal Large Language Models cs.CV · 2025 · author #2
  13. The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision) cs.CV · 2023 · author #2
  14. MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities cs.AI · 2023 · author #3
  15. Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning cs.CV · 2023 · author #3
  16. MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action cs.CV · 2023 · author #2
  17. GIT: A Generative Image-to-text Transformer for Vision and Language cs.CV · 2022 · author #4
  18. Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog cs.CV · 2019 · author #4
  19. Learning to see people like people cs.CV · 2017 · author #2

Mentions

  • 2605.20025 #27 · arxiv_oai · confidence 0.70 Linjie Li
  • 2605.19587 #3 · arxiv_oai · confidence 0.70 Linjie Li
  • 2505.08617 #2 · arxiv_oai · confidence 0.70 Linjie Li
  • 2205.14100 #4 · arxiv_oai · confidence 0.70 Linjie Li
  • 2309.17421 #2 · arxiv_oai · confidence 0.70 Linjie Li

Frequent Coauthors