Junha Song

Identifiers

No identifiers captured yet.

Papers (2)

  1. Learning to See What You Need: Gaze Attention for Multimodal Large Language Models · cs.CV · 2026 · author #1
  2. RL makes MLLMs see better than SFT · cs.CV · 2025 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors