Encoding user interactions into visual in-context example pairs turns static models into controllable systems that improve IoU, PSNR, and LPIPS on guided tasks without retraining.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
DEX is a modular network using dynamically activated experts and a group-EMA director to learn emergent modular representations for multi-modality medical vision foundation models, evaluated on a new 4M-image benchmark across 10 modalities and 26 downstream tasks.
citing papers explorer
-
From Static to Interactive: Adapting Visual in-Context Learners for User-Driven Tasks
Encoding user interactions into visual in-context example pairs turns static models into controllable systems that improve IoU, PSNR, and LPIPS on guided tasks without retraining.
-
Learning Emergent Modular Representations in Multi-modality Medical Vision Foundation Models
DEX is a modular network using dynamically activated experts and a group-EMA director to learn emergent modular representations for multi-modality medical vision foundation models, evaluated on a new 4M-image benchmark across 10 modalities and 26 downstream tasks.