arXiv preprint arXiv:2405.15734 , year=

LM4LV: A Frozen Large Language Model for Low-level Vision Tasks , author= · 2024 · arXiv 2405.15734

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Language-Assisted Super-Resolution from Real-World Low-Resolution Patches

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

LA-SR redefines unpaired super-resolution in language space by projecting images into a semantically rich representation and applying vision-language model guided losses to handle real-world degradations extracted from depth variations.

Mitigating State Aliasing in Vision-Language-Action Models via Inverse Dynamics Learning

cs.CV · 2026-05-28 · unverdicted · novelty 6.0

Inverse dynamics prediction is added as an auxiliary task to reduce state aliasing in VLA models by directly supervising the vision encoder on action-relevant visual distinctions using only standard observation-action pairs.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Language-Assisted Super-Resolution from Real-World Low-Resolution Patches cs.CV · 2026-06-30 · unverdicted · none · ref 143
LA-SR redefines unpaired super-resolution in language space by projecting images into a semantically rich representation and applying vision-language model guided losses to handle real-world degradations extracted from depth variations.
Mitigating State Aliasing in Vision-Language-Action Models via Inverse Dynamics Learning cs.CV · 2026-05-28 · unverdicted · none · ref 5
Inverse dynamics prediction is added as an auxiliary task to reduce state aliasing in VLA models by directly supervising the vision encoder on action-relevant visual distinctions using only standard observation-action pairs.

arXiv preprint arXiv:2405.15734 , year=

fields

years

verdicts

representative citing papers

citing papers explorer