Image fusion via vision-language model.arXiv preprint arXiv:2402.02235

Zixiang Zhao, Lilun Deng, Haowen Bai, Yukun Cui, Zhipeng Zhang, Yulun Zhang, Haotong Qin, Dongdong Chen, Jiangshe Zhang, Peng Wang, et al · 2024 · arXiv 2402.02235

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

MPerS: Dynamic MLLM MixExperts Perception-Guided Remote Sensing Scene Segmentation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

MPerS dynamically mixes semantic guidance from MLLM-generated RS captions with DINOv3 features via MixExperts and Linguistic Query Guided Attention to achieve superior semantic segmentation on three public remote sensing datasets.

Adding Thermal Awareness to Visual Systems in Real-Time via Distilled Diffusion Models

cs.CV · 2026-05-07 · unverdicted · novelty 5.0

FusionProxy is a distilled diffusion-based fusion module that adds thermal awareness to RGB vision systems in real time as an independent plug-and-play component.

CNN-ViT Fusion with Adaptive Attention Gate for Brain Tumor MRI Classification: A Hybrid Deep Learning Model

cs.CV · 2026-04-25 · unverdicted · novelty 5.0

Hybrid CNN-ViT with adaptive attention gate achieves 97.6% accuracy on brain tumor MRI classification, outperforming baselines.

citing papers explorer

Showing 3 of 3 citing papers.

MPerS: Dynamic MLLM MixExperts Perception-Guided Remote Sensing Scene Segmentation cs.CV · 2026-05-11 · unverdicted · none · ref 50
MPerS dynamically mixes semantic guidance from MLLM-generated RS captions with DINOv3 features via MixExperts and Linguistic Query Guided Attention to achieve superior semantic segmentation on three public remote sensing datasets.
Adding Thermal Awareness to Visual Systems in Real-Time via Distilled Diffusion Models cs.CV · 2026-05-07 · unverdicted · none · ref 45
FusionProxy is a distilled diffusion-based fusion module that adds thermal awareness to RGB vision systems in real time as an independent plug-and-play component.
CNN-ViT Fusion with Adaptive Attention Gate for Brain Tumor MRI Classification: A Hybrid Deep Learning Model cs.CV · 2026-04-25 · unverdicted · none · ref 17
Hybrid CNN-ViT with adaptive attention gate achieves 97.6% accuracy on brain tumor MRI classification, outperforming baselines.

Image fusion via vision-language model.arXiv preprint arXiv:2402.02235

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer