IEEE Transactions on Image Processing32, 364–376 (2022)

Wu, X · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Can Multimodal Large Language Models Truly Understand Small Objects?

cs.CV · 2026-04-24 · unverdicted · novelty 7.0

Current MLLMs show weak performance on small object understanding tasks, but fine-tuning with the new SOU-Train dataset measurably improves their capabilities.

Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection

cs.CV · 2026-04-11 · unverdicted · novelty 5.0

A monotonic diff-based scale loss and learnable Gaussian convolution with adaptive pinwheel masking improve mIoU, Pd, and Fa for infrared small target detection on three benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Can Multimodal Large Language Models Truly Understand Small Objects? cs.CV · 2026-04-24 · unverdicted · none · ref 43
Current MLLMs show weak performance on small object understanding tasks, but fine-tuning with the new SOU-Train dataset measurably improves their capabilities.
Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection cs.CV · 2026-04-11 · unverdicted · none · ref 17
A monotonic diff-based scale loss and learnable Gaussian convolution with adaptive pinwheel masking improve mIoU, Pd, and Fa for infrared small target detection on three benchmarks.

IEEE Transactions on Image Processing32, 364–376 (2022)

fields

years

verdicts

representative citing papers

citing papers explorer