In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

Yun, S · 2019 · arXiv 2019.00612

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

TextTeacher: What Can Language Teach About Images?

cs.CV · 2026-05-21 · unverdicted · novelty 6.0

TextTeacher uses frozen text embeddings from captions as semantic anchors to guide vision model training, improving ImageNet accuracy by up to 2.7 p.p. and transfer performance by 1.0 p.p. on average.

VISTA: Variance-Gated Inter-Sequence Test-Time Adaptation for Multi-Sequence MRI Segmentation

cs.CV · 2026-05-17 · unverdicted · novelty 6.0 · 2 refs

VISTA is a test-time adaptation framework for multi-sequence MRI that uses inter-sequence intervention probes and cross-view disagreement variance to gate self-training, yielding Dice gains of +1.89% on low-field African data and +2.82% on pediatric data over the source model.

Inducing Spatial Locality in Vision Transformers through the Training Protocol

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

CutMix augmentation during training induces spatial locality in early layers of Vision Transformers trained from scratch, as measured by reduced Mean Attention Distance.

Controllable Histopathology Image Synthesis with Training-free Structural Initialization and Textural Modulation

cs.CV · 2026-06-26 · unverdicted · novelty 5.0 · 2 refs

CHIS steers pretrained diffusion models to generate histopathology images aligned with input structural masks via frequency-domain structural initialization and wavelet-based textural modulation without any training on annotated data.

SignNet-1M: Large-Scale Multilingual Sign Language Video Dataset with Downstream Benchmarks

cs.CV · 2026-06-23 · unverdicted · novelty 5.0

The paper releases SignNet-1M, a 1M-scale augmented dataset for ASL, CSL and DGS with 3DGS and diffusion-based variations, plus benchmarks showing improved cross-shift generalization.

Weak-to-Strong Knowledge Distillation Accelerates Visual Learning

cs.CV · 2026-04-16 · unverdicted · novelty 5.0

Weak-to-strong knowledge distillation applied early and then turned off accelerates convergence to target performance in visual learning tasks by factors of 1.7-4.8x.

Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness

cs.CV · 2026-02-14 · unverdicted · novelty 4.0

LGTrack achieves 258.7 FPS real-time UAV tracking with 82.8% precision on UAVDT by combining dynamic layer selection, Global-Grouped Coordinate Attention, and Similarity-Guided Layer Adaptation.

Nonlinear Transformations Against Unlearnable Datasets

cs.LG · 2024-06-05 · unverdicted · novelty 4.0

Nonlinear transformations enable DNNs to achieve substantial test accuracy gains (0.34% to 249.59%) on unlearnable CIFAR10 datasets from twelve protection methods, outperforming a recent linear baseline.

SoK: A Comprehensive Analysis of the Current Status of Neural Tangent Generalization Attacks with Research Directions

cs.LG · 2026-05-12 · accept · novelty 3.0

NTGA is the first clean-label generalization attack under black-box settings but is vulnerable to adversarial training and image transformations, with newer attacks outperforming it.

citing papers explorer

Showing 8 of 8 citing papers after filters.

TextTeacher: What Can Language Teach About Images? cs.CV · 2026-05-21 · unverdicted · none · ref 73
TextTeacher uses frozen text embeddings from captions as semantic anchors to guide vision model training, improving ImageNet accuracy by up to 2.7 p.p. and transfer performance by 1.0 p.p. on average.
VISTA: Variance-Gated Inter-Sequence Test-Time Adaptation for Multi-Sequence MRI Segmentation cs.CV · 2026-05-17 · unverdicted · none · ref 24 · 2 links
VISTA is a test-time adaptation framework for multi-sequence MRI that uses inter-sequence intervention probes and cross-view disagreement variance to gate self-training, yielding Dice gains of +1.89% on low-field African data and +2.82% on pediatric data over the source model.
Inducing Spatial Locality in Vision Transformers through the Training Protocol cs.CV · 2026-05-11 · unverdicted · none · ref 17
CutMix augmentation during training induces spatial locality in early layers of Vision Transformers trained from scratch, as measured by reduced Mean Attention Distance.
Controllable Histopathology Image Synthesis with Training-free Structural Initialization and Textural Modulation cs.CV · 2026-06-26 · unverdicted · none · ref 24 · 2 links
CHIS steers pretrained diffusion models to generate histopathology images aligned with input structural masks via frequency-domain structural initialization and wavelet-based textural modulation without any training on annotated data.
SignNet-1M: Large-Scale Multilingual Sign Language Video Dataset with Downstream Benchmarks cs.CV · 2026-06-23 · unverdicted · none · ref 37
The paper releases SignNet-1M, a 1M-scale augmented dataset for ASL, CSL and DGS with 3DGS and diffusion-based variations, plus benchmarks showing improved cross-shift generalization.
Weak-to-Strong Knowledge Distillation Accelerates Visual Learning cs.CV · 2026-04-16 · unverdicted · none · ref 52
Weak-to-strong knowledge distillation applied early and then turned off accelerates convergence to target performance in visual learning tasks by factors of 1.7-4.8x.
Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness cs.CV · 2026-02-14 · unverdicted · none · ref 61
LGTrack achieves 258.7 FPS real-time UAV tracking with 82.8% precision on UAVDT by combining dynamic layer selection, Global-Grouped Coordinate Attention, and Similarity-Guided Layer Adaptation.
Nonlinear Transformations Against Unlearnable Datasets cs.LG · 2024-06-05 · unverdicted · none · ref 47
Nonlinear transformations enable DNNs to achieve substantial test accuracy gains (0.34% to 249.59%) on unlearnable CIFAR10 datasets from twelve protection methods, outperforming a recent linear baseline.

In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer