Imagenet: A large-scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, Li Fei-Fei · 2009

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

browse 7 citing papers

representative citing papers

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures

cs.CR · 2026-05-19 · unverdicted · novelty 8.0

VIPER exposes Functional Fusion in dynamic prompt architectures, enabling a backdoor that resists pruning by tightly integrating attack and utility parameters in the same high-magnitude core.

CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion

cs.CR · 2026-04-10 · unverdicted · novelty 7.0

CLIP-Inspector reconstructs OOD triggers to detect backdoors in prompt-tuned CLIP models with 94% accuracy and higher AUROC than baselines, plus a repair step via fine-tuning.

Dual-Modality Anchor-Guided Filtering for Test-time Prompt Tuning

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

Dual-modality anchors from text descriptions and test-time image statistics filter views and ensemble predictions to improve test-time prompt tuning, achieving SOTA on 15 datasets.

Generative Event Pretraining with Foundation Model Alignment

cs.CV · 2026-03-24 · unverdicted · novelty 6.0

GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.

MaskDiME: Adaptive Masked Diffusion for Precise and Efficient Visual Counterfactual Explanations

cs.CV · 2026-02-21 · unverdicted · novelty 6.0

MaskDiME uses adaptive masked diffusion to produce 30x faster, localized, and semantically consistent visual counterfactual explanations without training, matching or exceeding prior performance on five datasets.

RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding

cs.CV · 2026-05-19 · unverdicted · novelty 5.0 · 2 refs

RE-VLM fuses RGB and event data in a dual-stream VLM with a graph-based pipeline for generating training captions and QA pairs, plus two new datasets, showing gains over RGB-only and event-only baselines especially in challenging conditions.

LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models

cs.CV · 2026-05-19 · 2 refs

citing papers explorer

Showing 7 of 7 citing papers.

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures cs.CR · 2026-05-19 · unverdicted · none · ref 5
VIPER exposes Functional Fusion in dynamic prompt architectures, enabling a backdoor that resists pruning by tightly integrating attack and utility parameters in the same high-magnitude core.
CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion cs.CR · 2026-04-10 · unverdicted · none · ref 5
CLIP-Inspector reconstructs OOD triggers to detect backdoors in prompt-tuned CLIP models with 94% accuracy and higher AUROC than baselines, plus a repair step via fine-tuning.
Dual-Modality Anchor-Guided Filtering for Test-time Prompt Tuning cs.CV · 2026-04-14 · unverdicted · none · ref 5
Dual-modality anchors from text descriptions and test-time image statistics filter views and ensemble predictions to improve test-time prompt tuning, achieving SOTA on 15 datasets.
Generative Event Pretraining with Foundation Model Alignment cs.CV · 2026-03-24 · unverdicted · none · ref 10
GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.
MaskDiME: Adaptive Masked Diffusion for Precise and Efficient Visual Counterfactual Explanations cs.CV · 2026-02-21 · unverdicted · none · ref 6
MaskDiME uses adaptive masked diffusion to produce 30x faster, localized, and semantically consistent visual counterfactual explanations without training, matching or exceeding prior performance on five datasets.
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding cs.CV · 2026-05-19 · unverdicted · none · ref 7 · 2 links
RE-VLM fuses RGB and event data in a dual-stream VLM with a graph-based pipeline for generating training captions and QA pairs, plus two new datasets, showing gains over RGB-only and event-only baselines especially in challenging conditions.
LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models cs.CV · 2026-05-19 · unreviewed · ref 3 · 2 links

Imagenet: A large-scale hierarchical image database

fields

years

verdicts

representative citing papers

citing papers explorer