Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851

Jonathan Ho, Ajay Jain, Pieter Abbeel · 2020

12 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters

cs.CV · 2026-02-25 · conditional · novelty 8.0

MasqLoRA shows that an independent LoRA adapter can be trained on a few trigger-target pairs to backdoor diffusion models with 99.8% success rate while remaining stealthy when the trigger is absent.

X-Splat: Gaussian Splatting for 3D CBCT Generation from Single Panoramic Radiograph

cs.CV · 2026-07-02 · unverdicted · novelty 7.0

X-Splat is the first Gaussian Splatting method that reconstructs CBCT-like 3D dental volumes from a single panoramic radiograph by constraining learnable Gaussians to panoramic geometry and adding a residual anatomical refiner.

Modality-Aware and Anatomical Vector-Quantized Autoencoding for Multimodal Brain MRI

cs.CV · 2026-04-06 · unverdicted · novelty 7.0

NeuroQuant is a modality-aware 3D VQ-VAE that uses dual-stream encoding, a shared anatomical codebook, and FiLM to achieve superior multi-modal brain MRI reconstruction.

Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration

cs.CV · 2026-03-17 · unverdicted · novelty 7.0

Face2Scene uses facial restoration as an oracle to derive degradation codes that condition a diffusion model for restoring the entire degraded scene.

Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

AdaScope adaptively selects optimal RL intervention points during diffusion denoising by monitoring structural and semantic changes, delivering 66% higher performance at 59% lower cost than full-trajectory RL baselines.

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

cs.CV · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

Fashion130K dataset and UMC framework align text and visual prompts to generate more consistent fashion outfits than prior state-of-the-art methods.

SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation

cs.CV · 2026-05-04 · unverdicted · novelty 6.0 · 2 refs

SlimDiffSR uses uncertainty-guided timestep assignment and structured pruning with frequency- and direction-separable convolutions plus MMD distillation to create a 200x faster, 20x smaller diffusion SR model for remote sensing while retaining competitive quality.

Bias at the End of the Score

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

Reward models used as quality scorers in text-to-image generation encode demographic biases that cause reward-guided training to sexualize female subjects, reinforce stereotypes, and reduce diversity.

EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasure

cs.CV · 2026-04-10 · unverdicted · novelty 6.0

EGLOCE erases target concepts in diffusion models at inference time by optimizing latents with dual energy guidance that repels unwanted concepts while retaining prompt alignment.

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

cs.CV · 2025-11-24 · conditional · novelty 6.0

DeCo decouples high- and low-frequency generation in pixel diffusion via a DiT plus lightweight decoder and a frequency-aware flow-matching loss, reaching FID 1.62 at 256x256 and 2.22 at 512x512 on ImageNet while closing the gap to latent diffusion methods.

Seeing What Matters: Visual Preference Policy Optimization for Visual Generation

cs.CV · 2025-11-24 · unverdicted · novelty 6.0

ViPO enhances GRPO for visual generation by creating spatially and temporally aware advantage maps from pretrained vision models to focus optimization on perceptually important regions.

GrOCE: Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models

cs.CV · 2025-11-17 · unverdicted · novelty 5.0

GrOCE uses dynamic semantic graphs for online, training-free erasure of target concepts from diffusion model prompts via cluster identification and selective severing.

citing papers explorer

When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters

cs.CV · 2026-02-25 · conditional · none · ref 17

X-Splat: Gaussian Splatting for 3D CBCT Generation from Single Panoramic Radiograph

cs.CV · 2026-07-02 · unverdicted · none · ref 12

Modality-Aware and Anatomical Vector-Quantized Autoencoding for Multimodal Brain MRI

cs.CV · 2026-04-06 · unverdicted · none · ref 10

Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration

cs.CV · 2026-03-17 · unverdicted · none · ref 20

Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?

cs.CV · 2026-05-15 · unverdicted · none · ref 15

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

cs.CV · 2026-05-11 · unverdicted · none · ref 15 · 2 links

SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation

cs.CV · 2026-05-04 · unverdicted · none · ref 12 · 2 links

Bias at the End of the Score

cs.CV · 2026-04-14 · unverdicted · none · ref 28

EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasure

cs.CV · 2026-04-10 · unverdicted · none · ref 21

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

cs.CV · 2025-11-24 · conditional · none · ref 19

Seeing What Matters: Visual Preference Policy Optimization for Visual Generation

cs.CV · 2025-11-24 · unverdicted · none · ref 12

GrOCE: Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models

cs.CV · 2025-11-17 · unverdicted · none · ref 13
