Improved denoising diffusion probabilistic models

Alexander Quinn Nichol, Prafulla Dhariwal

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method: 1

citation-polarity summary

use method: 1

representative citing papers

How to Spin an Object: First, Get the Shape Right

cs.CV · 2024-12-13 · unverdicted · novelty 7.0

Camera-Relative Object Coordinates (CROCS) as an intermediate geometry representation in two-stage image-to-3D models yields superior novel-view quality, geometric accuracy, and multiview consistency over depth maps, visual features, and other pointmap alternatives.

DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation

cs.CV · 2025-07-17 · unverdicted · novelty 6.0

DiffClean applies text-guided diffusion to erase makeup from faces, boosting age estimation and verification accuracy over makeup-affected images.

Retrievals Can Be Detrimental: Unveiling the Backdoor Vulnerability of Retrieval-Augmented Diffusion Models

cs.CV · 2025-01-23 · conditional · novelty 6.0

BadRDM is a backdoor attack on retrieval-augmented diffusion models that poisons the retrieval database with toxicity surrogates and uses multimodal contrastive learning to force toxic generations from text triggers while preserving benign performance.

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

cs.RO · 2024-11-29 · unverdicted · novelty 6.0

CogACT is a new vision-language-action (VLA) model that uses a conditioned diffusion action transformer to achieve average success rates over 35% higher than OpenVLA in simulation and 55% higher in real-robot experiments, while generalizing to new robots and objects.

citing papers explorer

Showing 4 of 4 citing papers.

How to Spin an Object: First, Get the Shape Right
cs.CV · 2024-12-13 · unverdicted · none · ref 33

DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation
cs.CV · 2025-07-17 · unverdicted · none · ref 35

Retrievals Can Be Detrimental: Unveiling the Backdoor Vulnerability of Retrieval-Augmented Diffusion Models
cs.CV · 2025-01-23 · conditional · none · ref 37

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
cs.RO · 2024-11-29 · unverdicted · none · ref 47
