Denoising diffusion probabilistic models

· 2020

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

browse 7 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

DrawMotion: Generating 3D Human Motions by Freehand Drawing

cs.CV · 2026-05-20 · unverdicted · novelty 7.0

DrawMotion is a diffusion-based framework that fuses text and hand-drawn stickman conditions via a Multi-Condition Module and training-free guidance to generate 3D human motions.

Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

cs.CV · 2026-04-30 · unverdicted · novelty 7.0

Noise2Map repurposes diffusion model denoising into a direct predictor for semantic segmentation and change detection tasks in remote sensing, achieving top average ranks on benchmark datasets.

Unified Reward Model for Multimodal Understanding and Generation

cs.CV · 2025-03-07 · unverdicted · novelty 7.0

UnifiedReward is the first unified reward model that jointly assesses multimodal understanding and generation to provide better preference signals for aligning vision models via DPO.

DiffVC: A Non-autoregressive Framework Based on Diffusion Model for Video Captioning

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

DiffVC applies diffusion models for non-autoregressive video captioning, outperforming prior non-AR methods and matching AR ones in quality with faster speed on standard benchmarks.

SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification

cs.CV · 2025-04-13 · unverdicted · novelty 6.0

SD-ReID trains a ViT to extract identity and view conditions, fine-tunes Stable Diffusion to generate view-mimicking features, adds a View-Refined Decoder, and combines both identity and all-view features for retrieval on aerial-ground re-identification benchmarks.

GCDance: Genre-Controlled Music-Driven 3D Full Body Dance Generation

cs.GR · 2025-02-25 · unverdicted · novelty 6.0

GCDance is a text-and-music-conditioned diffusion framework that generates genre-consistent 3D dance sequences and reports better results than prior methods on FineDance and AIST++.

SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation

cs.CV · 2024-11-28 · unverdicted · novelty 5.0

SOW uses MLLMs and attention to selectively control unidirectional diffusion for pixel-level fidelity and contextual coherence in text-vision-to-image tasks.

citing papers explorer

Showing 7 of 7 citing papers.

DrawMotion: Generating 3D Human Motions by Freehand Drawing cs.CV · 2026-05-20 · unverdicted · none · ref 12
DrawMotion is a diffusion-based framework that fuses text and hand-drawn stickman conditions via a Multi-Condition Module and training-free guidance to generate 3D human motions.
Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection cs.CV · 2026-04-30 · unverdicted · none · ref 38
Noise2Map repurposes diffusion model denoising into a direct predictor for semantic segmentation and change detection tasks in remote sensing, achieving top average ranks on benchmark datasets.
Unified Reward Model for Multimodal Understanding and Generation cs.CV · 2025-03-07 · unverdicted · none · ref 36
UnifiedReward is the first unified reward model that jointly assesses multimodal understanding and generation to provide better preference signals for aligning vision models via DPO.
DiffVC: A Non-autoregressive Framework Based on Diffusion Model for Video Captioning cs.CV · 2026-04-09 · unverdicted · none · ref 16
DiffVC applies diffusion models for non-autoregressive video captioning, outperforming prior non-AR methods and matching AR ones in quality with faster speed on standard benchmarks.
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification cs.CV · 2025-04-13 · unverdicted · none · ref 31
SD-ReID trains a ViT to extract identity and view conditions, fine-tunes Stable Diffusion to generate view-mimicking features, adds a View-Refined Decoder, and combines both identity and all-view features for retrieval on aerial-ground re-identification benchmarks.
GCDance: Genre-Controlled Music-Driven 3D Full Body Dance Generation cs.GR · 2025-02-25 · unverdicted · none · ref 7
GCDance is a text-and-music-conditioned diffusion framework that generates genre-consistent 3D dance sequences and reports better results than prior methods on FineDance and AIST++.
SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation cs.CV · 2024-11-28 · unverdicted · none · ref 21
SOW uses MLLMs and attention to selectively control unidirectional diffusion for pixel-level fidelity and contextual coherence in text-vision-to-image tasks.

Denoising diffusion probabilistic models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer