Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser · 2022

9 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Building Normalizing Flows with Stochastic Interpolants

cs.LG · 2022-09-30 · conditional · novelty 8.0

Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

cs.CL · 2026-05-14 · unverdicted · novelty 7.0

DiHAL uses geometry proxies to pick where to replace the lower layers of a pretrained transformer with a diffusion bridge for hidden-state reconstruction, improving over token-level diffusion baselines on 8B models.

Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

Variable codebook sizes that increase along the sequence in visual tokenizers reduce generation FID scores significantly for autoregressive models on ImageNet.

Sparse-to-Complete: From Sparse Image Captures to Complete 3D Scenes

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

S2C-3D reconstructs complete high-fidelity 3D scenes from as few as 6-8 images by finetuning a diffusion model on scene data, applying consistency-conditioned sampling, and planning trajectories for full coverage.

Stochastic Schrödinger Diffusion Models for Pure-State Ensemble Generation

stat.ML · 2026-05-05 · unverdicted · novelty 7.0 · 2 refs

SSDMs introduce an intrinsic score-based diffusion framework on the Fubini-Study manifold to sample quantum pure-state ensembles without classical re-preparation.

Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation

cs.LG · 2026-05-02 · unverdicted · novelty 7.0

LHSD uses spectral filtering on the log-density Hessian to isolate tangent directions from noise and estimate local intrinsic dimension scalably via Stochastic Lanczos Quadrature.

Probing Visual Planning in Image Editing Models

cs.CV · 2026-04-23 · unverdicted · novelty 7.0

Image editing models fail zero-shot visual planning on abstract mazes and queen puzzles but generalize after finetuning, yet still cannot match human zero-shot efficiency.

Exploring Data-Free LoRA Transferability for Video Diffusion Models

cs.CV · 2026-05-03 · unverdicted · novelty 6.0

CASA uses spectral density to arbitrate between preserving the target model's manifold and restoring LoRA alignment, mitigating style degradation and structural collapse in distilled video diffusion models.

Stable and Near-Reversible Diffusion ODE Solvers for Image Editing

cs.CV · 2026-05-12 · unverdicted · novelty 5.0

Near-reversible Runge-Kutta diffusion ODE solvers with vector-field smoothing improve stability and edit fidelity for large changes in text-guided image editing compared to exactly reversible alternatives.

citing papers explorer

Building Normalizing Flows with Stochastic Interpolants · cs.LG · 2022-09-30 · conditional · none · ref 91
Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement · cs.CL · 2026-05-14 · unverdicted · none · ref 43
Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation · cs.CV · 2026-05-07 · unverdicted · none · ref 15
Sparse-to-Complete: From Sparse Image Captures to Complete 3D Scenes · cs.CV · 2026-05-07 · unverdicted · none · ref 21
Stochastic Schrödinger Diffusion Models for Pure-State Ensemble Generation · stat.ML · 2026-05-05 · unverdicted · none · ref 31 · 2 links
Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation · cs.LG · 2026-05-02 · unverdicted · none · ref 61
Probing Visual Planning in Image Editing Models · cs.CV · 2026-04-23 · unverdicted · none · ref 17
Exploring Data-Free LoRA Transferability for Video Diffusion Models · cs.CV · 2026-05-03 · unverdicted · none · ref 31
Stable and Near-Reversible Diffusion ODE Solvers for Image Editing · cs.CV · 2026-05-12 · unverdicted · none · ref 24
