High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment

cs.RO · 2025-04-22 · unverdicted · novelty 5.0

DriVerse is a generative model that simulates driving scenes from an image and trajectory using multimodal prompting and motion alignment, achieving better performance on nuScenes and Waymo datasets with minimal training.

Rethinking the Global Knowledge of CLIP in Training-Free Open-Vocabulary Semantic Segmentation

cs.LG · 2025-02-05 · unverdicted · novelty 4.0

GCLIP improves TF-OVSS by reshaping last-block attention via fusion of global-token block attention with Query-Query attention and applying channel suppression to Value embeddings, outperforming prior methods on five benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
cs.RO · 2025-04-22 · unverdicted · none · ref 50
DriVerse is a generative model that simulates driving scenes from an image and trajectory using multimodal prompting and motion alignment, achieving better performance on nuScenes and Waymo datasets with minimal training.

Rethinking the Global Knowledge of CLIP in Training-Free Open-Vocabulary Semantic Segmentation
cs.LG · 2025-02-05 · unverdicted · none · ref 34
GCLIP improves TF-OVSS by reshaping last-block attention via fusion of global-token block attention with Query-Query attention and applying channel suppression to Value embeddings, outperforming prior methods on five benchmarks.
