arXiv preprint arXiv:2503.08354

Robust latent matters: Boosting image generation with sampling error synthesis · 2025 · arXiv 2503.08354

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers

cs.CV · 2026-06-11 · unverdicted · novelty 6.0

HYDRA-X presents the first unified multimodal model using a single ViT for holistic image-video tokenization, with ablations on attention and compression plus a latent-level editing improvement.

Aligning Latent Geometry for Spherical Flow Matching in Image Generation

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

Projecting VAE latents to a fixed spherical radius and replacing linear interpolation with spherical linear interpolation improves class-conditional ImageNet-256 FID while leaving the diffusion architecture unchanged.

citing papers explorer

Showing 2 of 2 citing papers after filters.

HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers
cs.CV · 2026-06-11 · unverdicted · none · ref 81
HYDRA-X presents the first unified multimodal model using a single ViT for holistic image-video tokenization, with ablations on attention and compression plus a latent-level editing improvement.

Aligning Latent Geometry for Spherical Flow Matching in Image Generation
cs.CV · 2026-05-14 · unverdicted · none · ref 34
Projecting VAE latents to a fixed spherical radius and replacing linear interpolation with spherical linear interpolation improves class-conditional ImageNet-256 FID while leaving the diffusion architecture unchanged.
