pith. sign in

Detailflow: 1d coarse-to-fine autoregressive image generation via next-detail prediction.arXiv preprint arXiv:2505.21473, 2025b

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.CV 7

years

2026 7

verdicts

UNVERDICTED 7

roles

background 2

polarities

background 2

clear filters

representative citing papers

ChannelTok: Efficient Flexible-Length Vision Tokenization

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

ChannelTok introduces channel-wise tokenization with stochastic tail-dropping to achieve rFID 2.92 on ImageNet at 8.6x faster decoding and 2.1x smaller size than prior flexible tokenizers.

Autoregressive Visual Generation Needs a Prologue

cs.CV · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

Prologue adds a small set of learnable tokens trained exclusively with AR cross-entropy loss to decouple generation from reconstruction in autoregressive visual models, yielding lower gFID on ImageNet 256x256.

Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry

cs.CV · 2026-05-25 · unverdicted · novelty 6.0 · 2 refs

MIND integrates discrete patch tokenization into diffusion score functions via soft top-k and dual-branch layers, achieving FID 22.73 (no guidance) and 2.06 (with guidance) on ImageNet-256 after 80 epochs, outperforming DiT and larger LlamaGen models.

citing papers explorer

Showing 7 of 7 citing papers after filters.