Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction , url =

Tian, Keyu, Jiang, Yi, Yuan, Zehuan, Peng, Bingyue, Wang, Liwei , booktitle = · DOI 10.52202/079017-2694

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

Variable codebook sizes that increase along the sequence in visual tokenizers reduce generation FID scores significantly for autoregressive models on ImageNet.

Training-Free Semantic Correction for Autoregressive Visual Models

cs.CV · 2026-06-21 · unverdicted · novelty 6.0

Gazer uses MLLM feedback in two stages to diagnose semantic errors in intermediate AVM states and rewind/rectify the generation trajectory, improving alignment on compositional benchmarks without training.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation cs.CV · 2026-05-07 · unverdicted · none · ref 41
Variable codebook sizes that increase along the sequence in visual tokenizers reduce generation FID scores significantly for autoregressive models on ImageNet.
Training-Free Semantic Correction for Autoregressive Visual Models cs.CV · 2026-06-21 · unverdicted · none · ref 6
Gazer uses MLLM feedback in two stages to diagnose semantic errors in intermediate AVM states and rewind/rectify the generation trajectory, improving alignment on compositional benchmarks without training.

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction , url =

fields

years

verdicts

representative citing papers

citing papers explorer