pith. sign in

hub

Nextstep-1: Toward autoregressive image generation with continuous tokens at scale

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

hub tools

citation-role summary

background 2 baseline 1 other 1

citation-polarity summary

years

2026 16

clear filters

representative citing papers

Editing Everything Everywhere All at Once

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

MICE modifies joint attention biases in Multimodal Diffusion Transformers to enable concurrent multi-instance edits while reducing semantic interference via user masks.

Channel-wise Vector Quantization

cs.CV · 2026-05-25 · unverdicted · novelty 6.0

CVQ replaces patch-wise vector quantization with channel-wise quantization of feature maps, enabling a next-channel autoregressive model that reports 100% codebook utilization and text-to-image scores of DPG 86.7 and GenEval 0.79.

Generative Refinement Networks for Visual Synthesis

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

GRN uses hierarchical binary quantization and entropy-guided refinement to set new ImageNet records of 0.56 rFID for reconstruction and 1.81 gFID for class-conditional generation while releasing code and models.

F3-Tokenizer: Taming Audio Autoencoder Latents for Understanding and Generation

cs.SD · 2026-06-04 · unverdicted · novelty 5.0

F3-Tokenizer adapts audio autoencoder latents with noise-regularized bottleneck (channel normalization and stochastic perturbation) and a representation encoder (RQ-MTP plus frozen-LLM supervision) to support both high-dimensional understanding representations and normalized continuous generation ta

citing papers explorer

Showing 1 of 1 citing paper after filters.