FlexDiT: Dynamic token density control for diffusion transformer

Chang, S · 2024 · arXiv 2412.06028

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Learning to Adaptively Allocate Gaussians for Arbitrary-Scale Image Super-Resolution

cs.CV · 2026-06-28 · unverdicted · novelty 7.0

QuADA-GS learns to predict local complexity-driven Gaussian densification from low-resolution inputs and uses Hierarchical Pointer Convolution for efficient arbitrary-scale super-resolution.

CoReDiT: Spatial Coherence-Guided Token Pruning and Reconstruction for Efficient Diffusion Transformers

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

CoReDiT reduces self-attention FLOPs in DiTs by up to 55% via linear-time spatial coherence pruning and neighbor-based reconstruction, delivering 1.33x-1.72x speedups with maintained quality.

DC-DiT: Adaptive Compute and Elastic Inference for Visual Generation via Dynamic Chunking

cs.CV · 2026-03-06 · unverdicted · novelty 7.0

DC-DiT learns dynamic chunking to allocate fewer tokens to smooth or noisy regions and more to detailed or late-stage areas, cutting inference FLOPs up to 36.8% while improving FID up to 37.8% on class-conditional ImageNet generation.

Vitality-Aware Compression for Efficient Image-to-Shape Diffusion Transformers

cs.CV · 2026-07-01 · unverdicted · novelty 5.0

Introduces vitality-aware compression for image-to-3D DiT models via structured pruning, adaptive quantization, and fine-tuning, claiming 66% size reduction with comparable fidelity.

citing papers explorer

Learning to Adaptively Allocate Gaussians for Arbitrary-Scale Image Super-Resolution

cs.CV · 2026-06-28 · unverdicted · none · ref 57

QuADA-GS learns to predict local complexity-driven Gaussian densification from low-resolution inputs and uses Hierarchical Pointer Convolution for efficient arbitrary-scale super-resolution.

CoReDiT: Spatial Coherence-Guided Token Pruning and Reconstruction for Efficient Diffusion Transformers

cs.CV · 2026-05-13 · unverdicted · none · ref 5

CoReDiT reduces self-attention FLOPs in DiTs by up to 55% via linear-time spatial coherence pruning and neighbor-based reconstruction, delivering 1.33x-1.72x speedups with maintained quality.

DC-DiT: Adaptive Compute and Elastic Inference for Visual Generation via Dynamic Chunking

cs.CV · 2026-03-06 · unverdicted · none · ref 9

DC-DiT learns dynamic chunking to allocate fewer tokens to smooth or noisy regions and more to detailed or late-stage areas, cutting inference FLOPs up to 36.8% while improving FID up to 37.8% on class-conditional ImageNet generation.

Vitality-Aware Compression for Efficient Image-to-Shape Diffusion Transformers

cs.CV · 2026-07-01 · unverdicted · none · ref 6

Introduces vitality-aware compression for image-to-3D DiT models via structured pruning, adaptive quantization, and fine-tuning, claiming 66% size reduction with comparable fidelity.
