This confirms that our projection-based compression effectively retains core semantic structures that are often lost in standard reconstruction-based VAE training.

Superiority over Baselines: RePack achieves a validation accuracy of 42

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

RePack then Refine: Efficient Diffusion Transformer with Vision Foundation Model

cs.CV · 2025-12-12 · conditional · novelty 6.0

RePack projects VFM features to a low-dimensional manifold for efficient DiT training, followed by a Latent-Guided Refiner that improves FID to 1.65 on ImageNet-1K after 64 epochs.

