Available: https://arxiv.org/abs/2312.00858

[Online] · 2023 · arXiv 2312.00858

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

cs.LG · 2026-05-26 · unverdicted · novelty 6.0

RT-Lynx shifts DiT sparsity from weights to activations, reports up to 1.55x linear-layer speedup while preserving generation quality across multiple diffusion models.

OTCache: Optimal Transport for Geometry-Aware Caching in Diffusion Models

cs.LG · 2026-06-30 · unverdicted · novelty 5.0

OTCache uses optimal transport to interpolate caching schedules between a graph-based reference and an Optuna-optimized anchor, delivering 3.66x-4.7x speedups on FLUX.1, Qwen-Image and HunyuanVideo with improved fidelity.

Inside the Latent Flow: Causal Deciphering of Attention Dynamics in Audio Separation Foundation Models

cs.SD · 2026-06-08 · unverdicted · novelty 5.0

Causal probing of attention in audio separation transformers identifies dual pathways and asynchronous convergence, enabling a training-free Layer-Selective Attention Caching method that reduces self-attention computation by ~25% with negligible quality loss.

citing papers explorer

Showing 3 of 3 citing papers after filters.

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models cs.LG · 2026-05-26 · unverdicted · none · ref 41
RT-Lynx shifts DiT sparsity from weights to activations, reports up to 1.55x linear-layer speedup while preserving generation quality across multiple diffusion models.
OTCache: Optimal Transport for Geometry-Aware Caching in Diffusion Models cs.LG · 2026-06-30 · unverdicted · none · ref 30
OTCache uses optimal transport to interpolate caching schedules between a graph-based reference and an Optuna-optimized anchor, delivering 3.66x-4.7x speedups on FLUX.1, Qwen-Image and HunyuanVideo with improved fidelity.
Inside the Latent Flow: Causal Deciphering of Attention Dynamics in Audio Separation Foundation Models cs.SD · 2026-06-08 · unverdicted · none · ref 46
Causal probing of attention in audio separation transformers identifies dual pathways and asynchronous convergence, enabling a training-free Layer-Selective Attention Caching method that reduces self-attention computation by ~25% with negligible quality loss.

Available: https://arxiv.org/abs/2312.00858

fields

years

verdicts

representative citing papers

citing papers explorer