Star: Spatial-temporal augmentation with text-to-video models for real-world video super-resolution.arXiv preprint arXiv:2501.02976

Rui Xie, Yinhong Liu, Penghao Zhou, Chen Zhao, Jun Zhou, Kai Zhang, Zhenyu Zhang, Jian Yang, Zhenheng Yang, Ying Tai · 2025 · arXiv 2501.02976

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

representative citing papers

ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

cs.CV · 2026-03-18 · unverdicted · novelty 7.0

ChopGrad truncates backpropagation to local frame windows in video diffusion models, reducing memory from linear in frame count to constant while enabling pixel-wise loss fine-tuning.

DiffST: Spatiotemporal-Aware Diffusion for Real-World Space-Time Video Super-Resolution

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

DiffST delivers state-of-the-art real-world space-time video super-resolution with 17x faster inference than prior diffusion methods by using one-step sampling, cross-frame context aggregation, and video representation guidance.

DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

DVFace uses a spatio-temporal dual-codebook and asymmetric fusion in a one-step diffusion model to deliver better video face restoration quality, temporal consistency, and identity preservation than recent methods.

Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution

cs.CV · 2025-09-28 · unverdicted · novelty 5.0

OASIS reduces redundancy in diffusion models for real-world video super-resolution via attention specialization routing and progressive training, delivering state-of-the-art quality with 6.2x faster inference than prior one-step baselines.

citing papers explorer

Showing 4 of 4 citing papers.

ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation cs.CV · 2026-03-18 · unverdicted · none · ref 65
ChopGrad truncates backpropagation to local frame windows in video diffusion models, reducing memory from linear in frame count to constant while enabling pixel-wise loss fine-tuning.
DiffST: Spatiotemporal-Aware Diffusion for Real-World Space-Time Video Super-Resolution cs.CV · 2026-05-13 · unverdicted · none · ref 50
DiffST delivers state-of-the-art real-world space-time video super-resolution with 17x faster inference than prior diffusion methods by using one-step sampling, cross-frame context aggregation, and video representation guidance.
DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration cs.CV · 2026-04-16 · unverdicted · none · ref 45
DVFace uses a spatio-temporal dual-codebook and asymmetric fusion in a one-step diffusion model to deliver better video face restoration quality, temporal consistency, and identity preservation than recent methods.
Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution cs.CV · 2025-09-28 · unverdicted · none · ref 17
OASIS reduces redundancy in diffusion models for real-world video super-resolution via attention specialization routing and progressive training, delivering state-of-the-art quality with 6.2x faster inference than prior one-step baselines.

Star: Spatial-temporal augmentation with text-to-video models for real-world video super-resolution.arXiv preprint arXiv:2501.02976

fields

years

verdicts

representative citing papers

citing papers explorer