Stream-DiffVSR enables practical low-latency video super-resolution by combining a four-step distilled denoiser, auto-regressive temporal guidance, and a temporal processor in a strictly causal pipeline.
Vrt: A video restoration transformer.arXiv preprint arXiv:2201.12288, 2022
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 3representative citing papers
Q-Align trains LMMs on discrete text-defined levels for visual scoring, achieving SOTA on IQA, IAA, and VQA while unifying the tasks in OneAlign.
A hybrid AI system combines super-resolution, YOLO-based detection, and vision-language models to semantically classify building damage severity in pre- and post-disaster satellite images.
citing papers explorer
-
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion
Stream-DiffVSR enables practical low-latency video super-resolution by combining a four-step distilled denoiser, auto-regressive temporal guidance, and a temporal processor in a strictly causal pipeline.
-
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Q-Align trains LMMs on discrete text-defined levels for visual scoring, achieving SOTA on IQA, IAA, and VQA while unifying the tasks in OneAlign.
-
From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery
A hybrid AI system combines super-resolution, YOLO-based detection, and vision-language models to semantically classify building damage severity in pre- and post-disaster satellite images.