Parallel Context Modeling for Sliding Window Attention in Neural Video Coding

· 2026 · eess.IV · arXiv 2605.20977

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Most neural video codecs rely on temporal conditioning, which makes them susceptible to error propagation over long sequences. While Transformer-based architectures like the VCT offer a drift-free alternative, they suffer from high computational complexity and inferior RD performance. The recent SWA addresses these shortcomings by reducing complexity and enhancing RD performance, yet it restricts decoding to a strictly sequential raster-scan order, creating a critical bottleneck in decoding latency. To resolve this, we propose P-SWA, utilizing diagonal wavefronts to enable parallel decoding. By embedding a hyperprior and introducing an accumulator to fuse side information and local spatial context, our method increases decoding speed by 36% over the parallel VCT. Simultaneously, it achieves Bj{\o}ntegaard Delta-rate savings of up to 10.0% for I-frames and 7.1% for P-frames over the SWA baseline.

representative citing papers

Parallel Context Modeling for Sliding Window Attention in Neural Video Coding

eess.IV · 2026-05-20 · unverdicted · novelty 5.0

P-SWA enables parallel decoding via diagonal wavefronts, a hyperprior, and an accumulator, delivering 36% higher decoding speed than parallel VCT and BD-rate savings of up to 10.0% on I-frames and 7.1% on P-frames versus SWA.

citing papers explorer

Showing 1 of 1 citing paper.

Parallel Context Modeling for Sliding Window Attention in Neural Video Coding eess.IV · 2026-05-20 · unverdicted · none · ref 2 · internal anchor
P-SWA enables parallel decoding via diagonal wavefronts, a hyperprior, and an accumulator, delivering 36% higher decoding speed than parallel VCT and BD-rate savings of up to 10.0% on I-frames and 7.1% on P-frames versus SWA.

Parallel Context Modeling for Sliding Window Attention in Neural Video Coding

fields

years

verdicts

representative citing papers

citing papers explorer