pith. sign in

Bert: Pre-training of deep bidirectional trans- formers for language understanding

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

fields

cs.CV 6

years

2026 6

verdicts

UNVERDICTED 6

representative citing papers

Depth Adaptive Efficient Visual Autoregressive Modeling

cs.CV · 2026-04-19 · unverdicted · novelty 7.0

DepthVAR adaptively allocates per-token computational depth in VAR models using a cyclic rotated scheduler and dynamic layer masking to achieve 2.3-3.1x inference speedup with minimal quality loss.

On The Application of Linear Attention in Multimodal Transformers

cs.CV · 2026-04-11 · unverdicted · novelty 4.0

Linear attention delivers significant computational savings in multimodal transformers and follows the same scaling laws as softmax attention on ViT models trained on LAION-400M with ImageNet-21K zero-shot validation.

citing papers explorer

Showing 6 of 6 citing papers.