pith. sign in

Lan- guage models are few-shot learners.Advances in neural in- formation processing systems, 33:1877–1901

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.CV 6 cs.MM 1

years

2026 6 2025 1

roles

background 1

polarities

background 1

representative citing papers

On The Application of Linear Attention in Multimodal Transformers

cs.CV · 2026-04-11 · unverdicted · novelty 4.0

Linear attention delivers significant computational savings in multimodal transformers and follows the same scaling laws as softmax attention on ViT models trained on LAION-400M with ImageNet-21K zero-shot validation.

citing papers explorer

Showing 7 of 7 citing papers.