The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation

Meta AI, “The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation,” Meta AI Blog, Apr · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Fast NF4 Dequantization Kernels for Large Language Model Inference

cs.LG · 2026-04-02 · unverdicted · novelty 5.0

A lightweight shared-memory technique for NF4 dequantization kernels yields 2.0-2.2x kernel speedup and 1.54x end-to-end gains on models up to 70B parameters while using only 64 bytes of shared memory per block.

citing papers explorer

Showing 1 of 1 citing paper.

Fast NF4 Dequantization Kernels for Large Language Model Inference cs.LG · 2026-04-02 · unverdicted · none · ref 2
A lightweight shared-memory technique for NF4 dequantization kernels yields 2.0-2.2x kernel speedup and 1.54x end-to-end gains on models up to 70B parameters while using only 64 bytes of shared memory per block.

The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer