The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation

1 Pith paper cites this work. Polarity classification is still indexing.

citation-role summary

background 1

representative citing papers

Fast NF4 Dequantization Kernels for Large Language Model Inference

cs.LG · 2026-04-02 · unverdicted · novelty 5.0

A lightweight shared-memory technique for NF4 dequantization kernels yields 2.0-2.2x kernel speedup and 1.54x end-to-end gains on models up to 70B parameters while using only 64 bytes of shared memory per block.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Fast NF4 Dequantization Kernels for Large Language Model Inference · cs.LG · 2026-04-02 · unverdicted · none · ref 2
