Woodfisher: Efficient second-order approximation for neural network compression

Sidak Pal Singh, Dan Alistarh · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

TOAST: Transformer Optimization using Adaptive and Simple Transformations

cs.LG · 2024-10-07 · unverdicted · novelty 5.0

TOAST approximates full transformer blocks in pretrained models via lightweight closed-form mappings to cut parameters and FLOPs without retraining or finetuning.

citing papers explorer

Showing 1 of 1 citing paper.

TOAST: Transformer Optimization using Adaptive and Simple Transformations cs.LG · 2024-10-07 · unverdicted · none · ref 31
TOAST approximates full transformer blocks in pretrained models via lightweight closed-form mappings to cut parameters and FLOPs without retraining or finetuning.

Woodfisher: Efficient second-order approximation for neural network compression

fields

years

verdicts

representative citing papers

citing papers explorer