Q-bert: Hessian based ultra low precision quantization of bert

· 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

BEExformer: A Fast Inferencing Binarized Transformer with Early Exits

cs.CL · 2024-12-06 · unverdicted · novelty 6.0

BEExformer integrates binarization-aware training using a second-order sign approximation and entropy-based early exits with SLFN to achieve 21.3x size reduction, 52% fewer FLOPs, and 3.22% accuracy gain on NLP tasks.

citing papers explorer

Showing 1 of 1 citing paper.

BEExformer: A Fast Inferencing Binarized Transformer with Early Exits cs.CL · 2024-12-06 · unverdicted · none · ref 11
BEExformer integrates binarization-aware training using a second-order sign approximation and entropy-based early exits with SLFN to achieve 21.3x size reduction, 52% fewer FLOPs, and 3.22% accuracy gain on NLP tasks.

Q-bert: Hessian based ultra low precision quantization of bert

fields

years

verdicts

representative citing papers

citing papers explorer