Understanding and improving knowledge distillation for quantization aware training of large transformer encoders.EMNLP, 2022

Minsoo Kim, Sihwa Lee, Sukjin Hong, Du-Seong Chang, Jungwook Choi · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

FTerViT: Fully Ternary Vision Transformer

cs.CV · 2026-05-20 · conditional · novelty 7.0

FTerViT introduces fully ternary Vision Transformers with TernaryBitConv2d and TernaryLayerNorm operators, achieving 82.43% ImageNet top-1 at 6.09 MB with 15x compression.

citing papers explorer

Showing 1 of 1 citing paper.

FTerViT: Fully Ternary Vision Transformer cs.CV · 2026-05-20 · conditional · none · ref 33
FTerViT introduces fully ternary Vision Transformers with TernaryBitConv2d and TernaryLayerNorm operators, achieving 82.43% ImageNet top-1 at 6.09 MB with 15x compression.

Understanding and improving knowledge distillation for quantization aware training of large transformer encoders.EMNLP, 2022

fields

years

verdicts

representative citing papers

citing papers explorer