Bitnet distillation.arXiv preprint arXiv:2510.13998, 2025

Xun Wu, Shaohan Huang, Wenhui Wang, Ting Song, Li Dong, Yan Xia, Furu Wei · 2025 · arXiv 2510.13998

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

cs.CL · 2026-06-24 · unverdicted · novelty 6.0

BITEMBED converts LLM backbones to ternary BitNet-style encoders, adapts them with contrastive pre-training and teacher distillation, and produces text embeddings at multiple precisions that perform comparably to full-precision baselines on MMTEB.

On the Expressive Power of Weight Quantization in Large Language Models

cs.LG · 2026-06-20 · unverdicted · novelty 4.0

Weight-quantized LLMs retain universal approximation up to 1.58 bits with expressive collapse below it and polynomial degradation in capacity as bit count falls.

citing papers explorer

Showing 2 of 2 citing papers.

BitNet Text Embeddings cs.CL · 2026-06-24 · unverdicted · none · ref 69
BITEMBED converts LLM backbones to ternary BitNet-style encoders, adapts them with contrastive pre-training and teacher distillation, and produces text embeddings at multiple precisions that perform comparably to full-precision baselines on MMTEB.
On the Expressive Power of Weight Quantization in Large Language Models cs.LG · 2026-06-20 · unverdicted · none · ref 41
Weight-quantized LLMs retain universal approximation up to 1.58 bits with expressive collapse below it and polynomial degradation in capacity as bit count falls.

Bitnet distillation.arXiv preprint arXiv:2510.13998, 2025

fields

years

verdicts

representative citing papers

citing papers explorer