Rethinking soft labels for knowledge distillation: A bias-variance tradeoff perspective

Helong Zhou, Liangchen Song, Jiajie Chen, Ye Zhou, Guoli Wang, Junsong Yuan, Qian Zhang · 2021 · arXiv 2102.00650

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation

cs.CV · 2025-12-11 · conditional · novelty 6.0

SAAD adaptively weights adversarial training samples by their transferability to the teacher, yielding higher AutoAttack robustness than prior distillation methods on CIFAR and Tiny-ImageNet without extra compute.

Distilling Tabular Foundation Models for Structured Health Data

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

Leakage-aware distillation transfers at least 90% of tabular foundation model AUC to lightweight students across 19 health datasets, with 26x CPU speedup and preserved calibration/fairness.

Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement

cs.AI · 2026-05-12 · unverdicted · novelty 5.0

LLM confidence for social science text measurements is poorly calibrated across models, and a soft-label distillation pipeline reduces expected calibration error by 43% and Brier score by 34%.

citing papers explorer

Showing 3 of 3 citing papers.

Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation · cs.CV · 2025-12-11 · conditional · none · ref 82
Distilling Tabular Foundation Models for Structured Health Data · cs.LG · 2026-05-18 · unverdicted · none · ref 19
Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement · cs.AI · 2026-05-12 · unverdicted · none · ref 40
