pith. sign in

Well-read students learn better: The impact of student initialization on knowledge distillation

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

representative citing papers

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

cs.CL · 2022-10-17 · conditional · novelty 7.0

DiffuSeq adapts diffusion models to conditional sequence-to-sequence text generation and reports performance matching or exceeding strong baselines including pretrained language model systems while generating more diverse outputs.

citing papers explorer

Showing 8 of 8 citing papers.