Log-likelihood vectors establish a consistent KL divergence scale across pretraining, model sizes, seeds, quantization, fine-tuning, and layers, revealing subdiffusive trajectories and early stabilization in Pythia models.
InInternational Conference on Learning Representations
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CL 2verdicts
UNVERDICTED 2roles
dataset 1polarities
background 1representative citing papers
A systematic literature review that organizes recent work on LLMs for code generation into a taxonomy covering data curation, model advances, evaluations, ethics, environmental impact, and applications, with benchmark comparisons.
citing papers explorer
-
Establishing a Scale for Kullback-Leibler Divergence in Language Models Across Various Settings
Log-likelihood vectors establish a consistent KL divergence scale across pretraining, model sizes, seeds, quantization, fine-tuning, and layers, revealing subdiffusive trajectories and early stabilization in Pythia models.
-
A Survey on Large Language Models for Code Generation
A systematic literature review that organizes recent work on LLMs for code generation into a taxonomy covering data curation, model advances, evaluations, ethics, environmental impact, and applications, with benchmark comparisons.