An empirical analysis of compute-optimal large language model training

Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katherine Millican, George van den Driessc · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 1

citation-polarity summary: background 1

representative citing papers

SFT-then-RL Outperforms Mixed-Policy Methods for LLM Reasoning

cs.LG · 2026-04-26 · conditional · novelty 6.0

Correcting DeepSpeed optimizer and OpenRLHF loss bugs reveals SFT-then-RL outperforms mixed-policy methods by 3.8-22.2 points on math benchmarks.

BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition

cs.CV · 2026-05-08 · unverdicted · novelty 5.0

BenchHAR finds that hybrid reconstruction-plus-contrastive SSL with CNN encoders generalizes best for sensor HAR but overall performance on unseen distributions remains unsatisfactory.

citing papers explorer

SFT-then-RL Outperforms Mixed-Policy Methods for LLM Reasoning
cs.LG · 2026-04-26 · conditional · none · ref 48
BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition
cs.CV · 2026-05-08 · unverdicted · none · ref 24
