pith. sign in

hub Canonical reference

Deepspeed: System optimiza- tions enable training deep learning models with over 100 billion parameters

Canonical reference. 71% of citing Pith papers cite this work as background.

18 Pith papers citing it
Background 71% of classified citations

hub tools

citation-role summary

background 6 method 1

citation-polarity summary

representative citing papers

Llemma: An Open Language Model For Mathematics

cs.CL · 2023-10-16 · unverdicted · novelty 6.0

Continued pretraining of Code Llama on Proof-Pile-2 yields Llemma, an open math-specialized LLM that beats known open base models on MATH and supports tool use plus formal proving out of the box.

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

cs.CL · 2022-11-09 · unverdicted · novelty 6.0

BLOOM is a 176B-parameter open-access multilingual language model trained on the ROOTS corpus that achieves competitive performance on benchmarks, with improved results after multitask prompted finetuning.

torchtune: PyTorch native post-training library

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

torchtune is a modular PyTorch library for LLM post-training that delivers competitive performance and memory efficiency while supporting rapid research iteration through hackable components.

citing papers explorer

Showing 18 of 18 citing papers.