pith. sign in

Zero: Memory optimizations toward training trillion parameter models

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

fields

cs.CL 5

years

2023 4 2022 1

representative citing papers

Llemma: An Open Language Model For Mathematics

cs.CL · 2023-10-16 · unverdicted · novelty 6.0

Continued pretraining of Code Llama on Proof-Pile-2 yields Llemma, an open math-specialized LLM that beats known open base models on MATH and supports tool use plus formal proving out of the box.

citing papers explorer

Showing 5 of 5 citing papers.