Title resolution pending

Hanyu Lai, Xiao Liu, Junjie Gao, Jiale Cheng, Zehan Qi, Yifan Xu, Shuntian Yao, Dan Zhang, Jinhua Du, Zhenyu Hou, et al. · 2025

1 Pith paper cites this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

cs.CL · 2026-04-06 · conditional · novelty 5.0

MegaTrain enables reliable full-precision training of up to 120B parameter LLMs on one H200 GPU with 1.5TB host memory via host-memory streaming, pipelined double-buffered execution, and stateless layer templates, achieving 1.84x throughput over DeepSpeed ZeRO-3 for 14B models.

citing papers explorer

Showing 1 of 1 citing paper.

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

cs.CL · 2026-04-06 · conditional · none · ref 5
