pith. sign in

Zero: Memory optimiza- tions toward training trillion parameter models

7 Pith papers cite this work, alongside 603 external citations. Polarity classification is still indexing.

7 Pith papers citing it
603 external citations · external index

citation-role summary

method 1

citation-polarity summary

years

2026 7

roles

method 1

polarities

use method 1

representative citing papers

Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation

cs.AI · 2026-05-06 · unverdicted · novelty 6.0

A learned orchestration policy for LLM agents that jointly optimizes task decomposition and selective routing to (model, primitive) pairs, delivering 77% macro pass@1 at 10x lower cost than strong baselines across 13 benchmarks.

MeMo: Memory as a Model

cs.CL · 2026-05-14 · unverdicted · novelty 5.0 · 2 refs

MeMo encodes new knowledge into a separate memory model that integrates with frozen LLMs, showing strong performance on QA benchmarks while avoiding catastrophic forgetting and working without access to model weights.

citing papers explorer

Showing 7 of 7 citing papers.