Title resolution pending

Eldan, Ronen, Li, Yuanzhi , journal=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

cs.AI · 2026-05-15 · unverdicted · novelty 6.0

Multi-agent LLM systems discover new Transformer and hybrid architectures that outperform Llama 3.2 at 1B scale and approach human SOTA on long-range benchmarks.

Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching

cs.CL · 2026-05-13 · accept · novelty 5.0

At tiny scale, MoE transformers lower validation loss versus dense models when active parameters match but raise it when total stored parameters match.

citing papers explorer

Showing 2 of 2 citing papers.

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design cs.AI · 2026-05-15 · unverdicted · none · ref 65
Multi-agent LLM systems discover new Transformer and hybrid architectures that outperform Llama 3.2 at 1B scale and approach human SOTA on long-range benchmarks.
Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching cs.CL · 2026-05-13 · accept · none · ref 13
At tiny scale, MoE transformers lower validation loss versus dense models when active parameters match but raise it when total stored parameters match.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer