Trends in ai supercomputers,

· 2025 · arXiv 2504.16026

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

representative citing papers

LLMSpace: Carbon Footprint Modeling for Large Language Model Inference on LEO Satellites

cs.LG · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

LLMSpace is the first framework to jointly model operational and embodied carbon for LLM inference on LEO satellites, incorporating radiation-hardened hardware, peripheral systems, and workload patterns such as prefill-decode behavior.

On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning

cs.LG · 2025-07-09 · unverdicted · novelty 7.0

A single global merge at the final step of decentralized SGD matches the convergence rate of parallel SGD while improving test accuracy under high data heterogeneity.

StickyInvoc: Rethinking Task Models for High-throughput Workflows in the LLM Era

cs.DC · 2026-06-20 · unverdicted · novelty 6.0

StickyInvoc introduces sticky tasks that load LLM model state once and invocation tasks that reuse it, yielding 3.6x speedup on a 150k-inference workflow.

Communication-Semantic-Aware RDMA Loss Recovery for QP-scalable Hyperscale AI Training

cs.NI · 2026-05-08 · unverdicted · novelty 6.0

CSA-UD is a communication-semantic-aware unreliable datagram RDMA loss recovery mechanism that improves QP scalability and reduces 99th percentile flow completion times in hyperscale AI training collectives.

Switching Efficiency: A Novel Framework for Dissecting AI Data Center Network Efficiency

cs.NI · 2026-04-16 · unverdicted · novelty 6.0

Introduces Switching Efficiency (η) decomposed into data, routing efficiency, and port utilization factors to analyze and improve communication bottlenecks in AI data center networks for LLM training.

How Sovereign Is Sovereign Compute? A Review of 775 Non-U.S. Data Centers

cs.CY · 2025-07-30 · unverdicted · novelty 6.0

U.S. operators control 48% of non-U.S. data center projects by investment value, limiting digital sovereignty for host nations and offering the U.S. an additional governance tool for deployed AI infrastructure.

Extreme-Scale Interconnection Networks

cs.NI · 2026-05-26 · unverdicted · novelty 4.0

MRLS leaf-spine networks deliver 50% higher throughput than Fat-Tree and 100% higher than Dragonfly for All2All collectives with 100k endpoints via simulation evaluation.

citing papers explorer

Showing 1 of 1 citing paper after filters.

StickyInvoc: Rethinking Task Models for High-throughput Workflows in the LLM Era cs.DC · 2026-06-20 · unverdicted · none · ref 29
StickyInvoc introduces sticky tasks that load LLM model state once and invocation tasks that reuse it, yielding 3.6x speedup on a 150k-inference workflow.

Trends in ai supercomputers,

fields

years

verdicts

representative citing papers

citing papers explorer