Reducing energy bloat in large model training

Qiu, Yiming; Kon, Patrick Tser Jern; Beckett, Ryan; Chen, Ang · Nov 2024 · arXiv 4715.369597

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Feedback-Driven Execution for LLM-Based Binary Analysis

cs.CR · 2026-04-16 · unverdicted · novelty 7.0

FORGE uses a reasoning-action-observation loop and Dynamic Forest of Agents to perform scalable LLM-based binary analysis, finding 1,274 vulnerabilities across 591 of 3,457 real-world firmware binaries at 72.3% precision and broader coverage than prior methods.

Designing Datacenter Power Delivery Hierarchies for the AI Era

cs.DC · 2026-05-15 · unverdicted · novelty 6.0

Develops a simulation framework showing multi-resource stranding changes deployable capacity and effective costs in AI datacenters, arguing the key metric is deployable capacity over time rather than installed megawatts.

EasyRider: Mitigating Power Transients in Datacenter-Scale Training Workloads

cs.AR · 2026-04-16 · unverdicted · novelty 6.0

EasyRider uses passive components plus actively controlled energy storage at the rack level, paired with lifetime-maximizing software, to keep AI training power transients inside grid safety limits without code changes or energy waste.

Amoeba: Runtime Tensor Parallel Transformation for LLM Inference Services

cs.DC · 2025-09-24 · unverdicted · novelty 6.0

Amoeba adaptively adjusts tensor parallelism at runtime for LLM inference services to handle mixed short and long context requests, delivering 1.75x-6.57x throughput gains over prior solutions in real-world trace evaluations.

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

eess.SY · 2026-05-06 · unverdicted · novelty 5.0

The paper introduces Experiment-as-Code Labs as a declarative stack synthesizing AI agents, systems orchestration, and physical lab control for AI-driven discovery.

EnergAIzer: Fast and Accurate GPU Power Estimation Framework for AI Workloads

cs.AR · 2026-04-22 · unverdicted · novelty 5.0

EnergAIzer predicts module-level GPU utilization from structured kernel patterns and feeds it into a power model to estimate dynamic power with 8% error on Ampere GPUs and 7% on H100 forecasts.

citing papers explorer

Feedback-Driven Execution for LLM-Based Binary Analysis cs.CR · 2026-04-16 · unverdicted · none · ref 38
Designing Datacenter Power Delivery Hierarchies for the AI Era cs.DC · 2026-05-15 · unverdicted · none · ref 12
EasyRider: Mitigating Power Transients in Datacenter-Scale Training Workloads cs.AR · 2026-04-16 · unverdicted · none · ref 15
Amoeba: Runtime Tensor Parallel Transformation for LLM Inference Services cs.DC · 2025-09-24 · unverdicted · none · ref 21
Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery eess.SY · 2026-05-06 · unverdicted · none · ref 113
EnergAIzer: Fast and Accurate GPU Power Estimation Framework for AI Workloads cs.AR · 2026-04-22 · unverdicted · none · ref 13
