Full development of 7B and 32B Olmo 3 models used 12.3 GWh datacenter energy and emitted 4,251 tCO2eq, with development overheads accounting for 82% of compute and reasoning models costing 17x more to post-train than instruction-tuned ones.
Smith, Nicole DeCario, and Will Buchanan
4 Pith papers cite this work. Polarity classification is still indexing.
4
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
years
2026 4verdicts
UNVERDICTED 4roles
background 1polarities
background 1representative citing papers
US hyperscale data centers consumed 68-99 TWh electricity and emitted 37-54 Mt CO2, representing 1.8% of US electricity use with average carbon intensity 48% above the national grid average.
Qualitative studies show creatives prefer self-experimentation over structured guidance for GenAI image tools to preserve creative autonomy despite terminology barriers.
Empirical study finds diminishing accuracy returns against steep energy growth for deeper and wider ResNet speaker verification models on VoxCeleb2.
citing papers explorer
No citing papers match the current filters.