Sustainable AI: Environmental Implications, Challenges and Opportunities

Anastasia Melnikov; Anurag Gupta; Benjamin Lee; Bilge Acun; Bugra Akyildiz; Carole-Jean Wu; Charles Bai; David Brooks; Fiona Aga Behram; Geeta Chauhan

arxiv: 2111.00364 · v2 · pith:USXFPSZOnew · submitted 2021-10-30 · 💻 cs.LG · cs.AI· cs.AR

Sustainable AI: Environmental Implications, Challenges and Opportunities

Carole-Jean Wu , Ramya Raghavendra , Udit Gupta , Bilge Acun , Newsha Ardalani , Kiwan Maeng , Gloria Chang , Fiona Aga Behram

show 17 more authors

James Huang Charles Bai Michael Gschwind Anurag Gupta Myle Ott Anastasia Melnikov Salvatore Candido David Brooks Geeta Chauhan Benjamin Lee Hsien-Hsin S. Lee Bugra Akyildiz Maximilian Balandat Joe Spisak Ravi Jain Mike Rabbat Kim Hazelwood

This is my paper

classification 💻 cs.LG cs.AIcs.AR

keywords carbonfootprintacrosschallengescomputingcycledevelopmentenvironmental

0 comments

read the original abstract

This paper explores the environmental impact of the super-linear growth trends for AI from a holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the carbon footprint of AI computing by examining the model development cycle across industry-scale machine learning use cases and, at the same time, considering the life cycle of system hardware. Taking a step further, we capture the operational and manufacturing carbon footprint of AI computing and present an end-to-end analysis for what and how hardware-software design and at-scale optimization can help reduce the overall carbon footprint of AI. Based on the industry experience and lessons learned, we share the key challenges and chart out important development directions across the many dimensions of AI. We hope the key messages and insights presented in this paper can inspire the community to advance the field of AI in an environmentally-responsible manner.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

WattLayer: Get Layers Right to Estimate Inference Energy of Neural Networks
cs.LG 2026-06 unverdicted novelty 6.0

WattLayer is a layer-wise energy estimation model achieving 19.6% median error on over 100k layers from 295 architectures across 3 tasks and 3 platforms, with generalization to new tasks via shared layers.
The Energy Cost of Execution-Idle in GPU Clusters
cs.DC 2026-04 unverdicted novelty 6.0

Execution-idle accounts for 19.7% of GPU execution time and 10.7% of energy in a large cluster, motivating power management that treats it as a distinct operating state.
MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning
cs.CL 2022-05 unverdicted novelty 6.0

MRKL is a modular neuro-symbolic architecture that integrates LLMs with external knowledge and discrete reasoning to overcome limitations of pure neural language models.
Carbon-Aware Mapping and Scheduling for Deadline-Constrained Workflows
cs.DC 2026-05 unverdicted novelty 5.0

CWM combines dynamic programming and heuristics to achieve 42% median carbon cost reduction over CaWoSched for workflows when deadline is twice the carbon-agnostic makespan.
Position: LLM Inference Should Be Evaluated as Energy-to-Token Production
cs.CE 2026-05 unverdicted novelty 5.0

LLM inference should be reframed and evaluated as energy-to-token production with a Token Production Function that accounts for power, cooling, and efficiency ceilings.
Conditional Electrocardiogram Generation Using Hierarchical Variational Autoencoders
eess.SP 2025-03 unverdicted novelty 5.0

A publicly released conditional hierarchical VAE generates high-resolution multi-pathology ECGs and raises downstream AUROC by up to 2% over GAN baselines in transfer-learning tests.