pith. sign in

arxiv: 2111.00364 · v2 · pith:USXFPSZOnew · submitted 2021-10-30 · 💻 cs.LG · cs.AI· cs.AR

Sustainable AI: Environmental Implications, Challenges and Opportunities

classification 💻 cs.LG cs.AIcs.AR
keywords carbonfootprintacrosschallengescomputingcycledevelopmentenvironmental
0
0 comments X
read the original abstract

This paper explores the environmental impact of the super-linear growth trends for AI from a holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the carbon footprint of AI computing by examining the model development cycle across industry-scale machine learning use cases and, at the same time, considering the life cycle of system hardware. Taking a step further, we capture the operational and manufacturing carbon footprint of AI computing and present an end-to-end analysis for what and how hardware-software design and at-scale optimization can help reduce the overall carbon footprint of AI. Based on the industry experience and lessons learned, we share the key challenges and chart out important development directions across the many dimensions of AI. We hope the key messages and insights presented in this paper can inspire the community to advance the field of AI in an environmentally-responsible manner.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. WattLayer: Get Layers Right to Estimate Inference Energy of Neural Networks

    cs.LG 2026-06 unverdicted novelty 6.0

    WattLayer is a layer-wise energy estimation model achieving 19.6% median error on over 100k layers from 295 architectures across 3 tasks and 3 platforms, with generalization to new tasks via shared layers.

  2. The Energy Cost of Execution-Idle in GPU Clusters

    cs.DC 2026-04 unverdicted novelty 6.0

    Execution-idle accounts for 19.7% of GPU execution time and 10.7% of energy in a large cluster, motivating power management that treats it as a distinct operating state.

  3. MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning

    cs.CL 2022-05 unverdicted novelty 6.0

    MRKL is a modular neuro-symbolic architecture that integrates LLMs with external knowledge and discrete reasoning to overcome limitations of pure neural language models.

  4. Carbon-Aware Mapping and Scheduling for Deadline-Constrained Workflows

    cs.DC 2026-05 unverdicted novelty 5.0

    CWM combines dynamic programming and heuristics to achieve 42% median carbon cost reduction over CaWoSched for workflows when deadline is twice the carbon-agnostic makespan.

  5. Position: LLM Inference Should Be Evaluated as Energy-to-Token Production

    cs.CE 2026-05 unverdicted novelty 5.0

    LLM inference should be reframed and evaluated as energy-to-token production with a Token Production Function that accounts for power, cooling, and efficiency ceilings.

  6. Conditional Electrocardiogram Generation Using Hierarchical Variational Autoencoders

    eess.SP 2025-03 unverdicted novelty 5.0

    A publicly released conditional hierarchical VAE generates high-resolution multi-pathology ECGs and raises downstream AUROC by up to 2% over GAN baselines in transfer-learning tests.