Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study Insights

Sedef Akinli Kocak; Shaina Raza; Soroor Motie; Tahniat Khan

arxiv: 2504.06307 · v2 · submitted 2025-04-07 · 💻 cs.LG · cs.AI

Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study Insights

Tahniat Khan , Soroor Motie , Sedef Akinli Kocak , Shaina Raza This is my paper

Pith reviewed 2026-05-22 20:18 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords large language modelsquantizationenergy efficiencycarbon emissionssustainabilitylocal inferenceoptimization techniques

0 comments

The pith

Quantization and local inference reduce LLM energy use and emissions by up to 45 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines how to deploy large language models more sustainably by combining quantization with local inference. It presents a case study and framework that apply these techniques to lower energy demands and carbon output while preserving accuracy and speed. A sympathetic reader would care because expanding LLM use creates measurable environmental costs, and the work identifies a concrete path to cut those costs in settings with limited computing power. The results supply practical guidance for balancing performance against resource limits.

Core claim

The integration of strategic quantization and local inference techniques substantially lowers the carbon footprints of LLMs without compromising their operational effectiveness, with experimental results showing reductions in energy consumption and carbon emissions by up to 45% post quantization, making the approach suitable for resource-constrained environments.

What carries the argument

Strategic quantization paired with local inference, which shrinks model size and shifts computation to on-device processing to cut energy requirements.

If this is right

LLMs can operate effectively on devices with limited power or bandwidth.
Deployments can achieve lower overall carbon output while retaining responsiveness.
Organizations gain a tested route to meet both accuracy targets and sustainability targets.
Measurement frameworks for LLM efficiency receive direct experimental support from the case study.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same quantization steps could be tested on other generative models such as diffusion or multimodal systems to check for comparable savings.
Hardware vendors might prioritize accelerators that support quantized inference to amplify the reported gains.
Repeated case studies across different geographic regions could reveal how local electricity grids affect the net emissions reduction.

Load-bearing premise

The case study measurements of energy and emissions accurately represent typical LLM usage without selective conditions or models that favor the reported reductions.

What would settle it

Re-running the energy and emissions measurements on a broader set of LLMs and hardware platforms that yields an average reduction below 20 percent would falsify the up-to-45-percent claim.

Figures

Figures reproduced from arXiv: 2504.06307 by Sedef Akinli Kocak, Shaina Raza, Soroor Motie, Tahniat Khan.

**Figure 2.** Figure 2: Sentiment Assessment Instructions and Indicators Checklist. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Key Examples of Sentiment Analysis Experiments [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

read the original abstract

The rapid adoption of large language models (LLMs) has led to significant energy consumption and carbon emissions, posing a critical challenge to the sustainability of generative AI technologies. This paper explores the integration of energy-efficient optimization techniques in the deployment of LLMs to address these environmental concerns. We present a case study and framework that demonstrate how strategic quantization and local inference techniques can substantially lower the carbon footprints of LLMs without compromising their operational effectiveness. Experimental results reveal that these methods can reduce energy consumption and carbon emissions by up to 45\% post quantization, making them particularly suitable for resource-constrained environments. The findings provide actionable insights for achieving sustainability in AI while maintaining high levels of accuracy and responsiveness.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a case study applying standard quantization and local inference to LLMs with a 45% energy savings claim, but the measurements lack any described protocol or baselines.

read the letter

The paper's main takeaway is a case study claiming that quantization and local inference can cut energy consumption and carbon emissions in LLMs by up to 45%. The underlying techniques are established ones, so this is more of an application report than a novel advance. It does well at highlighting the sustainability challenges with generative AI and pointing to practical optimizations for constrained environments. That framing could be useful for teams focused on deployment rather than pure research. The soft spots are in the evidence. The headline result rests on experimental findings, yet the paper provides little on the measurement protocol, hardware setup, model specifics, or baseline conditions. This absence makes it tough to judge if the savings are reliable or representative. The work doesn't include detailed tables, error analysis, or comparisons that would make the claim more solid. No new math or formal results appear here, and the citations follow standard patterns without relying on anything particularly groundbreaking. This would suit readers in industry or policy who are scanning for ideas on greener AI systems. Someone needing rigorous, reproducible data would likely look elsewhere. I would recommend putting it through peer review. The topic has real-world weight, and referees could guide the authors toward clearer reporting that turns the case study into something more dependable.

Referee Report

2 major / 1 minor

Summary. The paper claims that integrating quantization and local inference techniques into LLM deployment can reduce energy consumption and carbon emissions by up to 45% without compromising accuracy or responsiveness, based on a presented case study and framework for sustainable AI in resource-constrained settings.

Significance. If the 45% reduction claim holds under reproducible conditions, the work would offer practical, actionable guidance for lowering the environmental footprint of LLMs. The absence of any described measurement protocol, however, prevents evaluation of whether the result is representative or load-bearing.

major comments (2)

[Abstract] Abstract: the headline empirical claim ('reduce energy consumption and carbon emissions by up to 45% post quantization') is presented with no accompanying description of the experimental protocol, hardware platform, model(s) tested, quantization bit-widths, inference workload, or baseline configuration. This information is required to assess the central result.
[Case Study] Case study section (implied by abstract): no details are supplied on how energy or emissions were measured (tool, scope of measurement, averaging procedure, or carbon-intensity assumptions), nor are any tables or figures showing raw values, error bars, or comparisons provided. Without these, the 45% figure cannot be verified or replicated.

minor comments (1)

The abstract refers to both 'a case study and framework' yet does not clarify how the framework was used to generate or validate the reported percentage.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for highlighting the need for greater transparency in our experimental reporting. We agree that the current manuscript lacks sufficient detail on protocols and measurements to allow verification of the 45% reduction claim, and we will revise accordingly to strengthen the work.

read point-by-point responses

Referee: [Abstract] Abstract: the headline empirical claim ('reduce energy consumption and carbon emissions by up to 45% post quantization') is presented with no accompanying description of the experimental protocol, hardware platform, model(s) tested, quantization bit-widths, inference workload, or baseline configuration. This information is required to assess the central result.

Authors: We accept this criticism. The abstract will be revised to briefly specify the models evaluated, quantization bit-widths (e.g., 4-bit and 8-bit), hardware platforms, inference workloads, and baseline configurations. Full methodological details will be expanded in the main text. revision: yes
Referee: [Case Study] Case study section (implied by abstract): no details are supplied on how energy or emissions were measured (tool, scope of measurement, averaging procedure, or carbon-intensity assumptions), nor are any tables or figures showing raw values, error bars, or comparisons provided. Without these, the 45% figure cannot be verified or replicated.

Authors: We agree the measurement protocol is insufficiently described. The revised manuscript will add a dedicated methods subsection detailing the energy measurement tools, measurement scope, averaging procedures, carbon-intensity assumptions, and will include new tables and figures presenting raw values, error bars, and baseline comparisons. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical case study reports measurements without derivations or self-referential reductions

full rationale

The paper's central claim is an empirical observation from a case study that quantization reduces energy and emissions by up to 45%. The abstract and provided text contain no equations, parameter-fitting steps presented as predictions, uniqueness theorems, or self-citations that bear load on the result. The derivation chain is therefore a direct reporting of experimental outcomes rather than any reduction of outputs to inputs by construction, satisfying the criteria for a self-contained empirical paper with score 0.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no extractable free parameters, axioms, or invented entities; the central claim rests on an unreported case-study measurement whose validity cannot be audited.

pith-pipeline@v0.9.0 · 5653 in / 937 out tokens · 51528 ms · 2026-05-22T20:18:30.710624+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Experimental results reveal that these methods can reduce energy consumption and carbon emissions by up to 45% post quantization
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We use Ollama for local AI model deployment... 4-bit quantization strategy (b = 4)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid
cs.CY 2025-11 unverdicted novelty 4.0

G-TRACE quantifies region-aware GenAI emissions and estimates 4,309 MWh energy use plus 2,068 tCO2 from the Ghibli-style image generation trend, paired with the AI Sustainability Pyramid for translating metrics into policy.
Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid
cs.CY 2025-11 unverdicted novelty 4.0

G-TRACE provides region-aware estimates of GenAI carbon emissions including 4309 MWh and 2068 tCO2 for a 2024-2025 image generation trend, paired with a seven-level AI Sustainability Pyramid for policy guidance.

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages · cited by 1 Pith paper · 3 internal anchors

[1]

Llama 3.2 model card, 2024

Meta AI. Llama 3.2 model card, 2024. Accessed: 2024-09-25

work page 2024
[2]

Towards green ai

Enrico Barbierato and Alice Gatti. Towards green ai. a methodological survey of the scientific literature. IEEE Access, 2024

work page 2024
[3]

Lessons learned from developing a sus- tainability awareness framework for software engineering using design science

Stefanie Betz, Birgit Penzenstadler, Leticia Duboc, Ruzanna Chitchyan, Sedef Akinli Kocak, Ian Brooks, Shola Oyedeji, Jari Porras, Norbert Seyff, and Colin C Venters. Lessons learned from developing a sus- tainability awareness framework for software engineering using design science. ACM Transactions on Software Engineering and Methodology , 33(5):1–39, 2024

work page 2024
[4]

A review of green artificial intelligence: Towards a more sustainable future.Neurocomputing, page 128096, 2024

Ver ´onica Bol ´on-Canedo, Laura Mor ´an-Fern´andez, Brais Cancela, and Amparo Alonso-Betanzos. A review of green artificial intelligence: Towards a more sustainable future.Neurocomputing, page 128096, 2024

work page 2024
[5]

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

Lingjiao Chen, Matei Zaharia, and James Zou. Frugalgpt: How to use large language models while reducing cost and improving performance. arXiv preprint arXiv:2305.05176 , 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[6]

Frugalml: How to use ml prediction apis more accurately and cheaply

Lingjiao Chen, Matei Zaharia, and James Y Zou. Frugalml: How to use ml prediction apis more accurately and cheaply. Advances in neural information processing systems , 33:10685–10696, 2020

work page 2020
[7]

Do we really know what we are building? raising awareness of potential sustainability effects of software systems in requirements engineering

Leticia Duboc, Stefanie Betz, Birgit Penzenstadler, Sedef Akinli Kocak, Ruzanna Chitchyan, Ola Leifler, Jari Porras, Norbert Seyff, and Colin C Venters. Do we really know what we are building? raising awareness of potential sustainability effects of software systems in requirements engineering. In 2019 IEEE 27th international requirements engineering conf...

work page 2019
[8]

Qwen technical report, 2023

Jinze Bai et al. Qwen technical report, 2023. Accessed: 2024-09-25

work page 2023
[9]

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Elias Frantar, Saleh Ashkboos, Torsten Hoefler, and Dan Alistarh. Gptq: Accurate post-training quantization for generative pre-trained transformers. arXiv preprint arXiv:2210.17323 , 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022
[10]

Energy cost modelling for optimizing large language model inference on hardware accelerators

Robin Geens, Man Shi, Arne Symons, Chao Fang, and Marian Verhelst. Energy cost modelling for optimizing large language model inference on hardware accelerators. In 2024 IEEE 37th International System-on-Chip Conference (SOCC), pages 1–6. IEEE, 2024

work page 2024
[11]

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Song Han, Huizi Mao, and William J Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 , 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[12]

Towards green ai in fine-tuning large language models via adaptive backpropagation

Kai Huang, Hanyun Yin, Heng Huang, and Wei Gao. Towards green ai in fine-tuning large language models via adaptive backpropagation. arXiv preprint arXiv:2309.13192 , 2023

work page arXiv 2023
[13]

Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, L ´elio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timoth ´ee Lacroix, and William El Sayed. Mistral 7b,

work page
[14]

Accessed: 2024-09-25

work page 2024
[15]

Visual instruction tuning, 2023

Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. Visual instruction tuning, 2023. Accessed: 2024-09-25

work page 2023
[16]

Green ai: exploring carbon footprints, mitigation strategies, and trade offs in large language model training

Vivian Liu and Yiqiao Yin. Green ai: exploring carbon footprints, mitigation strategies, and trade offs in large language model training. Discover Artificial Intelligence, 4(1):49, 2024

work page 2024
[17]

Llm-qat: Data-free quantization aware training for large language models

Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, and Vikas Chandra. Llm-qat: Data-free quantization aware training for large language models. arXiv preprint arXiv:2305.17888 , 2023

work page arXiv 2023
[18]

Estimating the carbon footprint of bloom, a 176b parameter language model

Alexandra Sasha Luccioni, Sylvain Viguier, and Anne-Laure Ligozat. Estimating the carbon footprint of bloom, a 176b parameter language model. Journal of Machine Learning Research , 24(253):1–15, 2023

work page 2023
[19]

Good debt or bad debt: Detecting semantic orientations in economic texts

Pekka Malo, Ankur Sinha, Pekka Korhonen, Jyrki Wallenius, and Pyry Takala. Good debt or bad debt: Detecting semantic orientations in economic texts. Journal of the Association for Information Science and Technology, 65(4):782–796, 2014

work page 2014
[20]

Ollama: Ai-powered insights for language models,

Ollama Technologies. Ollama: Ai-powered insights for language models,

work page
[21]

Accessed: [Access Date]

work page
[22]

Developing safe and responsible large language models–a comprehensive framework

Shaina Raza, Oluwanifemi Bamgbose, Shardul Ghuge, Fatemeh Tavakoli, and Deepak John Reji. Developing safe and responsible large language models–a comprehensive framework. arXiv preprint arXiv:2404.01399, 2024

work page arXiv 2024
[23]

Phi-3 technical report: A highly capable language model locally on your phone, 2024

Microsoft AI Research. Phi-3 technical report: A highly capable language model locally on your phone, 2024. Accessed: 2024-09-25

work page 2024
[24]

Towards optimizing the costs of llm usage

Shivanshu Shekhar, Tanishq Dubey, Koyel Mukherjee, Apoorv Saxena, Atharv Tyagi, and Nishanth Kotla. Towards optimizing the costs of llm usage. arXiv preprint arXiv:2402.01742 , 2024

work page arXiv 2024
[25]

Greening large language models of code

Jieke Shi, Zhou Yang, Hong Jin Kang, Bowen Xu, Junda He, and David Lo. Greening large language models of code. In Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Society , pages 142–153, 2024

work page 2024
[26]

Efficient and green large language models for software engineering: Vision and the road ahead

Jieke Shi, Zhou Yang, and David Lo. Efficient and green large language models for software engineering: Vision and the road ahead. ACM Transactions on Software Engineering and Methodology , 2024

work page 2024
[27]

To- wards sustainable ai: a comprehensive framework for green ai

Abdulaziz Tabbakh, Lisan Al Amin, Mahbubul Islam, GM Iqbal Mah- mud, Imranul Kabir Chowdhury, and Md Saddam Hossain Mukta. To- wards sustainable ai: a comprehensive framework for green ai. Discover Sustainability, 5(1):408, 2024

work page 2024
[28]

Software sustainability: beyond the tower of babel

Colin C Venters, Sedef Akinli Kocak, Stefanie Betz, Ian Brooks, Rafael Capilla, Ruzanna Chitchyan, Let ´ıcia Duboc, Rogardt Heldal, Ana Moreira, Shola Oyedeji, et al. Software sustainability: beyond the tower of babel. In 2021 IEEE/ACM International Workshop on Body of Knowledge for Software Sustainability (BoKSS), pages 3–4. IEEE, 2021

work page 2021
[29]

A systematic review of green ai

Roberto Verdecchia, June Sallou, and Lu ´ıs Cruz. A systematic review of green ai. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 13(4):e1507, 2023

work page 2023

[1] [1]

Llama 3.2 model card, 2024

Meta AI. Llama 3.2 model card, 2024. Accessed: 2024-09-25

work page 2024

[2] [2]

Towards green ai

Enrico Barbierato and Alice Gatti. Towards green ai. a methodological survey of the scientific literature. IEEE Access, 2024

work page 2024

[3] [3]

Lessons learned from developing a sus- tainability awareness framework for software engineering using design science

Stefanie Betz, Birgit Penzenstadler, Leticia Duboc, Ruzanna Chitchyan, Sedef Akinli Kocak, Ian Brooks, Shola Oyedeji, Jari Porras, Norbert Seyff, and Colin C Venters. Lessons learned from developing a sus- tainability awareness framework for software engineering using design science. ACM Transactions on Software Engineering and Methodology , 33(5):1–39, 2024

work page 2024

[4] [4]

A review of green artificial intelligence: Towards a more sustainable future.Neurocomputing, page 128096, 2024

Ver ´onica Bol ´on-Canedo, Laura Mor ´an-Fern´andez, Brais Cancela, and Amparo Alonso-Betanzos. A review of green artificial intelligence: Towards a more sustainable future.Neurocomputing, page 128096, 2024

work page 2024

[5] [5]

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

Lingjiao Chen, Matei Zaharia, and James Zou. Frugalgpt: How to use large language models while reducing cost and improving performance. arXiv preprint arXiv:2305.05176 , 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[6] [6]

Frugalml: How to use ml prediction apis more accurately and cheaply

Lingjiao Chen, Matei Zaharia, and James Y Zou. Frugalml: How to use ml prediction apis more accurately and cheaply. Advances in neural information processing systems , 33:10685–10696, 2020

work page 2020

[7] [7]

Do we really know what we are building? raising awareness of potential sustainability effects of software systems in requirements engineering

Leticia Duboc, Stefanie Betz, Birgit Penzenstadler, Sedef Akinli Kocak, Ruzanna Chitchyan, Ola Leifler, Jari Porras, Norbert Seyff, and Colin C Venters. Do we really know what we are building? raising awareness of potential sustainability effects of software systems in requirements engineering. In 2019 IEEE 27th international requirements engineering conf...

work page 2019

[8] [8]

Qwen technical report, 2023

Jinze Bai et al. Qwen technical report, 2023. Accessed: 2024-09-25

work page 2023

[9] [9]

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Elias Frantar, Saleh Ashkboos, Torsten Hoefler, and Dan Alistarh. Gptq: Accurate post-training quantization for generative pre-trained transformers. arXiv preprint arXiv:2210.17323 , 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[10] [10]

Energy cost modelling for optimizing large language model inference on hardware accelerators

Robin Geens, Man Shi, Arne Symons, Chao Fang, and Marian Verhelst. Energy cost modelling for optimizing large language model inference on hardware accelerators. In 2024 IEEE 37th International System-on-Chip Conference (SOCC), pages 1–6. IEEE, 2024

work page 2024

[11] [11]

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Song Han, Huizi Mao, and William J Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 , 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[12] [12]

Towards green ai in fine-tuning large language models via adaptive backpropagation

Kai Huang, Hanyun Yin, Heng Huang, and Wei Gao. Towards green ai in fine-tuning large language models via adaptive backpropagation. arXiv preprint arXiv:2309.13192 , 2023

work page arXiv 2023

[13] [13]

Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, L ´elio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timoth ´ee Lacroix, and William El Sayed. Mistral 7b,

work page

[14] [14]

Accessed: 2024-09-25

work page 2024

[15] [15]

Visual instruction tuning, 2023

Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. Visual instruction tuning, 2023. Accessed: 2024-09-25

work page 2023

[16] [16]

Green ai: exploring carbon footprints, mitigation strategies, and trade offs in large language model training

Vivian Liu and Yiqiao Yin. Green ai: exploring carbon footprints, mitigation strategies, and trade offs in large language model training. Discover Artificial Intelligence, 4(1):49, 2024

work page 2024

[17] [17]

Llm-qat: Data-free quantization aware training for large language models

Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, and Vikas Chandra. Llm-qat: Data-free quantization aware training for large language models. arXiv preprint arXiv:2305.17888 , 2023

work page arXiv 2023

[18] [18]

Estimating the carbon footprint of bloom, a 176b parameter language model

Alexandra Sasha Luccioni, Sylvain Viguier, and Anne-Laure Ligozat. Estimating the carbon footprint of bloom, a 176b parameter language model. Journal of Machine Learning Research , 24(253):1–15, 2023

work page 2023

[19] [19]

Good debt or bad debt: Detecting semantic orientations in economic texts

Pekka Malo, Ankur Sinha, Pekka Korhonen, Jyrki Wallenius, and Pyry Takala. Good debt or bad debt: Detecting semantic orientations in economic texts. Journal of the Association for Information Science and Technology, 65(4):782–796, 2014

work page 2014

[20] [20]

Ollama: Ai-powered insights for language models,

Ollama Technologies. Ollama: Ai-powered insights for language models,

work page

[21] [21]

Accessed: [Access Date]

work page

[22] [22]

Developing safe and responsible large language models–a comprehensive framework

Shaina Raza, Oluwanifemi Bamgbose, Shardul Ghuge, Fatemeh Tavakoli, and Deepak John Reji. Developing safe and responsible large language models–a comprehensive framework. arXiv preprint arXiv:2404.01399, 2024

work page arXiv 2024

[23] [23]

Phi-3 technical report: A highly capable language model locally on your phone, 2024

Microsoft AI Research. Phi-3 technical report: A highly capable language model locally on your phone, 2024. Accessed: 2024-09-25

work page 2024

[24] [24]

Towards optimizing the costs of llm usage

Shivanshu Shekhar, Tanishq Dubey, Koyel Mukherjee, Apoorv Saxena, Atharv Tyagi, and Nishanth Kotla. Towards optimizing the costs of llm usage. arXiv preprint arXiv:2402.01742 , 2024

work page arXiv 2024

[25] [25]

Greening large language models of code

Jieke Shi, Zhou Yang, Hong Jin Kang, Bowen Xu, Junda He, and David Lo. Greening large language models of code. In Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Society , pages 142–153, 2024

work page 2024

[26] [26]

Efficient and green large language models for software engineering: Vision and the road ahead

Jieke Shi, Zhou Yang, and David Lo. Efficient and green large language models for software engineering: Vision and the road ahead. ACM Transactions on Software Engineering and Methodology , 2024

work page 2024

[27] [27]

To- wards sustainable ai: a comprehensive framework for green ai

Abdulaziz Tabbakh, Lisan Al Amin, Mahbubul Islam, GM Iqbal Mah- mud, Imranul Kabir Chowdhury, and Md Saddam Hossain Mukta. To- wards sustainable ai: a comprehensive framework for green ai. Discover Sustainability, 5(1):408, 2024

work page 2024

[28] [28]

Software sustainability: beyond the tower of babel

Colin C Venters, Sedef Akinli Kocak, Stefanie Betz, Ian Brooks, Rafael Capilla, Ruzanna Chitchyan, Let ´ıcia Duboc, Rogardt Heldal, Ana Moreira, Shola Oyedeji, et al. Software sustainability: beyond the tower of babel. In 2021 IEEE/ACM International Workshop on Body of Knowledge for Software Sustainability (BoKSS), pages 3–4. IEEE, 2021

work page 2021

[29] [29]

A systematic review of green ai

Roberto Verdecchia, June Sallou, and Lu ´ıs Cruz. A systematic review of green ai. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 13(4):e1507, 2023

work page 2023