Lightweight Diffusion Models for Resource-Constrained Semantic Communication

Danilo Comminiello; Eleonora Grassucci; Giordano Cicchetti; Giovanni Pignata

arxiv: 2410.02491 · v1 · pith:CGQIUCKInew · submitted 2024-10-03 · 📡 eess.SP

Lightweight Diffusion Models for Resource-Constrained Semantic Communication

Giovanni Pignata , Eleonora Grassucci , Giordano Cicchetti , Danilo Comminiello This is my paper

Pith reviewed 2026-05-23 20:21 UTC · model grok-4.3

classification 📡 eess.SP

keywords semantic communicationdiffusion modelspost-training quantizationresource-constrained devicesimage reconstructionchannel noise robustness

0 comments

The pith

A post-training quantized diffusion model regenerates images from semantic maps while cutting memory use by 75 percent and operations by 79 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Q-GESCO as a framework that applies post-training quantization to a semantic diffusion model so it can reconstruct images from received semantic information under channel noise. The goal is to keep generative performance close to the full-precision version while making the model small enough for devices with tight memory and compute limits. A reader would care because current generative semantic communication approaches are too heavy for many practical systems, and this method claims to remove that barrier without retraining. The reported results show the quantized model stays robust across noise types and scenarios while delivering the stated resource savings.

Core claim

Q-GESCO uses a quantized semantic diffusion model to regenerate transmitted images from received semantic maps. Post-training quantization lowers the memory footprint and computational load of the diffusion process. The resulting model matches the reconstruction quality of its full-precision version across different channel conditions and achieves up to 75 percent memory reduction together with 79 percent fewer floating-point operations.

What carries the argument

Post-training quantization applied to the semantic diffusion model, which lowers numerical precision to shrink memory and arithmetic cost while retaining image regeneration accuracy.

If this is right

Resource-constrained devices become able to run generative semantic communication without custom hardware.
The same quantized model works across multiple channel noise conditions without adjustments.
Image reconstruction from semantic maps remains feasible at substantially lower memory and compute cost.
No retraining step is required to obtain the reported savings and robustness.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be tested on other generative architectures beyond diffusion models used in communication.
Similar quantization might allow real-time semantic transmission of video on mobile hardware.
Edge devices could adopt the method to expand semantic communication into new low-power applications.

Load-bearing premise

Post-training quantization preserves the diffusion model's generative quality and its resistance to channel noise without any retraining or per-scenario tuning.

What would settle it

Running the quantized and full-precision models on the same test images and noise levels and finding a clear drop in reconstruction metrics for the quantized version would show the preservation claim does not hold.

Figures

Figures reproduced from arXiv: 2410.02491 by Danilo Comminiello, Eleonora Grassucci, Giordano Cicchetti, Giovanni Pignata.

**Figure 1.** Figure 1: Q-GESCO pipeline. introduces the core methods of Q-GESCO, experiments are conducted in Section III and conclusions are drawn in Section IV. II. QUANTIZING DIFFUSION MODELS FOR SEMANTIC COMMUNICATION In this work, we propose the Quantized GEnerative Semantic COmmunication (Q-GESCO) framework that relies on post-training quantization (PTQ) techniques for the semantic diffusion model. Specifically, we focus… view at source ↗

**Figure 2.** Figure 2: Sample results of Q-GESCO (right) compared to its full-precision [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

read the original abstract

Recently, generative semantic communication models have proliferated as they are revolutionizing semantic communication frameworks, improving their performance, and opening the way to novel applications. Despite their impressive ability to regenerate content from the compressed semantic information received, generative models pose crucial challenges for communication systems in terms of high memory footprints and heavy computational load. In this paper, we present a novel Quantized GEnerative Semantic COmmunication framework, Q-GESCO. The core method of Q-GESCO is a quantized semantic diffusion model capable of regenerating transmitted images from the received semantic maps while simultaneously reducing computational load and memory footprint thanks to the proposed post-training quantization technique. Q-GESCO is robust to different channel noises and obtains comparable performance to the full precision counterpart in different scenarios saving up to 75% memory and 79% floating point operations. This allows resource-constrained devices to exploit the generative capabilities of Q-GESCO, widening the range of applications and systems for generative semantic communication frameworks. The code is available at https://github.com/ispamm/Q-GESCO.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Q-GESCO shows post-training quantization can shrink diffusion models for semantic image communication enough to fit constrained devices while keeping noise robustness close to full precision.

read the letter

The main point is that a standard post-training quantization step applied to a semantic diffusion model delivers 75% memory reduction and 79% fewer floating-point operations with performance that stays comparable to the unquantized version across different channel conditions. The paper frames this as Q-GESCO and positions it as a way to bring generative semantic communication to edge hardware without retraining or scenario-specific tuning. Code release on GitHub is a practical plus that lets others check the numbers directly. The work stays within the existing semantic-communication-plus-diffusion line rather than introducing new theory or a first-principles method. What it does cleanly is demonstrate that the quantization transfers without breaking the regeneration quality or the claimed robustness to noise. The empirical focus on image reconstruction from received semantic maps matches the stated goal. The soft spot is that the abstract gives limited visibility into baselines, data splits, or how the savings were measured exactly, so the strength of the comparability claim depends on the full tables and ablations. If those hold up without post-hoc exclusions, the result is usable engineering progress. This paper is for people already working on resource-aware semantic systems who need a concrete implementation path rather than a broad theoretical shift. It is not a paradigm changer, but the combination is new enough and the savings large enough that it deserves a serious referee to check the experimental details and confirm the claims generalize.

Referee Report

2 major / 2 minor

Summary. The paper proposes Q-GESCO, a quantized generative semantic communication framework built around a post-training quantized semantic diffusion model. The model regenerates transmitted images from received semantic maps, with the quantization step intended to cut memory usage by up to 75 % and floating-point operations by up to 79 % while preserving generative quality and robustness to channel noise across scenarios.

Significance. If the empirical claims hold under rigorous controls, the work would demonstrate a practical route to deploying diffusion-based generative semantic communication on resource-limited hardware, directly addressing the memory and compute barriers that currently restrict such models.

major comments (2)

[Abstract and §4] Abstract and §4 (experimental results): the central claim of “comparable performance” and “robustness to different channel noises” is presented without any description of the evaluation protocol, baselines, metrics, data splits, number of runs, or error bars. Because the contribution is entirely empirical, this omission is load-bearing for the main result.
[§3] §3 (quantization method): the post-training quantization procedure is described at a high level but lacks the precise bit-width schedule, calibration dataset size, and any analysis of how quantization interacts with the diffusion sampling process or the semantic-map encoder. Without these details it is impossible to assess whether the reported savings are reproducible or scenario-specific.

minor comments (2)

The GitHub link is provided; confirming that the released code reproduces the tables and figures would strengthen the submission.
[§2] Notation for the quantized diffusion steps and the channel model should be introduced once and used consistently; several symbols appear without prior definition in the early sections.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments highlight important areas where additional clarity will strengthen the manuscript. We address each major comment below and will revise the paper accordingly.

read point-by-point responses

Referee: [Abstract and §4] Abstract and §4 (experimental results): the central claim of “comparable performance” and “robustness to different channel noises” is presented without any description of the evaluation protocol, baselines, metrics, data splits, number of runs, or error bars. Because the contribution is entirely empirical, this omission is load-bearing for the main result.

Authors: We agree that the evaluation protocol requires explicit description to support the empirical claims. In the revised manuscript we will expand §4 with a dedicated subsection detailing the full evaluation protocol, including the datasets and splits used, the baselines compared against, the metrics (e.g., FID, PSNR, SSIM, perceptual scores), the number of independent runs, and the reporting of mean ± standard deviation or error bars. We will also clarify how channel noise robustness was assessed across the tested SNR ranges and noise types. revision: yes
Referee: [§3] §3 (quantization method): the post-training quantization procedure is described at a high level but lacks the precise bit-width schedule, calibration dataset size, and any analysis of how quantization interacts with the diffusion sampling process or the semantic-map encoder. Without these details it is impossible to assess whether the reported savings are reproducible or scenario-specific.

Authors: We acknowledge the need for greater technical specificity. In the revised §3 we will provide the exact bit-width schedule (per-layer or uniform), the size and composition of the calibration dataset, the number of calibration samples, and a new analysis subsection examining the interaction between quantization and the diffusion sampling steps (including any observed effects on the reverse process variance or semantic-map encoder stability). These additions will allow readers to reproduce the reported memory and FLOP reductions. revision: yes

Circularity Check

0 steps flagged

No significant circularity; purely empirical contribution

full rationale

The manuscript describes an application of post-training quantization to a diffusion model for semantic communication (Q-GESCO). The abstract and available text contain no equations, derivations, fitted parameters presented as predictions, or self-citations that serve as load-bearing premises. All performance claims (memory/ops savings, robustness to channel noise, comparable generative quality) are framed as outcomes of experimental comparisons between the quantized model and its full-precision counterpart. No self-definitional loops, ansatz smuggling, or renaming of known results appear. The work is self-contained against external benchmarks via direct empirical evaluation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no equations, training details, or modeling choices; therefore no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5716 in / 980 out tokens · 21713 ms · 2026-05-23T20:21:16.083069+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages · 1 internal anchor

[1]

Ai empowered wireless communications: From bits to semantics,

Z. Qin, L. Liang, Z. Wang, S. Jin, X. Tao, W. Tong, and G. Y . Li, “Ai empowered wireless communications: From bits to semantics,” Proceedings of the IEEE , 2024

work page 2024
[2]

Enhancing semantic communication with deep generative models: An overview,

E. Grassucci, Y . Mitsufuji, P. Zhang, and D. Comminiello, “Enhancing semantic communication with deep generative models: An overview,” in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) , 2024

work page 2024
[3]

Generative AI meets semantic communication: Evolution and revolution of communication tasks,

E. Grassucci, J. Park, S. Barbarossa, S.-L. Kim, J. Choi, and D. Com- miniello, “Generative AI meets semantic communication: Evolution and revolution of communication tasks,” ArXiv preprint: arXiv:2401.06803 , 2024

work page arXiv 2024
[4]

Semantic communications based on adaptive generative models and information bottleneck,

S. Barbarossa, D. Comminiello, E. Grassucci, F. Pezone, S. Sardellitti, and P. Di Lorenzo, “Semantic communications based on adaptive generative models and information bottleneck,” IEEE Comm. Magazine , 2023

work page 2023
[5]

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

E. Grassucci, S. Barbarossa, and D. Comminiello, “Generative semantic communication: Diffusion models beyond bit recovery,” ArXiv preprint: ArXiv:2306.04321, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[6]

Language-oriented communication with semantic coding and knowledge distillation for text- to-image generation,

H. Nam, J. Park, J. Choi, M. Bennis, and S.-L. Kim, “Language-oriented communication with semantic coding and knowledge distillation for text- to-image generation,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2024

work page 2024
[7]

Language-oriented semantic latent representation for image transmission,

G. Cicchetti, E. Grassucci, J. Park, J. Choi, S. Barbarossa, and D. Com- miniello, “Language-oriented semantic latent representation for image transmission,” in IEEE Int. Workshop on Machine Learning for Signal Processing (MLSP), 2024

work page 2024
[8]

Diffusion-driven semantic communication for generative models with bandwidth constraints,

L. Guo, W. Chen, Y . Sun, B. Ai, N. Pappas, and T. Y . A. Quek, “Diffusion-driven semantic communication for generative models with bandwidth constraints,” ArXiv preprint: arXiv:2407.18468 , 2024

work page arXiv 2024
[9]

Dif- fusion models for audio semantic communication,

E. Grassucci, C. Marinoni, S. Barbarossa, and D. Comminiello, “Dif- fusion models for audio semantic communication,” in IEEE Int. Conf. on Audio, Speech, and Signal Process. (ICASSP) , 2024

work page 2024
[10]

Synchronous semantic communications for video and speech,

Y: Tian, J. Ying, Z. Qin, Y . Jin, and X. Tao, “Synchronous semantic communications for video and speech,” in IEEE Int. Conf. on Comm. (ICC), 2024

work page 2024
[11]

SG2SC: A generative semantic communication framework for scene understanding- oriented image transmission,

M. Yang, D. Gao, F. Xie, J. Li, X. Song, and G. Shi, “SG2SC: A generative semantic communication framework for scene understanding- oriented image transmission,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP) , 2024

work page 2024
[12]

DIFFSC: Semantic communication framework with enhanced denoising through diffusion probabilistic models,

Z. Jiang, X. Liu, G. Yang, W. Li, A. Li, and G. Wang, “DIFFSC: Semantic communication framework with enhanced denoising through diffusion probabilistic models,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP) , 2024

work page 2024
[13]

Rethinking multi-user semantic communications with deep generative models,

E. Grassucci, J. Choi, J. Park, R. F. Gramaccioni, G. Cicchetti, and D. Comminiello, “Rethinking multi-user semantic communications with deep generative models,” 2024

work page 2024
[14]

Resource-efficient generative mobile edge networks in 6g era: Funda- mentals, framework and case study,

B. Lai, J. Wen, J. Kang, H. Du, J. Nie, C. Yi, D. I. Kim, and S. Xie, “Resource-efficient generative mobile edge networks in 6g era: Funda- mentals, framework and case study,” IEEE Wireless Communications , vol. 31, no. 4, pp. 66–74, 2024

work page 2024
[15]

A unified framework for guiding generative ai with wireless perception in resource constrained mobile edge networks,

J. Wang, H. Du, D. Niyato, J. Kang, Z. Xiong, D. Rajan, S. Mao, and X. Shen, “A unified framework for guiding generative ai with wireless perception in resource constrained mobile edge networks,” IEEE Trans. on Mobile Computing , pp. 1–17, 2024

work page 2024
[16]

Diffusion models beat GANs on image synthesis,

P. Dhariwal and A. Nichol, “Diffusion models beat GANs on image synthesis,” in Advances in neural Information Processing (NeurIPS) , 2021

work page 2021
[17]

Semantic-aware power allocation for generative semantic communications with foundation models,

C. Xu, M. B. Mashhadi, Y . Ma, and R. Tafazolli, “Semantic-aware power allocation for generative semantic communications with foundation models,” ArXiv preprint: arXiv:2407.03050 , 2024

work page arXiv 2024
[18]

Latency-aware generative semantic communications with pre-trained diffusion models,

L. Qiao, M. B. Mashhadi, Z. Gao, C. H. Foh, P. Xiao, and M. Bennis, “Latency-aware generative semantic communications with pre-trained diffusion models,” IEEE Wireless Comm. Letters , 2024

work page 2024
[19]

Up or down? adaptive rounding for post-training quantization,

M. Nagel, R. A. Amjad, M. van Baalen, C. Louizos, and T. Blankevoort, “Up or down? adaptive rounding for post-training quantization,” in Int. Conf. on Machine Learning , 2021

work page 2021
[20]

BRECQ: Pushing the limit of post-training quantization by block reconstruction,

Y . Li, R. Gong, X. Tan, Y . Yang, P. Hu, Q. Zhang, F. Yu, W. Wang, and S. Gu, “BRECQ: Pushing the limit of post-training quantization by block reconstruction,” in Int. Conf. on learning Representation , 2021

work page 2021
[21]

Q-diffusion: Quantizing diffusion models,

X. Li, Y . Liu, L. Lian, H. Yang, Z. Dong, D. Kang, S. Zhang, and K. Keutzer, “Q-diffusion: Quantizing diffusion models,” in Int. Conf. on Computer Vision (ICCV) , 2023

work page 2023
[22]

Post-training quantization on diffusion models,

Y . Shang, Z. Yuan, B. Xie, B. Wu, and Y . Yan, “Post-training quantization on diffusion models,” in IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR) , 2023

work page 2023
[23]

Improved denoising diffusion probabilis- tic models,

A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilis- tic models,” in International Conference on Machine Learning (ICML) , 2021, pp. 8162—-8171

work page 2021
[24]

GPT3.int8(): 8-bit matrix multiplication for transformers at scale,

Tim Dettmers, Mike Lewis, Younes Belkada, and Luke Zettlemoyer, “GPT3.int8(): 8-bit matrix multiplication for transformers at scale,” in Advances in Neural Information Processing (NeurIPS) , 2022

work page 2022

[1] [1]

Ai empowered wireless communications: From bits to semantics,

Z. Qin, L. Liang, Z. Wang, S. Jin, X. Tao, W. Tong, and G. Y . Li, “Ai empowered wireless communications: From bits to semantics,” Proceedings of the IEEE , 2024

work page 2024

[2] [2]

Enhancing semantic communication with deep generative models: An overview,

E. Grassucci, Y . Mitsufuji, P. Zhang, and D. Comminiello, “Enhancing semantic communication with deep generative models: An overview,” in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) , 2024

work page 2024

[3] [3]

Generative AI meets semantic communication: Evolution and revolution of communication tasks,

E. Grassucci, J. Park, S. Barbarossa, S.-L. Kim, J. Choi, and D. Com- miniello, “Generative AI meets semantic communication: Evolution and revolution of communication tasks,” ArXiv preprint: arXiv:2401.06803 , 2024

work page arXiv 2024

[4] [4]

Semantic communications based on adaptive generative models and information bottleneck,

S. Barbarossa, D. Comminiello, E. Grassucci, F. Pezone, S. Sardellitti, and P. Di Lorenzo, “Semantic communications based on adaptive generative models and information bottleneck,” IEEE Comm. Magazine , 2023

work page 2023

[5] [5]

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

E. Grassucci, S. Barbarossa, and D. Comminiello, “Generative semantic communication: Diffusion models beyond bit recovery,” ArXiv preprint: ArXiv:2306.04321, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[6] [6]

Language-oriented communication with semantic coding and knowledge distillation for text- to-image generation,

H. Nam, J. Park, J. Choi, M. Bennis, and S.-L. Kim, “Language-oriented communication with semantic coding and knowledge distillation for text- to-image generation,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2024

work page 2024

[7] [7]

Language-oriented semantic latent representation for image transmission,

G. Cicchetti, E. Grassucci, J. Park, J. Choi, S. Barbarossa, and D. Com- miniello, “Language-oriented semantic latent representation for image transmission,” in IEEE Int. Workshop on Machine Learning for Signal Processing (MLSP), 2024

work page 2024

[8] [8]

Diffusion-driven semantic communication for generative models with bandwidth constraints,

L. Guo, W. Chen, Y . Sun, B. Ai, N. Pappas, and T. Y . A. Quek, “Diffusion-driven semantic communication for generative models with bandwidth constraints,” ArXiv preprint: arXiv:2407.18468 , 2024

work page arXiv 2024

[9] [9]

Dif- fusion models for audio semantic communication,

E. Grassucci, C. Marinoni, S. Barbarossa, and D. Comminiello, “Dif- fusion models for audio semantic communication,” in IEEE Int. Conf. on Audio, Speech, and Signal Process. (ICASSP) , 2024

work page 2024

[10] [10]

Synchronous semantic communications for video and speech,

Y: Tian, J. Ying, Z. Qin, Y . Jin, and X. Tao, “Synchronous semantic communications for video and speech,” in IEEE Int. Conf. on Comm. (ICC), 2024

work page 2024

[11] [11]

SG2SC: A generative semantic communication framework for scene understanding- oriented image transmission,

M. Yang, D. Gao, F. Xie, J. Li, X. Song, and G. Shi, “SG2SC: A generative semantic communication framework for scene understanding- oriented image transmission,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP) , 2024

work page 2024

[12] [12]

DIFFSC: Semantic communication framework with enhanced denoising through diffusion probabilistic models,

Z. Jiang, X. Liu, G. Yang, W. Li, A. Li, and G. Wang, “DIFFSC: Semantic communication framework with enhanced denoising through diffusion probabilistic models,” in IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP) , 2024

work page 2024

[13] [13]

Rethinking multi-user semantic communications with deep generative models,

E. Grassucci, J. Choi, J. Park, R. F. Gramaccioni, G. Cicchetti, and D. Comminiello, “Rethinking multi-user semantic communications with deep generative models,” 2024

work page 2024

[14] [14]

Resource-efficient generative mobile edge networks in 6g era: Funda- mentals, framework and case study,

B. Lai, J. Wen, J. Kang, H. Du, J. Nie, C. Yi, D. I. Kim, and S. Xie, “Resource-efficient generative mobile edge networks in 6g era: Funda- mentals, framework and case study,” IEEE Wireless Communications , vol. 31, no. 4, pp. 66–74, 2024

work page 2024

[15] [15]

A unified framework for guiding generative ai with wireless perception in resource constrained mobile edge networks,

J. Wang, H. Du, D. Niyato, J. Kang, Z. Xiong, D. Rajan, S. Mao, and X. Shen, “A unified framework for guiding generative ai with wireless perception in resource constrained mobile edge networks,” IEEE Trans. on Mobile Computing , pp. 1–17, 2024

work page 2024

[16] [16]

Diffusion models beat GANs on image synthesis,

P. Dhariwal and A. Nichol, “Diffusion models beat GANs on image synthesis,” in Advances in neural Information Processing (NeurIPS) , 2021

work page 2021

[17] [17]

Semantic-aware power allocation for generative semantic communications with foundation models,

C. Xu, M. B. Mashhadi, Y . Ma, and R. Tafazolli, “Semantic-aware power allocation for generative semantic communications with foundation models,” ArXiv preprint: arXiv:2407.03050 , 2024

work page arXiv 2024

[18] [18]

Latency-aware generative semantic communications with pre-trained diffusion models,

L. Qiao, M. B. Mashhadi, Z. Gao, C. H. Foh, P. Xiao, and M. Bennis, “Latency-aware generative semantic communications with pre-trained diffusion models,” IEEE Wireless Comm. Letters , 2024

work page 2024

[19] [19]

Up or down? adaptive rounding for post-training quantization,

M. Nagel, R. A. Amjad, M. van Baalen, C. Louizos, and T. Blankevoort, “Up or down? adaptive rounding for post-training quantization,” in Int. Conf. on Machine Learning , 2021

work page 2021

[20] [20]

BRECQ: Pushing the limit of post-training quantization by block reconstruction,

Y . Li, R. Gong, X. Tan, Y . Yang, P. Hu, Q. Zhang, F. Yu, W. Wang, and S. Gu, “BRECQ: Pushing the limit of post-training quantization by block reconstruction,” in Int. Conf. on learning Representation , 2021

work page 2021

[21] [21]

Q-diffusion: Quantizing diffusion models,

X. Li, Y . Liu, L. Lian, H. Yang, Z. Dong, D. Kang, S. Zhang, and K. Keutzer, “Q-diffusion: Quantizing diffusion models,” in Int. Conf. on Computer Vision (ICCV) , 2023

work page 2023

[22] [22]

Post-training quantization on diffusion models,

Y . Shang, Z. Yuan, B. Xie, B. Wu, and Y . Yan, “Post-training quantization on diffusion models,” in IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR) , 2023

work page 2023

[23] [23]

Improved denoising diffusion probabilis- tic models,

A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilis- tic models,” in International Conference on Machine Learning (ICML) , 2021, pp. 8162—-8171

work page 2021

[24] [24]

GPT3.int8(): 8-bit matrix multiplication for transformers at scale,

Tim Dettmers, Mike Lewis, Younes Belkada, and Luke Zettlemoyer, “GPT3.int8(): 8-bit matrix multiplication for transformers at scale,” in Advances in Neural Information Processing (NeurIPS) , 2022

work page 2022