Rate-Distortion-Classification Representation Theory for Bernoulli Sources

Bella Bose; Nam Nguyen; Thinh Nguyen

arxiv: 2601.11919 · v2 · pith:SDLTSK34new · submitted 2026-01-17 · 💻 cs.IT · math.IT

Rate-Distortion-Classification Representation Theory for Bernoulli Sources

Nam Nguyen , Thinh Nguyen , Bella Bose This is my paper

Pith reviewed 2026-05-21 15:56 UTC · model grok-4.3

classification 💻 cs.IT math.IT

keywords rate-distortion-classificationBernoulli sourceHamming distortionlinear programuniversal encodertask-oriented compressionone-shot tradeoff

0 comments

The pith

Closed-form one-shot RDC and DRC tradeoffs are derived for Bernoulli sources with Hamming distortion under binary symmetric classification coupling.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines task-oriented lossy compression for Bernoulli sources by introducing rate-distortion-classification representations. It first obtains closed-form characterizations of the one-shot RDC tradeoff and its dual DRC version using a common-randomness formulation. A representation-based approach then reduces the achievable distortion-classification region for any fixed representation to the solution of a linear program that traces its lower boundary. Finally the work supplies computable lower and upper bounds on the minimum asymptotic rate needed by universal encoders that must serve an entire family of distortion-classification operating points.

Core claim

Building on the one-shot common-randomness formulation, the paper derives closed-form characterizations of the one-shot RDC and the dual DRC tradeoffs for a Bernoulli source under Hamming distortion when the binary classification variable is coupled to the source by a binary symmetric model. It further characterizes the achievable distortion-classification region induced by any fixed representation by deriving the lower boundary of that region via a linear program, and it obtains computable lower and upper bounds on the minimum asymptotic rate required by universal encoders that support a family of DC operating points, thereby quantifying the associated rate penalty.

What carries the argument

The linear program that computes the lower boundary of the achievable distortion-classification region induced by any fixed representation.

If this is right

The closed-form RDC and DRC expressions permit exact evaluation of the one-shot tradeoffs without iterative optimization.
Any chosen representation yields an explicitly computable lower boundary for its achievable distortion-classification region through the linear program.
The derived bounds quantify the exact rate penalty incurred when a single encoder must support an entire family of distortion-classification operating points.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The closed-form results could guide the construction of practical encoders when classification accuracy and reconstruction fidelity must be traded against each other.
The linear-program formulation may extend to other discrete memoryless sources once the binary-symmetric coupling assumption is relaxed.
Bounds on universal rates suggest a concrete way to measure the overhead of building flexible representations that remain useful across changing task requirements.

Load-bearing premise

The binary classification variable is coupled to the source via a binary symmetric model.

What would settle it

A direct numerical computation of the one-shot RDC tradeoff for concrete parameter values that deviates from the claimed closed-form expression would refute the characterization.

Figures

Figures reproduced from arXiv: 2601.11919 by Bella Bose, Nam Nguyen, Thinh Nguyen.

**Figure 1.** Figure 1: Task-oriented lossy compression framework. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 4.** Figure 4: Illustration of Theorem 2. R(B) (D, C) versus D with given C = 0.8, qX = 0.3, qS1 = 0.2. (4) is feasible if C ≥ Hb(qS1 ). Moreover, under common randomness, R (B) (D, C) =    Hb(qX)(qX−D) qX , 0 ≤ D < qX(C−Hb(qS1 ) Hb(m)−Hb(qS1 ) Hb(qX)[Hb(m)−C] Hb(m)−Hb(qS1 ) , qX[C−Hb(qS1 )] Hb(m)−Hb(qS1 ) ≤ D ≤ 1 0, C ≥ Hb (m) and qX < D ≤ 1. where m = (1−qX)(1−qS1 ) +qXqS1 and Hb(.) denotes the binary entropy fu… view at source ↗

**Figure 5.** Figure 5: RDC curves for a fixed C: R(B) (D, C) and R(∞) (D, C) versus D, for C = 0.9, qX = 0.3, and qS1 = 0.2. variable S ∼ pS, the corresponding one-shot distortion minimization problem can be expressed as D∗ (R, C) = min pU , pXˆ|X,U E[∆(X, Xˆ)] (5a) s.t. H(Xˆ|U, X) = 0, I(X, U) = 0, (5b) H(Xˆ|U) ≤ R, H(S|Xˆ) ≤ C. (5c) where pU,X,Xˆ = pU pX pXˆ|U,X [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 6.** Figure 6: Illustration of Theorem 4. R(B) (D, C) versus D with given C = 0.8, qX = 0.3, qS1 = 0.2. Theorem 4. Let X ∼ Bern(qX) be a Bernoulli source and let S be a binary task variable jointly distributed with X via S = X ⊕ S1, where S ∼ Bern(qS) and S1 ∼ Bern(qS1 ) (0 ≤ qX, qS, qS1 ≤ 1 2 ). Assume the Hamming distortion measure. Then the optimization problem (5) is feasible if C ≥ H(qS1 ). Moreover, under common ra… view at source ↗

**Figure 9.** Figure 9: The universal encoder representation framework. [PITH_FULL_IMAGE:figures/full_fig_p005_9.png] view at source ↗

**Figure 7.** Figure 7: Lower boundary of achievable region, Π(pZ|X) [PITH_FULL_IMAGE:figures/full_fig_p005_7.png] view at source ↗

**Figure 8.** Figure 8: DRC curves for a fixed R: D(∞) [PITH_FULL_IMAGE:figures/full_fig_p005_8.png] view at source ↗

read the original abstract

We study task-oriented lossy compression through the lens of rate-distortion-classification (RDC) representations. The source is Bernoulli, the distortion measure is Hamming, and the binary classification variable is coupled to the source via a binary symmetric model. Building on the one-shot common-randomness formulation, we first derive closed-form characterizations of the one-shot RDC and the dual distortion-rate-classification (DRC) tradeoffs. We then use a representation-based viewpoint and characterize the achievable distortion-classification (DC) region induced by a fixed representation by deriving its lower boundary via a linear program. Finally, we study universal encoders that must support a family of DC operating points and derive computable lower and upper bounds on the minimum asymptotic rate required for universality, thereby yielding bounds on the corresponding rate penalty. Numerical examples are provided to illustrate the achievable regions and the resulting universal RDC/DRC curves.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 4 minor

Summary. The manuscript develops a rate-distortion-classification (RDC) representation theory for Bernoulli sources under Hamming distortion, where the binary classification variable is coupled to the source through a binary-symmetric channel. Using a one-shot common-randomness formulation, it derives closed-form expressions for the one-shot RDC and dual distortion-rate-classification (DRC) tradeoffs. It then characterizes the achievable distortion-classification (DC) region for a fixed representation via the lower boundary of a linear program and derives computable lower and upper bounds on the minimum asymptotic rate required for universal encoders that must support a family of DC operating points, along with the associated rate penalty. Numerical examples illustrate the regions and curves.

Significance. If the derivations hold, the paper supplies explicit, computable characterizations of RDC and DRC tradeoffs together with an LP formulation of the DC region and bounds on the universal-rate penalty. These results are grounded in standard information-theoretic optimization and provide concrete tools for analyzing task-oriented compression that jointly controls distortion and classification performance. The explicit forms and numerical illustrations strengthen the contribution for the Bernoulli/Hamming case, which serves as a canonical setting for further extensions.

minor comments (4)

The abstract states that closed-form characterizations are obtained, but the manuscript should explicitly state the range of parameters (e.g., crossover probability of the binary-symmetric coupling and distortion levels) for which the closed forms remain valid without case distinctions.
In the LP formulation of the DC region (presumably §4), the decision variables and the objective should be written with explicit dependence on the representation mapping so that the boundary computation is reproducible from the given expressions.
The universal-rate bounds are presented as computable; the manuscript would benefit from a short algorithmic description or pseudocode showing how the lower and upper bounds are evaluated for a given family of DC points.
Notation for the one-shot common-randomness auxiliary variables should be introduced once and used consistently across the RDC, DRC, and universal sections to avoid reader confusion.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of the manuscript, the recognition of its explicit characterizations for the Bernoulli/Hamming setting, and the recommendation for minor revision. No specific major comments were raised in the report.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper starts from an explicitly stated source model (Bernoulli with Hamming distortion and binary-symmetric coupling to the classification variable) and proceeds through standard one-shot common-randomness optimization to obtain closed-form RDC/DRC expressions, followed by an LP characterization of the DC region and computable bounds on the universal rate. These steps are internally consistent once the coupling model is granted and do not reduce any claimed prediction or first-principles result to a fitted parameter or self-referential definition by construction. No load-bearing self-citations, uniqueness theorems imported from the authors' prior work, or ansatzes smuggled via citation are exhibited in the derivation chain.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities are detailed in the provided text.

pith-pipeline@v0.9.0 · 5679 in / 1106 out tokens · 71683 ms · 2026-05-21T15:56:56.610024+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

closed-form characterizations of the one-shot RDC and the dual distortion-rate-classification (DRC) tradeoffs... lower boundary via a linear program
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

R(B)(D, C) = Hb(qX)(qX − D)/qX ... with m = (1−qX)(1−qS1) + qX qS1

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Cross-Domain Lossy Compression via Constrained Minimum Entropy Coupling
cs.IT 2026-05 unverdicted novelty 7.0

Rate-constrained minimum entropy coupling enables cross-domain lossy compression with classification preservation, providing closed-form solutions for Bernoulli sources and neural implementations for MNIST and SVHN tasks.

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages · cited by 1 Pith paper · 1 internal anchor

[1]

Prentice Hall, Englewood Cliffs, NJ, USA, 1971

Toby Berger.Rate Distortion Theory: A Mathematical Basis for Data Compression. Prentice Hall, Englewood Cliffs, NJ, USA, 1971

work page 1971
[2]

Cover and Joy A

Thomas M. Cover and Joy A. Thomas.Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing). Wiley- Interscience, USA, 2006

work page 2006
[3]

Rethinking lossy compression: The rate-distortion-perception tradeoff

Yochai Blau and Tomer Michaeli. Rethinking lossy compression: The rate-distortion-perception tradeoff. InInternational Conference on Machine Learning, pages 675–685, 2019

work page 2019
[4]

The perception-distortion tradeoff

Yochai Blau and Tomer Michaeli. The perception-distortion tradeoff. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6228–6237, 2018

work page 2018
[5]

Cross-domain lossy compression as entropy constrained optimal transport.IEEE Journal on Selected Areas in Information Theory, 3(3):513–527, 2022

Huan Liu, George Zhang, Jun Chen, and Ashish Khisti. Cross-domain lossy compression as entropy constrained optimal transport.IEEE Journal on Selected Areas in Information Theory, 3(3):513–527, 2022

work page 2022
[6]

Autoencoding beyond pixels using a learned similarity metric

Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, and Ole Winther. Autoencoding beyond pixels using a learned similarity metric. InInternational Conference on Machine Learning, pages 1558– 1566, 2016

work page 2016
[7]

Generative adversarial networks for extreme learned image compression

Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, and Luc Van Gool. Generative adversarial networks for extreme learned image compression. InProceedings of the IEEE International Conference on Computer Vision, pages 221–231, 2019

work page 2019
[8]

High-fidelity generative image compression

Fabian Mentzer, George D Toderici, Michael Tschannen, and Eirikur Agustsson. High-fidelity generative image compression. InAdvances in Neural Information Processing Systems, volume 33, 2020

work page 2020
[9]

Lossy image compression with conditional diffusion models

Ruihan Yang and Stephan Mandt. Lossy image compression with conditional diffusion models. InAdvances in Neural Information Processing Systems, volume 36, pages 64971–64995. Curran Associates, Inc., 2023. NeurIPS 2023

work page 2023
[10]

The information bottleneck method

Naftali Tishby, Fernando C. Pereira, and William Bialek. The informa- tion bottleneck method.arXiv preprint physics/0004057, 2000

work page internal anchor Pith review Pith/arXiv arXiv 2000
[11]

On the information bottleneck and its applications to learning.IEEE Transactions on Information Theory, 2016

Yihong Wu et al. On the information bottleneck and its applications to learning.IEEE Transactions on Information Theory, 2016

work page 2016
[12]

On the classification- distortion-perception tradeoff

Dong Liu, Haochen Zhang, and Zhiwei Xiong. On the classification- distortion-perception tradeoff. In H. Wallach, H. Larochelle, A. Beygelz- imer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors,Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019

work page 2019
[13]

A rate-distortion-classification approach for lossy image compression.Digital Signal Processing, 141:104163, September 2023

Yuefeng Zhang. A rate-distortion-classification approach for lossy image compression.Digital Signal Processing, 141:104163, September 2023

work page 2023
[14]

Task-oriented lossy compression with data, perception, and classification constraints.IEEE Journal on Selected Areas in Communications, 43(7):2635–2650, 2025

Yuhan Wang, Youlong Wu, Shuai Ma, and Ying-Jun Angela Zhang. Task-oriented lossy compression with data, perception, and classification constraints.IEEE Journal on Selected Areas in Communications, 43(7):2635–2650, 2025

work page 2025
[15]

Lossy compression for lossless prediction

Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, and Lucas Theis. Lossy compression for lossless prediction. InAdvances in Neural Information Processing Systems, volume 36, 2023

work page 2023
[16]

Conditional encoder-based adaptive deep image compression with classification-driven semantic awareness.Electronics, 12(13), 2023

Zhongyue Lei, Weicheng Zhang, Xuemin Hong, Jianghong Shi, Minxian Su, and Chaoheng Lin. Conditional encoder-based adaptive deep image compression with classification-driven semantic awareness.Electronics, 12(13), 2023

work page 2023
[17]

Zhongyue Lei, Peng Duan, Xuemin Hong, João F. C. Mota, Jianghong Shi, and Cheng-Xiang Wang. Progressive deep image compression for hybrid contexts of image classification and reconstruction.IEEE Journal on Selected Areas in Communications, 41(1):72–89, 2023

work page 2023
[18]

The rate-distortion-accuracy tradeoff: Jpeg case study

Xiyang Luo, Hossein Talebi, Feng Yang, Michael Elad, and Peyman Milanfar. The rate-distortion-accuracy tradeoff: Jpeg case study. In 2021 Data Compression Conference (DCC), pages 354–354, 2021

work page 2021
[19]

Universal rate-distortion-perception representations for lossy compression.IEEE Transactions on Information Theory, pages 1–1, 2025

George Zhang, Jingjing Qian, Jun Chen, and Ashish Khisti. Universal rate-distortion-perception representations for lossy compression.IEEE Transactions on Information Theory, pages 1–1, 2025

work page 2025
[20]

A rate- distortion-perception theory for binary sources

Jingjing Qian, George Zhang, Jun Chen, and Ashish Khisti. A rate- distortion-perception theory for binary sources. In Amos Lapidoth and Stefan M. Moser, editors,International Zurich Seminar on Information and Communication (IZS 2022). Proceedings, pages 34 – 38, Zurich,

work page 2022
[21]

International Zurich Seminar on Information and Communication (IZS 2022); Conference Location: Zurich, Switzerland; Conference Date: March 2–4, 2022

ETH Zurich. International Zurich Seminar on Information and Communication (IZS 2022); Conference Location: Zurich, Switzerland; Conference Date: March 2–4, 2022

work page 2022
[22]

Universal rate-distortion-classification representations for lossy compression

Nam Nguyen, Thuan Nguyen, Thinh Nguyen, and Bella Bose. Universal rate-distortion-classification representations for lossy compression. In 2025 IEEE Information Theory Workshop (ITW), pages 1–6, 2025

work page 2025
[23]

CVX: Matlab software for disciplined convex programming, version 2.0

CVX Research, Inc. CVX: Matlab software for disciplined convex programming, version 2.0. https://cvxr.com/cvx, August 2012

work page 2012
[24]

Graph implementations for nons- mooth convex programs

Michael Grant and Stephen Boyd. Graph implementations for nons- mooth convex programs. In V . Blondel, S. Boyd, and H. Kimura, editors, Recent Advances in Learning and Control, Lecture Notes in Control and Information Sciences, pages 95–110. Springer, 2008

work page 2008
[25]

CVXPY: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

Steven Diamond and Stephen Boyd. CVXPY: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

work page 2016
[26]

A rewriting system for convex optimization problems.Journal of Control and Decision, 5(1):42–60, 2018

Akshay Agrawal, Robin Verschueren, Steven Diamond, and Stephen Boyd. A rewriting system for convex optimization problems.Journal of Control and Decision, 5(1):42–60, 2018

work page 2018
[27]

Cambridge University Press, 2011

Abbas El Gamal and Young-Han Kim.Network Information Theory. Cambridge University Press, 2011

work page 2011
[28]

Strong functional representation lemma and applications to coding theorems.IEEE Transactions on Information Theory, 64(11):6967–6978, 2018

Cheuk Ting Li and Abbas El Gamal. Strong functional representation lemma and applications to coding theorems.IEEE Transactions on Information Theory, 64(11):6967–6978, 2018

work page 2018
[29]

John Wiley & Sons, 1999

Thomas M Cover and Joy A Thomas.Elements of information theory. John Wiley & Sons, 1999

work page 1999
[30]

Wyner and J

A. Wyner and J. Ziv. A theorem on the entropy of certain binary sequences and applications–i.IEEE Transactions on Information Theory, 19(6):769–772, 1973

work page 1973
[31]

PhD thesis, McMaster University, 2023

Jingjing Qian.On the Rate-Distortion-Perception Tradeoff for Lossy Compression. PhD thesis, McMaster University, 2023. APPENDIX A. Proof of Theorem 1 We start from the one-shot RDC formulation with common randomness given in Definition 1: R∗(D, C) = minpU , pZ|X,U , p ˆX|Z,U H(Z|U) s.t.E[∆(X, ˆX)]≤D, H(S| ˆX)≤C. wherep U,X,Z, ˆX =p U pX pZ|X,U p ˆX|Z,U . ...

work page 2023
[32]

p00|0 log p00|0 (1−q)p 00|0 +qp 00|1 +p 01|0 log p01|0 (1−q)p 01|0 +qp 01|1 +p 11|0 log p11|0 (1−q)p 11|0 +qp 11|1 # (25) +q

implies H(S| ˆX)≥H(S|X) =H(X⊕S 1|X) =H(S 1). Hence, feasibility of the classification constraint requiresC≥ H(S1). We now evaluateH(S| ˆX, U=u)for each mapping. ForU= 1: ˆX=X H(S| ˆX, U= 1) =H(S|X) =H(X⊕S 1|X) =H(S 1) =H b(qS1). ForU= 2: ˆX= 1−X H(S| ˆX, U= 2) =H(S|X) =H(S 1) =H b(qS1). ForU= 3: ˆX= 0 S=X⊕S 1 ⇒P(S= 0) = (1−q X)(1−q S1) +q X qS1 , H(S| ˆX,...

work page

[1] [1]

Prentice Hall, Englewood Cliffs, NJ, USA, 1971

Toby Berger.Rate Distortion Theory: A Mathematical Basis for Data Compression. Prentice Hall, Englewood Cliffs, NJ, USA, 1971

work page 1971

[2] [2]

Cover and Joy A

Thomas M. Cover and Joy A. Thomas.Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing). Wiley- Interscience, USA, 2006

work page 2006

[3] [3]

Rethinking lossy compression: The rate-distortion-perception tradeoff

Yochai Blau and Tomer Michaeli. Rethinking lossy compression: The rate-distortion-perception tradeoff. InInternational Conference on Machine Learning, pages 675–685, 2019

work page 2019

[4] [4]

The perception-distortion tradeoff

Yochai Blau and Tomer Michaeli. The perception-distortion tradeoff. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6228–6237, 2018

work page 2018

[5] [5]

Cross-domain lossy compression as entropy constrained optimal transport.IEEE Journal on Selected Areas in Information Theory, 3(3):513–527, 2022

Huan Liu, George Zhang, Jun Chen, and Ashish Khisti. Cross-domain lossy compression as entropy constrained optimal transport.IEEE Journal on Selected Areas in Information Theory, 3(3):513–527, 2022

work page 2022

[6] [6]

Autoencoding beyond pixels using a learned similarity metric

Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, and Ole Winther. Autoencoding beyond pixels using a learned similarity metric. InInternational Conference on Machine Learning, pages 1558– 1566, 2016

work page 2016

[7] [7]

Generative adversarial networks for extreme learned image compression

Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, and Luc Van Gool. Generative adversarial networks for extreme learned image compression. InProceedings of the IEEE International Conference on Computer Vision, pages 221–231, 2019

work page 2019

[8] [8]

High-fidelity generative image compression

Fabian Mentzer, George D Toderici, Michael Tschannen, and Eirikur Agustsson. High-fidelity generative image compression. InAdvances in Neural Information Processing Systems, volume 33, 2020

work page 2020

[9] [9]

Lossy image compression with conditional diffusion models

Ruihan Yang and Stephan Mandt. Lossy image compression with conditional diffusion models. InAdvances in Neural Information Processing Systems, volume 36, pages 64971–64995. Curran Associates, Inc., 2023. NeurIPS 2023

work page 2023

[10] [10]

The information bottleneck method

Naftali Tishby, Fernando C. Pereira, and William Bialek. The informa- tion bottleneck method.arXiv preprint physics/0004057, 2000

work page internal anchor Pith review Pith/arXiv arXiv 2000

[11] [11]

On the information bottleneck and its applications to learning.IEEE Transactions on Information Theory, 2016

Yihong Wu et al. On the information bottleneck and its applications to learning.IEEE Transactions on Information Theory, 2016

work page 2016

[12] [12]

On the classification- distortion-perception tradeoff

Dong Liu, Haochen Zhang, and Zhiwei Xiong. On the classification- distortion-perception tradeoff. In H. Wallach, H. Larochelle, A. Beygelz- imer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors,Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019

work page 2019

[13] [13]

A rate-distortion-classification approach for lossy image compression.Digital Signal Processing, 141:104163, September 2023

Yuefeng Zhang. A rate-distortion-classification approach for lossy image compression.Digital Signal Processing, 141:104163, September 2023

work page 2023

[14] [14]

Task-oriented lossy compression with data, perception, and classification constraints.IEEE Journal on Selected Areas in Communications, 43(7):2635–2650, 2025

Yuhan Wang, Youlong Wu, Shuai Ma, and Ying-Jun Angela Zhang. Task-oriented lossy compression with data, perception, and classification constraints.IEEE Journal on Selected Areas in Communications, 43(7):2635–2650, 2025

work page 2025

[15] [15]

Lossy compression for lossless prediction

Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, and Lucas Theis. Lossy compression for lossless prediction. InAdvances in Neural Information Processing Systems, volume 36, 2023

work page 2023

[16] [16]

Conditional encoder-based adaptive deep image compression with classification-driven semantic awareness.Electronics, 12(13), 2023

Zhongyue Lei, Weicheng Zhang, Xuemin Hong, Jianghong Shi, Minxian Su, and Chaoheng Lin. Conditional encoder-based adaptive deep image compression with classification-driven semantic awareness.Electronics, 12(13), 2023

work page 2023

[17] [17]

Zhongyue Lei, Peng Duan, Xuemin Hong, João F. C. Mota, Jianghong Shi, and Cheng-Xiang Wang. Progressive deep image compression for hybrid contexts of image classification and reconstruction.IEEE Journal on Selected Areas in Communications, 41(1):72–89, 2023

work page 2023

[18] [18]

The rate-distortion-accuracy tradeoff: Jpeg case study

Xiyang Luo, Hossein Talebi, Feng Yang, Michael Elad, and Peyman Milanfar. The rate-distortion-accuracy tradeoff: Jpeg case study. In 2021 Data Compression Conference (DCC), pages 354–354, 2021

work page 2021

[19] [19]

Universal rate-distortion-perception representations for lossy compression.IEEE Transactions on Information Theory, pages 1–1, 2025

George Zhang, Jingjing Qian, Jun Chen, and Ashish Khisti. Universal rate-distortion-perception representations for lossy compression.IEEE Transactions on Information Theory, pages 1–1, 2025

work page 2025

[20] [20]

A rate- distortion-perception theory for binary sources

Jingjing Qian, George Zhang, Jun Chen, and Ashish Khisti. A rate- distortion-perception theory for binary sources. In Amos Lapidoth and Stefan M. Moser, editors,International Zurich Seminar on Information and Communication (IZS 2022). Proceedings, pages 34 – 38, Zurich,

work page 2022

[21] [21]

International Zurich Seminar on Information and Communication (IZS 2022); Conference Location: Zurich, Switzerland; Conference Date: March 2–4, 2022

ETH Zurich. International Zurich Seminar on Information and Communication (IZS 2022); Conference Location: Zurich, Switzerland; Conference Date: March 2–4, 2022

work page 2022

[22] [22]

Universal rate-distortion-classification representations for lossy compression

Nam Nguyen, Thuan Nguyen, Thinh Nguyen, and Bella Bose. Universal rate-distortion-classification representations for lossy compression. In 2025 IEEE Information Theory Workshop (ITW), pages 1–6, 2025

work page 2025

[23] [23]

CVX: Matlab software for disciplined convex programming, version 2.0

CVX Research, Inc. CVX: Matlab software for disciplined convex programming, version 2.0. https://cvxr.com/cvx, August 2012

work page 2012

[24] [24]

Graph implementations for nons- mooth convex programs

Michael Grant and Stephen Boyd. Graph implementations for nons- mooth convex programs. In V . Blondel, S. Boyd, and H. Kimura, editors, Recent Advances in Learning and Control, Lecture Notes in Control and Information Sciences, pages 95–110. Springer, 2008

work page 2008

[25] [25]

CVXPY: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

Steven Diamond and Stephen Boyd. CVXPY: A python-embedded modeling language for convex optimization.Journal of Machine Learning Research, 17(83):1–5, 2016

work page 2016

[26] [26]

A rewriting system for convex optimization problems.Journal of Control and Decision, 5(1):42–60, 2018

Akshay Agrawal, Robin Verschueren, Steven Diamond, and Stephen Boyd. A rewriting system for convex optimization problems.Journal of Control and Decision, 5(1):42–60, 2018

work page 2018

[27] [27]

Cambridge University Press, 2011

Abbas El Gamal and Young-Han Kim.Network Information Theory. Cambridge University Press, 2011

work page 2011

[28] [28]

Strong functional representation lemma and applications to coding theorems.IEEE Transactions on Information Theory, 64(11):6967–6978, 2018

Cheuk Ting Li and Abbas El Gamal. Strong functional representation lemma and applications to coding theorems.IEEE Transactions on Information Theory, 64(11):6967–6978, 2018

work page 2018

[29] [29]

John Wiley & Sons, 1999

Thomas M Cover and Joy A Thomas.Elements of information theory. John Wiley & Sons, 1999

work page 1999

[30] [30]

Wyner and J

A. Wyner and J. Ziv. A theorem on the entropy of certain binary sequences and applications–i.IEEE Transactions on Information Theory, 19(6):769–772, 1973

work page 1973

[31] [31]

PhD thesis, McMaster University, 2023

Jingjing Qian.On the Rate-Distortion-Perception Tradeoff for Lossy Compression. PhD thesis, McMaster University, 2023. APPENDIX A. Proof of Theorem 1 We start from the one-shot RDC formulation with common randomness given in Definition 1: R∗(D, C) = minpU , pZ|X,U , p ˆX|Z,U H(Z|U) s.t.E[∆(X, ˆX)]≤D, H(S| ˆX)≤C. wherep U,X,Z, ˆX =p U pX pZ|X,U p ˆX|Z,U . ...

work page 2023

[32] [32]

p00|0 log p00|0 (1−q)p 00|0 +qp 00|1 +p 01|0 log p01|0 (1−q)p 01|0 +qp 01|1 +p 11|0 log p11|0 (1−q)p 11|0 +qp 11|1 # (25) +q

implies H(S| ˆX)≥H(S|X) =H(X⊕S 1|X) =H(S 1). Hence, feasibility of the classification constraint requiresC≥ H(S1). We now evaluateH(S| ˆX, U=u)for each mapping. ForU= 1: ˆX=X H(S| ˆX, U= 1) =H(S|X) =H(X⊕S 1|X) =H(S 1) =H b(qS1). ForU= 2: ˆX= 1−X H(S| ˆX, U= 2) =H(S|X) =H(S 1) =H b(qS1). ForU= 3: ˆX= 0 S=X⊕S 1 ⇒P(S= 0) = (1−q X)(1−q S1) +q X qS1 , H(S| ˆX,...

work page