Distributed Integrated Sensing and Edge AI Exploiting Prior Information

Biao Dong; Bin Cao; Guan Gui; Qinyu Zhang

arxiv: 2512.00309 · v2 · pith:HT3LOKWBnew · submitted 2025-11-29 · 📡 eess.SP

Pith reviewed 2026-05-21 19:16 UTC · model grok-4.3

keywords distributed sensing · edge AI · Bayesian inference · Gaussian mixture prior · power allocation · TDM · FDM · inference performance

The pith

Incorporating Gaussian mixture priors in distributed sensing improves feature denoising at low SNR and enables extra inference gains through discriminant-aware power allocation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a Bayesian framework for a distributed integrated sensing and edge AI system that incorporates task-relevant priors to maximize inference performance. At the sensing level it designs an estimator that uses a Gaussian mixture prior to weight class-conditional posterior means according to their responsibilities, which denoises extracted features more effectively than maximum-likelihood methods when signals are weak. At the communication level it introduces computation-optimal and decision-optimal proxies that yield closed-form power allocation solutions for both time-division and frequency-division multiplexing, with threshold-based and dual-decomposition structures. The results indicate that allocations aware of class discriminability produce additional gains in overall inference quality.

Core claim

Under a Bayesian framework for distributed integrated sensing and edge AI (ISEA), a responsibility-weighted Bayesian (RWB) estimator with a Gaussian-mixture prior denoises features by weighting class-conditional posterior means with their responsibilities, and it outperforms maximum-likelihood (ML) estimation at low SNR. Two theoretical proxies, the computation-optimal and the decision-optimal, are introduced to derive optimal transceiver designs with closed-form power allocation for time-division (TDM) and frequency-division (FDM) multiplexing. The solutions reveal threshold-based and dual-decomposition structures, and discriminant-aware allocation yields additional inference gains.
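
The estimator in the core claim can be written out under one common modelling assumption. The page does not state the paper's observation model, so the additive white Gaussian noise model below, and the symbols $\pi_k$, $\boldsymbol{\mu}_k$, $\boldsymbol{\Sigma}_k$, and $\sigma^2$, are assumptions made here for illustration. Under them, the responsibility-weighted posterior mean takes the standard Gaussian-mixture form:

```latex
\begin{align}
  p(\mathbf{x}) &= \sum_{k=1}^{K} \pi_k \,\mathcal{N}(\mathbf{x};\, \boldsymbol{\mu}_k, \boldsymbol{\Sigma}_k), \qquad
  \mathbf{y} = \mathbf{x} + \mathbf{n}, \quad \mathbf{n} \sim \mathcal{N}(\mathbf{0}, \sigma^2 \mathbf{I}), \\
  r_k(\mathbf{y}) &= \frac{\pi_k \,\mathcal{N}(\mathbf{y};\, \boldsymbol{\mu}_k, \boldsymbol{\Sigma}_k + \sigma^2 \mathbf{I})}
                         {\sum_{j=1}^{K} \pi_j \,\mathcal{N}(\mathbf{y};\, \boldsymbol{\mu}_j, \boldsymbol{\Sigma}_j + \sigma^2 \mathbf{I})}, \\
  \hat{\mathbf{x}}_k(\mathbf{y}) &= \boldsymbol{\mu}_k + \boldsymbol{\Sigma}_k \left(\boldsymbol{\Sigma}_k + \sigma^2 \mathbf{I}\right)^{-1} (\mathbf{y} - \boldsymbol{\mu}_k), \\
  \hat{\mathbf{x}}_{\mathrm{RWB}}(\mathbf{y}) &= \sum_{k=1}^{K} r_k(\mathbf{y})\, \hat{\mathbf{x}}_k(\mathbf{y}), \qquad
  \hat{\mathbf{x}}_{\mathrm{ML}}(\mathbf{y}) = \mathbf{y}.
\end{align}
```

As $\sigma^2 \to 0$ the gain $\boldsymbol{\Sigma}_k(\boldsymbol{\Sigma}_k + \sigma^2\mathbf{I})^{-1}$ approaches the identity, so every class-conditional mean approaches $\mathbf{y}$ and the two estimators coincide. As $\sigma^2$ grows, the estimate is pulled toward the class means, which is where a denoising advantage at low SNR would come from in this model.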

What carries the argument

The responsibility-weighted Bayesian estimator using a Gaussian-mixture prior, together with computation-optimal and decision-optimal proxies that guide closed-form power allocation in TDM and FDM.
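
A minimal sketch of a responsibility-weighted estimator may make the mechanism concrete. It assumes scalar features, the observation model y = x + n with Gaussian noise, and a two-component prior with made-up parameters; none of these come from the paper, and the function names `rwb_estimate` and `compare_mse` are hypothetical. The comparison treats the raw observation as the ML estimate, which holds for this assumed model only.

```python
import math
import random


def rwb_estimate(y, weights, means, variances, noise_var):
    """Responsibility-weighted posterior mean for y = x + n, n ~ N(0, noise_var).

    The prior on x is a scalar Gaussian mixture (an assumption of this sketch).
    Returns the estimate and the list of responsibilities.
    """
    log_r, cond_means = [], []
    for w, mu, v in zip(weights, means, variances):
        s = v + noise_var  # variance of y given class k
        # Log of (mixture weight * Gaussian likelihood), dropping shared constants.
        log_r.append(math.log(w) - 0.5 * (math.log(s) + (y - mu) ** 2 / s))
        # Class-conditional posterior mean of x given y.
        cond_means.append(mu + (v / s) * (y - mu))
    top = max(log_r)  # subtract the max for numerical stability
    r = [math.exp(value - top) for value in log_r]
    total = sum(r)
    r = [value / total for value in r]
    return sum(ri * m for ri, m in zip(r, cond_means)), r


def compare_mse(noise_var, trials=5000, seed=0):
    """Monte Carlo MSE of the ML estimate (y itself) and the RWB estimate."""
    rng = random.Random(seed)
    # Illustrative prior parameters, not taken from the paper.
    weights, means, variances = [0.5, 0.5], [-2.0, 2.0], [0.1, 0.1]
    err_ml = err_rwb = 0.0
    for _ in range(trials):
        k = rng.choices(range(len(weights)), weights=weights)[0]
        x = rng.gauss(means[k], math.sqrt(variances[k]))
        y = x + rng.gauss(0.0, math.sqrt(noise_var))
        x_hat, _ = rwb_estimate(y, weights, means, variances, noise_var)
        err_ml += (y - x) ** 2
        err_rwb += (x_hat - x) ** 2
    return err_ml / trials, err_rwb / trials
```

Calling `compare_mse(4.0)` (heavy noise) and `compare_mse(0.01)` (light noise) shows how the gap between the two estimators depends on the noise level in this toy setting. It says nothing about the magnitude of the gains reported in the paper.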

If this is right

Closed-form power allocation policies can be obtained for both TDM and FDM communication settings.
The resulting allocation structures are threshold-based in some cases and use dual decomposition in others.
Discriminant-aware power allocation produces measurable improvements in inference performance beyond standard methods.
The RWB estimator provides denoising benefits over maximum-likelihood estimation specifically at low SNR.
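
The threshold and dual-variable structure named in the list above can be illustrated with textbook water-filling. This is a swapped-in classic, not the paper's allocation: it maximizes a sum of log(1 + gain × power) terms under a total power budget, whereas the paper optimizes its own proxies, whose closed forms are not reproduced on this page. The function name `water_filling` and the example gains are hypothetical.

```python
def water_filling(gains, total_power, tol=1e-10):
    """Classic water-filling: p_i = max(0, level - 1/g_i) with sum(p_i) = total_power.

    The water level plays the role of the (inverse) dual variable of the power
    constraint and is found by bisection. Channels whose inverse gain exceeds
    the level get zero power, which is the threshold structure.
    """
    lo = 0.0
    hi = total_power + max(1.0 / g for g in gains)  # level at which power used >= budget
    while hi - lo > tol:
        level = 0.5 * (lo + hi)
        used = sum(max(0.0, level - 1.0 / g) for g in gains)
        if used > total_power:
            hi = level
        else:
            lo = level
    return [max(0.0, lo - 1.0 / g) for g in gains]


powers = water_filling([2.0, 1.0, 0.1], total_power=1.0)
```

With these example gains the weakest channel falls below the threshold and receives no power, while the other two share the budget in proportions of roughly 0.75 and 0.25.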

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar proxy-based allocation could be tested under other multiplexing schemes or with imperfect channel knowledge.
Replacing the Gaussian mixture with heavier-tailed or otherwise non-Gaussian priors might extend the denoising advantage to more varied sensing environments.
Hardware validation with real sensor data would show whether the theoretical proxies remain accurate when implementation losses are present.

Load-bearing premise

The Gaussian-mixture prior accurately models the class-conditional feature distributions and the two theoretical proxies faithfully represent the true end-to-end inference performance.

What would settle it

A controlled experiment that measures actual end-to-end inference error rates using known class-conditional distributions and compares them directly against the performance predicted by the computation-optimal and decision-optimal proxies would confirm or refute the reported gains.
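
The shape of such an experiment can be sketched on a toy problem. The example below uses two equiprobable scalar Gaussian classes with a shared variance, for which the error of the midpoint decision rule is known in closed form as Q(d/2), where d is the distance between the class means divided by the standard deviation. This distance is a generic discriminability measure chosen here for illustration; it is not claimed to be either of the paper's proxies, and the function names are hypothetical.

```python
import math
import random


def predicted_error(mu0, mu1, sigma):
    """Closed-form error Q(d/2) for two equiprobable Gaussians with shared variance."""
    d = abs(mu1 - mu0) / sigma
    return 0.5 * math.erfc(d / (2.0 * math.sqrt(2.0)))


def empirical_error(mu0, mu1, sigma, trials=20000, seed=0):
    """Monte Carlo error rate of the midpoint threshold rule on the same model."""
    rng = random.Random(seed)
    threshold = 0.5 * (mu0 + mu1)
    errors = 0
    for _ in range(trials):
        label = rng.randrange(2)  # equiprobable classes
        x = rng.gauss(mu1 if label else mu0, sigma)
        # Decide class 1 when x lies on the mu1 side of the threshold.
        decided = int(x > threshold) if mu1 > mu0 else int(x < threshold)
        errors += decided != label
    return errors / trials


gap = abs(empirical_error(-1.0, 1.0, 1.0) - predicted_error(-1.0, 1.0, 1.0))
```

A small `gap` indicates that the prediction tracks the measured error on this model. The analogous check for the paper would replace `predicted_error` with the proxy value and `empirical_error` with the end-to-end inference error under each power allocation.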

Figures

Figures reproduced from arXiv: 2512.00309 by Biao Dong, Bin Cao, Guan Gui, Qinyu Zhang.

**Figure 2.** Two graphical models, where white nodes denote latent …

**Figure 3.** MSE versus sensing SNR under ML estimation and the …

**Figure 4.** MSE and MD versus the communication SNR under MSE …

**Figure 5.** MSE and MD versus communication SNR under MSE-optimal …

**Figure 6.** Two wireless sensing samples of human motion: (a) standing …

**Figure 7.** Classification performance comparison between ML and RWB …

**Figure 8.** Classification performance comparison under varying communication SNR: (a) MLP and (b) SVM.

**Figure 9.** MLP classification performance comparison under varying number of users

**Figure 10.** The feature visualization under different communication schemes: (a) raw features, (b) decision-optimal, (c) computation-optimal, …

**Figure 11.** MLP confusion matrices with SNRc = 10 dB under different communication schemes: (a) decision-optimal, (b) computation-optimal, (c) equal allocation, and (d) channel inversion.

In summary, the communication-level results demonstrate that the decision-optimal scheme enhances inference performance by incorporating a discriminative prior into the transceiver design. The performance gap between the computati…

Original abstract

This paper investigates a distributed ISEA system under a Bayesian framework, focusing on incorporating task-relevant priors to maximize inference performance. At the sensing level, an RWB estimator with a GM prior is designed. By weighting class-conditional posterior means with responsibilities, RWB effectively denoises features and outperforms ML at low SNR. At the communication level, two theoretical proxies are introduced: the computation-optimal and decision-optimal proxies. Optimal transceiver designs in terms of closed-form power allocation are derived for both TDM and FDM settings, revealing threshold-based and dual-decomposition structures. Results show that the discriminant-aware allocation yields additional inference gains.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives closed-form power allocations for distributed sensing and edge AI by pairing an RWB estimator with two proxy objectives, though the proxies' link to real inference error is the main open question.

This paper introduces a responsibility-weighted Bayesian estimator that uses a Gaussian mixture prior to denoise features at low SNR by weighting class-conditional posterior means according to their responsibilities. It then defines computation-optimal and decision-optimal proxies to derive closed-form power allocations for transceiver design under both TDM and FDM, with threshold-based and dual-decomposition structures. The results indicate that a discriminant-aware allocation produces additional inference gains over baselines.

Referee Report

2 major / 2 minor

Summary. The paper investigates a distributed integrated sensing and edge AI (ISEA) system under a Bayesian framework that incorporates task-relevant priors to maximize inference performance. It designs an RWB estimator with a Gaussian-mixture prior that weights class-conditional posterior means by responsibilities to denoise features and outperform ML at low SNR. Two theoretical proxies (computation-optimal and decision-optimal) are introduced, and closed-form power allocations are derived for TDM and FDM settings, with results indicating that discriminant-aware allocation yields additional inference gains.

Significance. If the proxies are shown to faithfully track end-to-end inference error, the closed-form derivations and discriminant-aware allocations could provide practical tools for resource optimization in edge-AI sensing systems. The explicit use of prior responsibilities in the estimator and the threshold-based/dual-decomposition structures in the allocations are potential strengths for reproducible transceiver design.

major comments (2)

[Abstract and Results section] The claim that 'the discriminant-aware allocation yields additional inference gains' rests on the two proxies serving as faithful stand-ins for true inference performance, yet the manuscript provides no Monte-Carlo validation or error bars comparing proxy values to actual classification/regression loss after the RWB estimator under the derived power allocations.
[Theoretical derivations (power allocation sections)] The closed-form solutions for TDM/FDM under the decision-optimal proxy are derived from standard Bayesian estimation and convex optimization; it is not shown that these proxies reduce directly to fitted quantities defined inside the paper rather than external assumptions on the GM prior weighting.

minor comments (2)

[Estimator design] Notation for class responsibilities and GM parameters should be introduced with explicit definitions before their use in the RWB estimator to improve readability.
[Results section] Simulation parameters (SNR ranges, number of Monte-Carlo trials, exact GM mixture weights) are not fully specified in the results, making reproduction of the reported gains difficult.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments and recommendation for major revision. We address each major comment point by point below, indicating the revisions we will make to the manuscript.

Referee: [Abstract and Results section] The claim that 'the discriminant-aware allocation yields additional inference gains' rests on the two proxies serving as faithful stand-ins for true inference performance, yet the manuscript provides no Monte-Carlo validation or error bars comparing proxy values to actual classification/regression loss after the RWB estimator under the derived power allocations.

Authors: We acknowledge that direct Monte-Carlo validation comparing the proxy values to actual end-to-end inference error (classification or regression loss after the RWB estimator) would provide stronger empirical support for the claim of additional gains from discriminant-aware allocation. The proxies are theoretically motivated approximations derived from the Bayesian framework, but the current manuscript does not include such explicit comparisons with error bars. In the revised version, we will add Monte-Carlo simulation results in the Results section that evaluate the proxies against true inference performance under the derived TDM and FDM power allocations, including error bars from repeated trials.

Revision: yes

Referee: [Theoretical derivations (power allocation sections)] The closed-form solutions for TDM/FDM under the decision-optimal proxy are derived from standard Bayesian estimation and convex optimization; it is not shown that these proxies reduce directly to fitted quantities defined inside the paper rather than external assumptions on the GM prior weighting.

Authors: The decision-optimal proxy is constructed explicitly from the internal quantities of the Gaussian-mixture prior in the RWB estimator (Section III), using the fitted responsibilities and class-conditional posterior means as defined in the manuscript. The closed-form power allocations follow by substituting these quantities into the convex optimization problem for the proxy. We will revise the theoretical derivations sections to include an explicit step-by-step reduction showing how the proxy expressions map directly onto these fitted GM parameters without introducing external assumptions.

Revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivations rely on standard Bayesian estimation and convex optimization

The paper begins with a standard Bayesian framework and introduces an RWB estimator using a Gaussian-mixture prior as a modeling choice for denoising features. It then defines two theoretical proxies (computation-optimal and decision-optimal) explicitly as stand-ins for inference performance and derives closed-form power allocations for TDM/FDM via convex optimization techniques such as threshold-based and dual-decomposition methods. These steps do not reduce by construction to fitted quantities or self-referential definitions inside the paper; the discriminant-aware allocation gains are presented as outcomes of applying the proxies rather than tautological renamings or self-citations. The work is self-contained against external benchmarks like standard ML estimators and optimization theory, with no load-bearing self-citation chains or ansatzes smuggled via prior author work.

Axiom & Free-Parameter Ledger

1 free parameter · 1 axiom · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the central claims rest on an assumed Gaussian-mixture prior and on the validity of the two proxies.

free parameters (1)

class responsibilities
Weights used to combine posterior means in the RWB estimator

axioms (1)

Domain assumption: a Gaussian mixture prior is a suitable model for the class-conditional feature distributions.
Invoked to design the RWB estimator

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel · relation: unclear
Paper passage: RWB estimator ... weighting class-conditional posterior means with responsibilities ... decision-optimal proxy ... maximum inter-class MD

IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean · alpha_pin_under_high_calibration · relation: unclear
Paper passage: optimal transceiver designs ... threshold-based and dual-decomposition structures

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages · 1 internal anchor

