Goal-Oriented Semantic Communication for Logical Decision Making
Pith reviewed 2026-05-10 01:17 UTC · model grok-4.3
The pith
A goal-oriented semantic system transmits only the first-order logic clauses most relevant to a decision task while preserving verifiability.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that goal-oriented states, introduced as an abstraction layer over ordinary world states in a first-order logic hierarchy, allow a sender to rank and transmit only those FOL clauses that are most informative about the decision rules. Semantic entropy, conditional entropy, and mutual information are defined by assigning an inductive logical probability measure to the structures of the language; the resulting quantities support a semantic rate-distortion objective that minimizes communication cost subject to a bound on the remaining uncertainty about the goal states. Equivalently, the semantic information bottleneck principle extracts the same minimal sufficient set of clauses.
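To make the ranking step concrete, here is a minimal sketch, not the paper's construction: a toy propositionalized world over three ground atoms, a uniform distribution standing in for the inductive logical probability measure, and an illustrative goal-oriented state. Clauses are ranked by the mutual information I(G; clause), i.e., by how much knowing a clause's truth value reduces uncertainty about the goal state. The atom names, the goal predicate, and the measure are all assumptions made for this example.

```python
import itertools
import math
from collections import defaultdict

# Toy "semantic structures": truth assignments over a few ground atoms.
ATOMS = ["At(car, x1)", "Red(light1)", "Clear(lane2)"]
MODELS = [dict(zip(ATOMS, bits))
          for bits in itertools.product([False, True], repeat=len(ATOMS))]

# Placeholder for the inductive logical probability measure (uniform here).
mu = {i: 1.0 / len(MODELS) for i in range(len(MODELS))}

def goal_state(m):
    # Illustrative goal-oriented state: a "safe to proceed" abstraction over the world state.
    return (not m["Red(light1)"]) and m["Clear(lane2)"]

def entropy(dist):
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

def marginal(fn):
    d = defaultdict(float)
    for i, m in enumerate(MODELS):
        d[fn(m)] += mu[i]
    return d

def mutual_information(clause_fn):
    # I(G; C) = H(G) - H(G | C), where C is the truth value of a candidate clause.
    h_goal = entropy(marginal(goal_state))
    h_cond = 0.0
    for c_val in (False, True):
        p_c = sum(mu[i] for i, m in enumerate(MODELS) if clause_fn(m) == c_val)
        if p_c == 0:
            continue
        cond = defaultdict(float)
        for i, m in enumerate(MODELS):
            if clause_fn(m) == c_val:
                cond[goal_state(m)] += mu[i] / p_c
        h_cond += p_c * entropy(cond)
    return h_goal - h_cond

clauses = {
    "Red(light1)": lambda m: m["Red(light1)"],
    "Clear(lane2)": lambda m: m["Clear(lane2)"],
    "At(car, x1)": lambda m: m["At(car, x1)"],
}
ranked = sorted(clauses, key=lambda name: mutual_information(clauses[name]), reverse=True)
print(ranked)  # clauses ordered by how much they reduce uncertainty about the goal state
```

Under this toy measure the atom that does not enter the goal abstraction gets zero mutual information and is ranked last, which is the behaviour the selection principle relies on.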
What carries the argument
Goal-oriented states as an abstraction layer over FOL world states, with transmission selected by a semantic information bottleneck that ranks clauses by their reduction in uncertainty about those states.
If this is right
- Only clauses critical to the goal states cross the channel, lowering required bandwidth compared with sending all observations.
- The transmitted clauses remain inside the original FOL language, so any downstream deduction engine can verify consistency with safety rules.
- The rate-distortion curve gives an explicit trade-off between bits sent and remaining uncertainty about the decision outcome (a toy version of this sweep is sketched after this list).
- Multiple agents can share the same logical representation and therefore coordinate without exchanging raw sensor data.
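A toy version of that trade-off, under the same kind of assumptions as the sketch above (uniform stand-in measure, invented atoms and goal rule): greedily add clauses and record the pair (number of clauses sent, remaining conditional entropy of the goal state). This illustrates the shape of the curve, not the paper's rate-distortion functional.

```python
import itertools, math
from collections import defaultdict

ATOMS = ["Red(light1)", "Clear(lane2)", "At(car, x1)"]        # invented ground atoms
MODELS = [dict(zip(ATOMS, bits))
          for bits in itertools.product([False, True], repeat=len(ATOMS))]
mu = [1.0 / len(MODELS)] * len(MODELS)                        # stand-in for the inductive measure
goal = lambda m: (not m["Red(light1)"]) and m["Clear(lane2)"]  # illustrative goal-oriented state

def cond_entropy(sent):
    """H(goal | truth values of the transmitted clauses): the 'distortion' axis."""
    groups = defaultdict(lambda: defaultdict(float))
    for p, m in zip(mu, MODELS):
        groups[tuple(m[a] for a in sent)][goal(m)] += p
    h = 0.0
    for dist in groups.values():
        pk = sum(dist.values())
        h += sum(-q * math.log2(q / pk) for q in dist.values() if q > 0)
    return h

# Greedy sweep: at each "rate" (number of transmitted clauses), send the clause
# that most reduces the remaining uncertainty about the goal state.
sent, remaining = [], list(ATOMS)
print(len(sent), round(cond_entropy(sent), 3))
while remaining:
    best = min(remaining, key=lambda a: cond_entropy(sent + [a]))
    sent.append(best)
    remaining.remove(best)
    print(len(sent), round(cond_entropy(sent), 3))  # one (rate, distortion) point per line
```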
Where Pith is reading between the lines
- The same clause-ranking step could be inserted into existing logic-based planners to prune messages before transmission.
- If the inductive probability measure can be learned from data rather than hand-specified, the framework might apply to domains where the full FOL theory is only partially known.
- Scaling tests with increasing numbers of dynamic agents would reveal whether the computational cost of evaluating the semantic mutual information remains tractable.
Load-bearing premise
An inductive logical probability measure can be placed over semantic structures so that semantic entropy and mutual information become well-defined numbers usable inside a rate-distortion optimization.
What would settle it
In the urban simulator, replace the selected clauses with a random subset of equal size and check whether the receiver can still deduce the correct safe path. If the random subset supports deduction just as well while data volume stays the same, the selection principle is not doing real work; if deduction accuracy collapses or verification fails only under the random subset, the selected clauses are indeed carrying the decision-critical content.
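A sketch of that check as a procedure; `deduce` and `reference_path` stand for the simulator's deduction engine and the known safe path, which are not specified here, so the function only fixes the protocol: equal-size random subsets, repeated trials, accuracy compared against the selected set.

```python
import random

def ablation_check(observations, selected, deduce, reference_path, trials=100, seed=0):
    """Compare deduction accuracy for the selected clause set against random
    subsets of the same size (same data volume). `deduce` maps a clause subset
    to the path the receiver infers; `reference_path` is the known safe path."""
    rng = random.Random(seed)
    hit_selected = (deduce(selected) == reference_path)
    hits_random = 0
    for _ in range(trials):
        subset = rng.sample(list(observations), len(selected))
        hits_random += (deduce(subset) == reference_path)
    return hit_selected, hits_random / trials
```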
Original abstract
This paper develops a principled foundation for goal-oriented semantic communication for logical decision-making. Consider a setting where autonomous agents engage in collaborative perception. In such settings, the volume of sensory data and limited bandwidth often make transmission of raw observations infeasible, requiring intelligent selection of task-relevant information. Because these scenarios are safety-critical, the selection and decision processes must also be transparent and verifiable. To address this, we propose an explainable semantic communication framework grounded in a First-Order Logic (FOL) hierarchical representation of the world. We define semantic information, entropy, conditional entropy, and mutual information by assigning an inductive logical probability measure over semantic structures in the language. Based on these definitions, we formulate a goal-oriented semantic communication objective through semantic rate-distortion theory and, equivalently, through the semantic information bottleneck principle. In this framework, task rules are represented as goal-oriented states, defined as a layer over the world states to capture decision-relevant abstractions. The resulting principle selects evidence that is most informative about these states, aiming to transmit only those FOL clauses most critical for decision-making while preserving logical verifiability. We demonstrate the effectiveness of the approach in a deduction-based safe path-following task within an FOL-based urban environment simulator with multiple dynamic agents.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops a goal-oriented semantic communication framework for logical decision-making in collaborative perception settings among autonomous agents. It represents the world via a First-Order Logic (FOL) hierarchical structure, assigns an inductive logical probability measure over semantic structures to define semantic entropy, conditional entropy, and mutual information, and uses these to formulate a communication objective via semantic rate-distortion theory (equivalently, the semantic information bottleneck). Goal-oriented states capture decision-relevant abstractions over world states; the resulting principle selects and transmits only the most informative FOL clauses for the task while preserving logical verifiability. Effectiveness is shown via a deduction-based safe path-following demonstration in an FOL-based urban simulator with dynamic agents.
Significance. If the central construction holds, the work supplies a transparent, logically verifiable approach to semantic communication for safety-critical decision tasks, directly linking FOL reasoning with rate-distortion optimization. The implemented deduction-based simulator constitutes a concrete, reproducible testbed that demonstrates clause selection for a path-following objective; this is a strength for empirical grounding.
major comments (1)
- [§3] Definitions of semantic entropy and mutual information: the inductive logical probability measure is introduced as the foundation for all information quantities and the subsequent rate-distortion objective, yet the manuscript provides no explicit construction, grounding, or computation procedure for assigning the measure to FOL structures in the simulator; without this, it is impossible to confirm that the measure is independent of the target task and that the objective does not reduce to a tautology.
minor comments (2)
- [Abstract and §5] The abstract states that the framework 'preserves logical verifiability' but does not specify which verification properties (e.g., soundness of the deduction engine or completeness of the transmitted clause set) are formally checked in the simulator; add a short paragraph or table entry clarifying this.
- [§2] Notation for goal-oriented states (introduced as a layer over world states) is used without an explicit mapping or example in the main text; a small illustrative diagram or set of clauses would improve readability.
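For the second point, the kind of clause-level illustration the comment asks for might look like the following; the predicates, the function symbol, and the rule are invented for this sketch and are not taken from the paper.

```latex
% World-state predicates (hypothetical): At(a, n), Blocked(n, n'), Red(l)
% A goal-oriented state defined as a layer over them:
\forall a\,\forall n\,\forall n'.\;
  \mathrm{SafeToMove}(a, n, n') \leftarrow
  \mathrm{At}(a, n) \wedge \neg\,\mathrm{Blocked}(n, n') \wedge \neg\,\mathrm{Red}(\mathrm{lightAt}(n'))
```

Transmission would then favour clauses that settle Blocked(n, n') and Red(lightAt(n')) for nodes on the candidate path, since only those literals change the truth value of SafeToMove.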
Simulated Author's Rebuttal
We thank the referee for the positive summary, the recognition of the work's significance for safety-critical decision tasks, and the recommendation for minor revision. We address the single major comment point by point below.
Point-by-point responses
- Referee: [§3] Definitions of semantic entropy and mutual information: the inductive logical probability measure is introduced as the foundation for all information quantities and the subsequent rate-distortion objective, yet the manuscript provides no explicit construction, grounding, or computation procedure for assigning the measure to FOL structures in the simulator; without this, it is impossible to confirm that the measure is independent of the target task and that the objective does not reduce to a tautology.
Authors: We agree that the manuscript introduces the inductive logical probability measure in §3 as the foundation for the semantic entropy, mutual information, and rate-distortion objective but does not supply an explicit construction, grounding, or step-by-step computation procedure for its assignment to concrete FOL structures inside the simulator. This leaves open the questions of task-independence and whether the objective could become tautological. In the revised manuscript we will add a new subsection (placed after the simulator description) that (i) recalls the formal definition of the measure as a probability distribution over the set of models of the FOL language that is fixed by the language signature alone, (ii) states the independence from goal-oriented states by construction, and (iii) provides the concrete computation procedure used in the experiments (including how the measure is evaluated or approximated over the finite set of ground atoms and clauses generated by the urban simulator). We will also include a short numerical example with a handful of clauses to illustrate that the resulting semantic rate-distortion functional remains non-trivial. These additions will directly resolve the referee's concern. revision: yes
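One hedged reading of point (iii), using the indicator-likelihood model that appears in the appendix excerpts further down: treat the finite set of truth assignments over the simulator's ground atoms as a stand-in for the constituents, put the uniform (direct-uniformity) prior on them, and evaluate hypotheses by counting consistent assignments, p(h | e) = |C(e ∧ h)| / |C(e)|. The atom names and the evidence/hypothesis below are illustrative, and whether this matches the authors' actual procedure is exactly what the promised revision is meant to spell out.

```python
from itertools import product

ATOMS = ["Obstacle(lane1)", "Green(light2)", "At(agent, n3)"]   # illustrative ground atoms

def models(atoms):
    """All 2^K truth assignments over the ground atoms: a finite stand-in
    for the constituents under the direct-uniformity prior."""
    return [dict(zip(atoms, bits)) for bits in product([False, True], repeat=len(atoms))]

def prob(hypothesis, evidence, atoms=ATOMS):
    """p(h | e) = |C(e ∧ h)| / |C(e)| under the indicator likelihood."""
    consistent = [m for m in models(atoms) if evidence(m)]
    if not consistent:
        return None  # the evidence is itself contradictory
    return sum(hypothesis(m) for m in consistent) / len(consistent)

# Example: evidence "the light is green", hypothesis "green light and no obstacle".
e = lambda m: m["Green(light2)"]
h = lambda m: m["Green(light2)"] and not m["Obstacle(lane1)"]
print(prob(h, e))  # 0.5
```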
Circularity Check
No significant circularity identified
full rationale
The paper defines semantic entropy, conditional entropy, and mutual information by assigning an inductive logical probability measure over FOL semantic structures, then applies these definitions to formulate a goal-oriented rate-distortion objective and information-bottleneck principle for selecting task-critical clauses. This is a definitional step rather than a derivation that reduces to its own inputs by construction. The framework is evaluated via an implemented deduction-based simulator in an FOL urban environment with dynamic agents, supplying an independent empirical check on the path-following task. No equations or claims reduce a prediction to a fitted parameter, invoke self-citation for uniqueness, or smuggle an ansatz; the central construction remains self-contained against the provided definitions and demonstration.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption: First-order logic can hierarchically represent world states and decision-relevant abstractions.
- ad hoc to paper: An inductive logical probability measure exists and can be assigned to semantic structures.
invented entities (1)
- Goal-oriented states (no independent evidence)
Reference graph
Works this paper leans on
- [1] Huiqiang Xie, Zhijin Qin, Geoffrey Y. Li, and Biing-Hwang Juang. Deep learning enabled semantic communication systems. IEEE Transactions on Signal Processing, 69:2663–2675, 2020.
- [2] Ke Yang, Sixian Wang, Jincheng Dai, Xiaoqi Qin, Kai Niu, and Ping Zhang. SwinJSCC: Taming Swin transformer for deep joint source-channel coding. IEEE Transactions on Cognitive Communications and Networking, 11:90–104, 2023.
- [3] Shugang Liu, Zhan Peng, Qiangguo Yu, and Linan Duan. A novel image semantic communication method via dynamic decision generation network and generative adversarial network. Scientific Reports, 14, 2024.
- [4] Zhenyi Wang, Li Zou, Shengyun Wei, Kai Li, Feifan Liao, Haibo Mi, and Rongxuan Lai. Large-language-model-enabled text semantic communication systems. Applied Sciences, 2024.
- [5] Kai Niu and Ping Zhang. A mathematical theory of semantic communication. ArXiv, abs/2401.13387, 2024.
- [6] Yulin Shao, Qingqing Cao, and Deniz Gündüz. A theory of semantic communication. IEEE Transactions on Mobile Computing, 23:12211–12228, 2022.
- [7] Xueyan Niu, Bo Bai, Nian Guo, Weixi Zhang, and Wei Han. Rate–distortion–perception trade-off in information theory, generative models, and intelligent communications. Entropy, 27, 2025.
- [8] Andrey Krygin, Pavel Karpenko, and Oleg Sychev. Performing first-order-logic queries over RDF data: Interpreter versus compiler to Apache Jena rules. In International Conference on Novelties in Intelligent Digital Systems, 2024.
- [9] Ahmet Faruk Saz, Siheng Xiong, and Faramarz Fekri. Analysis of semantic communication for logic-based hypothesis deduction. GLOBECOM 2025 - 2025 IEEE Global Communications Conference, pages 5701–5707, 2025.
- [10] Ahmet Faruk Saz, Siheng Xiong, and Faramarz Fekri. Lossy semantic communication for the logical deduction of the state of the world. 2025 IEEE Wireless Communications and Networking Conference (WCNC), pages 1–6, 2024.
- [11] Ahmet Faruk Saz, Siheng Xiong, and Faramarz Fekri. Discd: Distributed lossy semantic communication for logical deduction of hypothesis. 2025 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN), pages 1–7, 2025.
- [12] Ahmet Faruk Saz, Siheng Xiong, and Faramarz Fekri. On the theory of semantic information and communication for logical inference. pages 1–6, 2025.
- [13] Risto Hilpinen. Relational hypotheses and inductive inference. Synthese, 23:266–286, 1971.
- [14] Jaakko Hintikka. A two-dimensional continuum of inductive methods. Studies in Logic and the Foundations of Mathematics, 43:113–132, 1966.
- [15] Karen Nelte. Formulas of first-order logic in distributive normal form. Master's thesis, University of Cape Town, 1997.
- [16] Jaakko Hintikka. Distributive normal forms in first-order logic. Studies in Logic and the Foundations of Mathematics, 40:48–91, 1965.
- [17] Rudolf Carnap and Yehoshua Bar-Hillel. An outline of a theory of semantic information. 1952.
- [18] Rudolf Carnap. The continuum of inductive methods. Philosophy, 28(106):272–273, 1953.
- [19] Bowen Li, Zhaoyu Li, Qiwei Du, Jinqi Luo, Wenshan Wang, Yaqi Xie, Simon Stepputtis, Chen Wang, Katia P. Sycara, Pradeep Kumar Ravikumar, Alexander Gray, Xujie Si, and Sebastian A. Scherer. LogiCity: Advancing neuro-symbolic AI with abstract urban simulation. ArXiv, abs/2411.00773, 2024.
- [20] Rudolf Carnap. Logical Foundations of Probability. The University of Chicago Press, Chicago, 2nd edition, 1962.
- [21] Ilkka Niiniluoto. The development of the Hintikka program. In Inductive Logic, 2011.
- [22] Jie Bao, Prithwish Basu, Mike Dean, Craig Partridge, Ananthram Swami, Will Leland, and James A. Hendler. Towards a theory of semantic communication. In 2011 IEEE Network Science Workshop, pages 110–117, 2011.
Appendix A: Inductive logical probability as a foundation for semantic information
The communication framework developed in this work rests on an inductive logical probability measure assigned to the semantic structures of the language.
- Evidence diversity ($K$): if $K_a < K_b$, the leading term $2^{|Q|-K_a}$ exceeds $2^{|Q|-K_b}$ by a factor of $2^{K_b-K_a}$, which propagates doubly exponentially through (24) to increase every $\gamma_i$ simultaneously. This effect dominates all others.
- Hypothesis specificity ($H_{\min}$): when $K_a = K_b$, the bottleneck hypothesis (the one with the smallest $H_i$) determines $\gamma_{\min}$, since $2^{|Q|-K-H_i}$ is largest, and hence $\gamma_i$ smallest, for the smallest $H_i$. Maximising $H_{\min} = \min_{i \in I_{\not\sim}} H_i$ then maximises $\gamma_{\min}$. These observations yield the lexicographic symbolic comparison key $\kappa(\hat{E}) = (|I_{\not\sim}|, K, -H_{(1)}, -H_{(2)}, \ldots)$.
- Connection to finite mixture models: the constituent $C_w$ functions as a latent model index, $p(C_w)$ is a mixing weight, and $p(e \mid C_w)$ is a component-specific likelihood, so the marginal likelihood of the evidence takes the standard finite-mixture form $p(e) = \sum_w p(C_w)\, p(e \mid C_w) = \sum_w \pi_w f_w(e)$, where $\pi_w = p(C_w)$ and $f_w(e) = p(e \mid C_w)$.
- Markov chain and conditional independence structure: the generative hierarchy induces the chain $C_w \to D_\nu \to e$. Under this chain, evidence and constituent are conditionally independent given the E-distribution, $C_w \perp e \mid D_\nu$, so $p(e \mid C_w, D_\nu) = p(e \mid D_\nu)$ whenever the E-distribution fully specifies the local structure.
- The Hintikka–Hilpinen prior: constituents of the same width $w$ are taken to be equiprobable, with $p(C_w^{(i)}) = p(C_w)/\binom{K}{w}$ and $p(C_w) = \binom{K}{w}\,\pi(\alpha, w\lambda/K) \big/ \sum_{i=0}^{K} \binom{K}{i}\,\pi(\alpha, i\lambda/K)$, where $\pi(\alpha, x) := \Gamma(\alpha + x)/\Gamma(\alpha)$ and $\alpha, \lambda$ are parameters of the inductive system, with $\alpha$ determining how fast inductive generalizations are learned.
- Direct uniformity ($\alpha = 0$): setting $\alpha = 0$ yields $\pi(0, x) = 1$ for all $x > 0$, so all $2^K$ constituents are equiprobable: $p(C_w^{(i)}) = 1/2^K$ for every constituent and $p(C_w) = \binom{K}{w}/2^K$.
- Model 1, monadic likelihood (Hintikka): applicable when the language contains only monadic predicates. Given evidence counts $n_1, \ldots, n_c$ ($\sum_j n_j = n$), $p(e \mid C_w) = \prod_{j=1}^{c} \pi(n_j, \lambda/w) \big/ \pi(n, \lambda)$ with $\pi(n_j, \lambda/w) = \Gamma(n_j + \lambda/w)/\Gamma(\lambda/w)$; a Dirichlet–multinomial form.
- Model 2, dyadic likelihood (Hilpinen): for dyadic languages the likelihood decomposes into two stages via an E-distribution $D_\nu$ that assigns each observed individual to an attributive constituent class, $p(C_w, D_\nu \mid e) = p(C_w)\, p(D_\nu \mid C_w)\, p(e \mid D_\nu, C_w) / p(e)$, with first stage $p(D_\nu \mid C_w) = \frac{\Gamma(\lambda)}{\Gamma(n + \lambda)} \prod_{j=1}^{\nu} \frac{\Gamma(n_j + \lambda/w)}{\Gamma(\lambda/w)}$.
- Model 3, Tuomela's matrix method: extends Hintikka's system to ordered universes with a single dyadic "immediate successor" predicate, associating a nonnegative matrix $A$ with the graph of each constituent so that $e^{\top} A^{k} e$ yields the number of distinct paths of length $k$; for consistent constituents these matrices are primitive and irreducible.
- Model 4, indicator likelihood (direct uniformity): the simplest model treats evidence as exact logical evidence, $p(e \mid C_w) := \mathbf{1}\{C_w \models e\}$. Combined with the uniform prior $p(C_w) = 1/2^K$ this gives $p(C_w \mid e) = \mathbf{1}\{C_w \models e\}/|C(e)|$, $p(e) = |C(e)|/2^K$, and $p(h \mid e) = |C(e \wedge h)|/|C(e)|$, distinguishing only evidence that is possible from evidence that is impossible under a constituent.
- The latent-Q model as a finite mixture: each observed evidence item $e_t$ gets a latent variable $Z_t \in Q$ for the underlying complete Q-sentence, with $p(e_t \mid C_w) = \sum_{i=1}^{|Q|} p(Z_t = i \mid C_w)\, p(e_t \mid Z_t = i, C_w)$, where $p(Z_t = i \mid C_w) = \theta_i(C_w)$ is the mixing weight and the second factor is the emission probability.
- Posterior responsibilities: the posterior responsibility of Q-sentence $Q_i$ for explaining observation $e_t$ under constituent $C_w$ is $\gamma^{(w)}_{t,i} := p(Z_t = i \mid e_t, C_w) = p(e_t \mid Q_i)\,\theta_i(C_w) \big/ \sum_j p(e_t \mid Q_j)\,\theta_j(C_w)$, directly analogous to the responsibilities $\gamma_{ti} = p(z_t = i \mid x_t, \theta)$ in a Gaussian mixture model.
- Expectation-maximisation (EM) procedure: when the constituent-conditioned Q-distributions $\theta^{(w)} = (\theta^{(w)}_1, \ldots, \theta^{(w)}_{|Q|})$ are known, the posterior over constituents can be computed directly; when they must be estimated from data, the framework becomes a genuine EM algorithm, with the E-step computed for each observation.
- Variational approximation with equal splitting: when the full E-step posterior is intractable, a mean-field family $q(z) = \prod_\ell q_\ell(z_\ell)$ is adopted, with the equal-splitting heuristic $q_\ell(i) = q(z_\ell = i \mid r_\ell = r) := M_{ri}/\deg(r)$, $\deg(r) = \sum_{j=1}^{K} M_{rj}$, where $M_{ri} \in \{0, 1\}$ is the compatibility indicator.
- The hierarchical latent model: a richer model introduces latent attributive constituents as an explicit layer, $C_w \to Ct_j \to Q_i \to e$. The likelihood becomes $p(e \mid C_w) = \sum_j p(Ct_j \mid C_w) \sum_i p(Q_i \mid Ct_j)\, p(e \mid Q_i)$, with joint posterior responsibilities over both latent layers $p(Q_i, Ct_j \mid e, C_w) \propto p(e \mid Q_i)\, p(Q_i \mid Ct_j)\, p(Ct_j \mid C_w)$.
- Reduction to a Q-level tractable score: the constituent-level second moment $\sum_C n(C)^2$ reduces exactly to a Q-level second moment in the weights $s(q)$, where $s(q)$ is the Q-level weight and $M = |Q|$. Under the entity-block model with block weights $w_u$, distinct-predicate counts $d_u$, and pairwise overlaps $o_{uv} = |A_u \cap A_v|$: $R(e) = \sum_u \frac{w_u^2}{2^{d_u}} + 2 \sum_{u < v} \frac{w_u w_v}{2^{|A_u \cap A_v|}}$.
- Indicator-likelihood surrogate: under Model 4 each per-hypothesis term reduces to $F_i \approx 2^{-\gamma_i}$ with $\gamma_i = 2^{|Q|-K} - 2^{|Q|-K-H_i}$. The sum is dominated by $\gamma_{\min}$, and optimisation reduces to a lexicographic comparison key over small integers $(c, K, -H_{(1)}, \ldots)$. This method is adopted in the present paper.
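As a small worked instance of the posterior-responsibility formula above, here is a sketch with invented numbers: `emission[t, i]` plays the role of p(e_t | Q_i) and `theta` the role of θ_i(C_w) for a single constituent; the computation is the same normalization used for responsibilities in a Gaussian mixture model.

```python
import numpy as np

def responsibilities(emission, theta):
    """gamma[t, i] = p(Z_t = i | e_t, C_w): emission[t, i] = p(e_t | Q_i),
    theta[i] = theta_i(C_w), the mixing weights under one constituent."""
    weighted = emission * theta                      # p(e_t | Q_i) * theta_i(C_w)
    return weighted / weighted.sum(axis=1, keepdims=True)

# Invented numbers: 2 observations, 3 candidate Q-sentences under one constituent.
emission = np.array([[0.90, 0.10, 0.20],
                     [0.05, 0.60, 0.30]])
theta = np.array([0.5, 0.3, 0.2])
print(responsibilities(emission, theta))             # each row sums to 1
```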
discussion (0)