Unified Estimation--Guidance Framework Based on Bayesian Decision Theory

Liraz Mudrik; Yaakov Oshman

arxiv: 2602.11373 · v2 · submitted 2026-02-11 · 📡 eess.SY · cs.SY

Unified Estimation--Guidance Framework Based on Bayesian Decision Theory

Liraz Mudrik , Yaakov Oshman This is my paper

Pith reviewed 2026-05-16 02:00 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords bayesiandecisionguidanceestimatorperformanceseparationstatetheorem

0 comments

The pith

A Bayesian decision theory framework modifies the DGL1 guidance law to incorporate estimation errors, yielding a stochastic law that complies with the generalized separation theorem and uses trajectory shaping for better estimation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The work tackles guidance problems like missile pursuit when sensors provide only noisy data instead of exact positions and velocities. Traditional methods often assume estimation and guidance can be designed separately, but this paper argues that assumption is invalid and instead uses Bayesian decision theory to find the best action given the current uncertainty. It computes the probability distribution over possible states with an interacting multiple model particle filter and selects the guidance command that minimizes expected cost. When multiple commands give the same cost, the system picks the one that improves future measurements by shaping the pursuer's path. Simulations show the combined scheme can run in real time.

Core claim

Load-bearing premise

The required posterior probability density function of the game's state can be accurately derived from noisy measurements using an interacting multiple model particle filter, enabling reliable computation of the Bayesian cost and optimal decision.

read the original abstract

Using Bayesian decision theory, we modify the perfect-information, differential game-based guidance law (DGL1) to address the inevitable estimation error occurring when driving this guidance law with a separately-designed state estimator. This yields a stochastic guidance law complying with the generalized separation theorem, as opposed to the common approach, that implicitly, but unjustifiably, assumes the validity of the regular separation theorem. The required posterior probability density function of the game's state is derived from the available noisy measurements using an interacting multiple model particle filter. When the resulting optimal decision turns out to be nonunique, this feature is harnessed to appropriately shape the trajectory of the pursuer so as to enhance its estimator's performance. In addition, certain properties of the particle-based computation of the Bayesian cost are exploited to render the algorithm amenable to real-time implementation. The performance of the entire estimation-decision-guidance scheme is demonstrated using an extensive Monte Carlo simulation study.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 3 minor

Summary. The manuscript proposes a unified estimation-guidance scheme that applies Bayesian decision theory to modify the perfect-information DGL1 differential-game guidance law, yielding a stochastic guidance command that accounts for estimation errors from a separately designed state estimator. The posterior PDF of the game state is obtained via an interacting multiple model particle filter; the resulting Bayesian cost is minimized to produce the guidance law, which is asserted to comply with the generalized separation theorem. Non-uniqueness of the optimal decision is exploited to shape the pursuer trajectory for improved estimator performance, and particle-filter properties are used to enable real-time execution. Performance is illustrated by an extensive Monte Carlo simulation study.

Significance. If the central derivation is correct, the work supplies a principled, decision-theoretic route to joint estimation-guidance design that avoids the unjustified invocation of the classical separation theorem. The explicit use of the posterior PDF, the trajectory-shaping mechanism, and the real-time implementation considerations constitute concrete advances over ad-hoc stochastic extensions of DGL1. The Monte Carlo validation, if statistically sound, would provide useful empirical support for the approach in realistic noise environments.

major comments (3)

[§3.2] §3.2, Eq. (12)–(14): the claim that the derived stochastic law satisfies the generalized separation theorem rests on the assertion that the Bayesian cost is minimized with respect to the true posterior; however, the manuscript provides no explicit proof that the resulting policy is independent of the estimator dynamics in the sense required by the generalized theorem, leaving the compliance statement unverified.
[§4.1] §4.1, Algorithm 1: the interacting multiple model particle filter is stated to deliver the required posterior PDF, yet no analysis of particle degeneracy, effective sample size, or approximation error under the high-dimensional game-state dynamics is supplied; this approximation quality is load-bearing for both the optimality claim and the real-time feasibility argument.
[§5] §5, Table 2: the reported miss-distance statistics show improvement over the baseline DGL1+KF combination, but the Monte Carlo runs do not include a comparison against other stochastic guidance laws (e.g., those derived from stochastic differential games or risk-sensitive criteria), so the magnitude of the claimed advantage cannot be assessed relative to the broader literature.

minor comments (3)

[§3] Notation for the Bayesian cost function is introduced inconsistently between §3 and the appendix; a single, unified symbol table would improve readability.
[Figure 4] Figure 4 caption does not specify the number of Monte Carlo trials or the exact noise parameters used; these details are needed to reproduce the plotted trajectories.
[§4.3] The statement that the optimal decision is “harnessed to shape the trajectory” (abstract and §4.3) would benefit from an explicit algorithmic description of the tie-breaking rule rather than a qualitative description.

Circularity Check

0 steps flagged

No significant circularity detected in derivation chain

full rationale

The paper applies Bayesian decision theory to modify the existing perfect-information DGL1 guidance law to account for estimation errors from a separately designed state estimator, producing a stochastic guidance law that complies with the generalized separation theorem. The posterior PDF is obtained via an interacting multiple model particle filter from noisy measurements, and the Bayesian cost is computed to yield the optimal decision; nonunique decisions are used to shape the pursuer trajectory for improved estimation. Performance is assessed via Monte Carlo simulation. No derivation step reduces by construction to its inputs, no fitted parameters are relabeled as predictions, and no load-bearing premise relies solely on an unverified self-citation chain. The framework is self-contained against external benchmarks with independent validation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available, so no specific free parameters, axioms, or invented entities can be extracted from the full text. The framework relies on standard Bayesian decision theory and particle filtering without detailing new postulates.

pith-pipeline@v0.9.0 · 5454 in / 1108 out tokens · 72213 ms · 2026-05-16T02:00:32.871435+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

the cost function J to be the final miss distance of the stochastic interception problem... C_ij ... distance between the current ZEM and the singular region’s nearest boundary
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

complying with the generalized separation theorem... posterior probability density function of the game’s state

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

OnOptimalGuidanceforHomingMissiles,

Gutman,S.,“OnOptimalGuidanceforHomingMissiles,”JournalofGuidance,Control,andDynamics, Vol. 2, No. 4, 1979, pp. 296–300. https://doi.org/10.2514/3.55878

work page doi:10.2514/3.55878 1979
[2]

Solution Techniques for Realistic Pursuit-Evasion Games,

Shinar, J., “Solution Techniques for Realistic Pursuit-Evasion Games,”Control and Dynamic Systems, Advances in Theory and Applications, Vol. 17, edited by C. T. Leondes, Academic Press, 1981, pp. 63–124. https://doi.org/10.1016/B978-0-12-012717-7.50009-7

work page doi:10.1016/b978-0-12-012717-7.50009-7 1981
[3]

Estimation-Based Guidance Using Optimal Bayesian Decision,

Mudrik, L., and Oshman, Y., “Estimation-Based Guidance Using Optimal Bayesian Decision,”2019 27th Mediterranean Conference on Control and Automation (MED), 2019, pp. 136–141. https: //doi.org/10.1109/MED.2019.8798557

work page doi:10.1109/med.2019.8798557 2019
[4]

L.,Detection, Estimation, and Modulation Theory, Part I: Detection, Estimation, and Linear Modulation Theory, John Wiley & Sons, 2004, Chap

Van Trees, H. L.,Detection, Estimation, and Modulation Theory, Part I: Detection, Estimation, and Linear Modulation Theory, John Wiley & Sons, 2004, Chap. 2

work page 2004
[5]

Exact Bayesian and Particle Filtering of Stochastic Hybrid Systems,

Blom, H. A. P., and Bloem, E. A., “Exact Bayesian and Particle Filtering of Stochastic Hybrid Systems,”IEEE Transactions on Aerospace and Electronic Systems, Vol. 43, No. 1, 2007, pp. 55–70. https://doi.org/10.1109/TAES.2007.357154

work page doi:10.1109/taes.2007.357154 2007
[6]

Stochastic Cooperative Interception Using Information Sharing Based on Engagement Staggering,

Shaferman, V., and Oshman, Y., “Stochastic Cooperative Interception Using Information Sharing Based on Engagement Staggering,”Journal of Guidance, Control, and Dynamics, Vol. 39, No. 9, 2016, pp. 2127–2141. https://doi.org/10.2514/1.G000437

work page doi:10.2514/1.g000437 2016
[7]

Estimation-Guided Guidance and Its Implementation via Sequential Monte Carlo Computation,

Shaviv, I. G., and Oshman, Y., “Estimation-Guided Guidance and Its Implementation via Sequential Monte Carlo Computation,”Journal of Guidance, Control, and Dynamics, Vol. 40, No. 2, 2017, pp. 402–417. https://doi.org/10.2514/1.G000360

work page doi:10.2514/1.g000360 2017
[8]

Information-Enhancement via Trajectory Shaping in Bayesian Decision- Directed Stochastic Guidance,

Mudrik, L., and Oshman, Y., “Information-Enhancement via Trajectory Shaping in Bayesian Decision- Directed Stochastic Guidance,”AIAA SCITECH 2023 Forum, National Harbor, MD, 2023. https: //doi.org/10.2514/6.2023-2497

work page doi:10.2514/6.2023-2497 2023
[9]

Separation of Estimation and Control for Discrete Time Systems,

Witsenhausen, H. S., “Separation of Estimation and Control for Discrete Time Systems,”Proceedings of the IEEE, Vol. 59, No. 11, 1971, pp. 1557–1566. https://doi.org/10.1109/PROC.1971.8488. 43

work page doi:10.1109/proc.1971.8488 1971
[10]

Sufficientstatisticsintheoptimumcontrolofstochasticsystems,

Striebel,C.,“Sufficientstatisticsintheoptimumcontrolofstochasticsystems,”JournalofMathematical Analysis and Applications, Vol. 12, No. 3, 1965, pp. 576–592. https://doi.org/10.1016/0022-247X(65) 90027-2

work page doi:10.1016/0022-247x(65 1965
[11]

F.,Stochastic Optimal Control: Theory and Application, Wiley, 1986, Chap

Stengel, R. F.,Stochastic Optimal Control: Theory and Application, Wiley, 1986, Chap. 5

work page 1986
[12]

What Happens When Certainty Equivalence is Not Valid?: Is There an Optimal Estimator for Terminal Guidance?

Shinar, J., and Turetsky, V., “What Happens When Certainty Equivalence is Not Valid?: Is There an Optimal Estimator for Terminal Guidance?”IFAC Proceedings Volumes, Vol. 36, No. 8, 2003, pp. 175–184. https://doi.org/10.1016/S1474-6670(17)35780-4

work page doi:10.1016/s1474-6670(17)35780-4 2003
[13]

IntegratedEstimation/GuidanceDesignApproachforImproved HomingAgainstRandomlyManeuveringTargets,

Shinar,J.,Turetsky,V.,andOshman,Y.,“IntegratedEstimation/GuidanceDesignApproachforImproved HomingAgainstRandomlyManeuveringTargets,”JournalofGuidance,Control,andDynamics,Vol.30, No. 1, 2007, pp. 154–161. https://doi.org/10.2514/1.22916

work page doi:10.2514/1.22916 2007
[14]

An Adaptive Terminal Guidance Scheme Based on an Exponential Cost Criterion with Application to Homing Missile Guidance,

Speyer, J., “An Adaptive Terminal Guidance Scheme Based on an Exponential Cost Criterion with Application to Homing Missile Guidance,”IEEE Transactions on Automatic Control, Vol. 21, No. 3, 1976, pp. 371–375. https://doi.org/10.1109/TAC.1976.1101206

work page doi:10.1109/tac.1976.1101206 1976
[15]

Stochastic Optimal Control Guidance Law with Bounded Acceleration,

Hexner, G., and Shima, T., “Stochastic Optimal Control Guidance Law with Bounded Acceleration,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 43, No. 1, 2007, pp. 71–78. https: //doi.org/10.1109/TAES.2007.357155

work page doi:10.1109/taes.2007.357155 2007
[16]

LQG Guidance Law with Bounded Acceleration Command,

Hexner, G., Shima, T., and Weiss, H., “LQG Guidance Law with Bounded Acceleration Command,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 44, No. 1, 2008, pp. 77–86. https: //doi.org/10.1109/TAES.2008.4516990

work page doi:10.1109/taes.2008.4516990 2008
[17]

The Interacting Multiple Model Algorithm for Systems with Markovian Switching Coefficients,

Blom, H., and Bar-Shalom, Y., “The Interacting Multiple Model Algorithm for Systems with Markovian Switching Coefficients,”IEEE Transactions on Automatic Control, Vol. 33, No. 8, 1988, pp. 780–783. https://doi.org/10.1109/9.1299

work page doi:10.1109/9.1299 1988
[18]

Near-Optimal Evasion from Pursuers Employing Modern Linear Guidance Laws,

Shaferman, V., “Near-Optimal Evasion from Pursuers Employing Modern Linear Guidance Laws,” Journal of Guidance, Control, and Dynamics, 2021, pp. 1–13. https://doi.org/10.2514/1.G005725. 44

work page doi:10.2514/1.g005725 2021
[19]

Posterior Cramer-Rao bounds for discrete-time nonlinear filtering,

Tichavsky, P., Muravchik, C., and Nehorai, A., “Posterior Cramer-Rao bounds for discrete-time nonlinear filtering,”IEEE Trans. Signal Process., Vol. 46, No. 5, 1998, pp. 1386–1396. https: //doi.org/10.1109/78.668800

work page doi:10.1109/78.668800 1998
[20]

A Particle Filter Approach to Approximate Posterior Cramer-Rao Lower Bound: The Case of Hidden States,

Tulsyan, A., Huang, B., Gopaluni, R. B., and Forbes, J. F., “A Particle Filter Approach to Approximate Posterior Cramer-Rao Lower Bound: The Case of Hidden States,”IEEE Transactions on Aerospace and Electronic Systems, Vol. 49, No. 4, 2013, pp. 2478–2495. https://doi.org/10.1109/TAES.2013.6621830

work page doi:10.1109/taes.2013.6621830 2013
[21]

Optimizationofobservertrajectoriesforbearings-onlytargetlocalization,

Oshman,Y.,andDavidson,P.,“Optimizationofobservertrajectoriesforbearings-onlytargetlocalization,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 35, No. 3, 1999, pp. 892–902. https://doi.org/10.1109/7.784059. 45

work page doi:10.1109/7.784059 1999

[1] [1]

OnOptimalGuidanceforHomingMissiles,

Gutman,S.,“OnOptimalGuidanceforHomingMissiles,”JournalofGuidance,Control,andDynamics, Vol. 2, No. 4, 1979, pp. 296–300. https://doi.org/10.2514/3.55878

work page doi:10.2514/3.55878 1979

[2] [2]

Solution Techniques for Realistic Pursuit-Evasion Games,

Shinar, J., “Solution Techniques for Realistic Pursuit-Evasion Games,”Control and Dynamic Systems, Advances in Theory and Applications, Vol. 17, edited by C. T. Leondes, Academic Press, 1981, pp. 63–124. https://doi.org/10.1016/B978-0-12-012717-7.50009-7

work page doi:10.1016/b978-0-12-012717-7.50009-7 1981

[3] [3]

Estimation-Based Guidance Using Optimal Bayesian Decision,

Mudrik, L., and Oshman, Y., “Estimation-Based Guidance Using Optimal Bayesian Decision,”2019 27th Mediterranean Conference on Control and Automation (MED), 2019, pp. 136–141. https: //doi.org/10.1109/MED.2019.8798557

work page doi:10.1109/med.2019.8798557 2019

[4] [4]

L.,Detection, Estimation, and Modulation Theory, Part I: Detection, Estimation, and Linear Modulation Theory, John Wiley & Sons, 2004, Chap

Van Trees, H. L.,Detection, Estimation, and Modulation Theory, Part I: Detection, Estimation, and Linear Modulation Theory, John Wiley & Sons, 2004, Chap. 2

work page 2004

[5] [5]

Exact Bayesian and Particle Filtering of Stochastic Hybrid Systems,

Blom, H. A. P., and Bloem, E. A., “Exact Bayesian and Particle Filtering of Stochastic Hybrid Systems,”IEEE Transactions on Aerospace and Electronic Systems, Vol. 43, No. 1, 2007, pp. 55–70. https://doi.org/10.1109/TAES.2007.357154

work page doi:10.1109/taes.2007.357154 2007

[6] [6]

Stochastic Cooperative Interception Using Information Sharing Based on Engagement Staggering,

Shaferman, V., and Oshman, Y., “Stochastic Cooperative Interception Using Information Sharing Based on Engagement Staggering,”Journal of Guidance, Control, and Dynamics, Vol. 39, No. 9, 2016, pp. 2127–2141. https://doi.org/10.2514/1.G000437

work page doi:10.2514/1.g000437 2016

[7] [7]

Estimation-Guided Guidance and Its Implementation via Sequential Monte Carlo Computation,

Shaviv, I. G., and Oshman, Y., “Estimation-Guided Guidance and Its Implementation via Sequential Monte Carlo Computation,”Journal of Guidance, Control, and Dynamics, Vol. 40, No. 2, 2017, pp. 402–417. https://doi.org/10.2514/1.G000360

work page doi:10.2514/1.g000360 2017

[8] [8]

Information-Enhancement via Trajectory Shaping in Bayesian Decision- Directed Stochastic Guidance,

Mudrik, L., and Oshman, Y., “Information-Enhancement via Trajectory Shaping in Bayesian Decision- Directed Stochastic Guidance,”AIAA SCITECH 2023 Forum, National Harbor, MD, 2023. https: //doi.org/10.2514/6.2023-2497

work page doi:10.2514/6.2023-2497 2023

[9] [9]

Separation of Estimation and Control for Discrete Time Systems,

Witsenhausen, H. S., “Separation of Estimation and Control for Discrete Time Systems,”Proceedings of the IEEE, Vol. 59, No. 11, 1971, pp. 1557–1566. https://doi.org/10.1109/PROC.1971.8488. 43

work page doi:10.1109/proc.1971.8488 1971

[10] [10]

Sufficientstatisticsintheoptimumcontrolofstochasticsystems,

Striebel,C.,“Sufficientstatisticsintheoptimumcontrolofstochasticsystems,”JournalofMathematical Analysis and Applications, Vol. 12, No. 3, 1965, pp. 576–592. https://doi.org/10.1016/0022-247X(65) 90027-2

work page doi:10.1016/0022-247x(65 1965

[11] [11]

F.,Stochastic Optimal Control: Theory and Application, Wiley, 1986, Chap

Stengel, R. F.,Stochastic Optimal Control: Theory and Application, Wiley, 1986, Chap. 5

work page 1986

[12] [12]

What Happens When Certainty Equivalence is Not Valid?: Is There an Optimal Estimator for Terminal Guidance?

Shinar, J., and Turetsky, V., “What Happens When Certainty Equivalence is Not Valid?: Is There an Optimal Estimator for Terminal Guidance?”IFAC Proceedings Volumes, Vol. 36, No. 8, 2003, pp. 175–184. https://doi.org/10.1016/S1474-6670(17)35780-4

work page doi:10.1016/s1474-6670(17)35780-4 2003

[13] [13]

IntegratedEstimation/GuidanceDesignApproachforImproved HomingAgainstRandomlyManeuveringTargets,

Shinar,J.,Turetsky,V.,andOshman,Y.,“IntegratedEstimation/GuidanceDesignApproachforImproved HomingAgainstRandomlyManeuveringTargets,”JournalofGuidance,Control,andDynamics,Vol.30, No. 1, 2007, pp. 154–161. https://doi.org/10.2514/1.22916

work page doi:10.2514/1.22916 2007

[14] [14]

An Adaptive Terminal Guidance Scheme Based on an Exponential Cost Criterion with Application to Homing Missile Guidance,

Speyer, J., “An Adaptive Terminal Guidance Scheme Based on an Exponential Cost Criterion with Application to Homing Missile Guidance,”IEEE Transactions on Automatic Control, Vol. 21, No. 3, 1976, pp. 371–375. https://doi.org/10.1109/TAC.1976.1101206

work page doi:10.1109/tac.1976.1101206 1976

[15] [15]

Stochastic Optimal Control Guidance Law with Bounded Acceleration,

Hexner, G., and Shima, T., “Stochastic Optimal Control Guidance Law with Bounded Acceleration,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 43, No. 1, 2007, pp. 71–78. https: //doi.org/10.1109/TAES.2007.357155

work page doi:10.1109/taes.2007.357155 2007

[16] [16]

LQG Guidance Law with Bounded Acceleration Command,

Hexner, G., Shima, T., and Weiss, H., “LQG Guidance Law with Bounded Acceleration Command,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 44, No. 1, 2008, pp. 77–86. https: //doi.org/10.1109/TAES.2008.4516990

work page doi:10.1109/taes.2008.4516990 2008

[17] [17]

The Interacting Multiple Model Algorithm for Systems with Markovian Switching Coefficients,

Blom, H., and Bar-Shalom, Y., “The Interacting Multiple Model Algorithm for Systems with Markovian Switching Coefficients,”IEEE Transactions on Automatic Control, Vol. 33, No. 8, 1988, pp. 780–783. https://doi.org/10.1109/9.1299

work page doi:10.1109/9.1299 1988

[18] [18]

Near-Optimal Evasion from Pursuers Employing Modern Linear Guidance Laws,

Shaferman, V., “Near-Optimal Evasion from Pursuers Employing Modern Linear Guidance Laws,” Journal of Guidance, Control, and Dynamics, 2021, pp. 1–13. https://doi.org/10.2514/1.G005725. 44

work page doi:10.2514/1.g005725 2021

[19] [19]

Posterior Cramer-Rao bounds for discrete-time nonlinear filtering,

Tichavsky, P., Muravchik, C., and Nehorai, A., “Posterior Cramer-Rao bounds for discrete-time nonlinear filtering,”IEEE Trans. Signal Process., Vol. 46, No. 5, 1998, pp. 1386–1396. https: //doi.org/10.1109/78.668800

work page doi:10.1109/78.668800 1998

[20] [20]

A Particle Filter Approach to Approximate Posterior Cramer-Rao Lower Bound: The Case of Hidden States,

Tulsyan, A., Huang, B., Gopaluni, R. B., and Forbes, J. F., “A Particle Filter Approach to Approximate Posterior Cramer-Rao Lower Bound: The Case of Hidden States,”IEEE Transactions on Aerospace and Electronic Systems, Vol. 49, No. 4, 2013, pp. 2478–2495. https://doi.org/10.1109/TAES.2013.6621830

work page doi:10.1109/taes.2013.6621830 2013

[21] [21]

Optimizationofobservertrajectoriesforbearings-onlytargetlocalization,

Oshman,Y.,andDavidson,P.,“Optimizationofobservertrajectoriesforbearings-onlytargetlocalization,” IEEE Transactions on Aerospace and Electronic Systems, Vol. 35, No. 3, 1999, pp. 892–902. https://doi.org/10.1109/7.784059. 45

work page doi:10.1109/7.784059 1999