Dual Control of Linear Systems from Bilinear Observations with Belief Space Model Predictive Control
Pith reviewed 2026-05-08 02:20 UTC · model grok-4.3
The pith
Belief-space MPC plans over state estimates and input-dependent covariances to improve control when actions affect observation quality.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In finite-horizon quadratic control of linear systems with bilinear observations, the separation principle fails because control inputs affect the future quality of state estimates obtained from an input-dependent Kalman filter. Belief-space model predictive control (B-MPC) addresses this by planning directly over both the estimated state and its error covariance, using a deterministic surrogate of the belief evolution. In synthetic numerical tests this produces lower estimation covariance and more uncertainty-aware actions than either the separation-principle controller or its MPC variant.
What carries the argument
Belief-space model predictive control (B-MPC) that optimizes over state estimates and the deterministic trajectory of the input-dependent Kalman-filter covariance matrix.
If this is right
- B-MPC outperforms separation-principle controllers and their MPC variants in regimes where control inputs improve future observation quality.
- The method produces lower closed-loop estimation covariance than non-dual controllers.
- Selected actions explicitly trade off immediate cost against future reduction in uncertainty.
- The approach applies to any finite-horizon linear-quadratic problem whose observation model is bilinear in state and input.
Where Pith is reading between the lines
- The same deterministic-surrogate idea could be tested on systems whose observation model is only approximately bilinear or mildly nonlinear.
- Replacing the deterministic covariance propagation with sampled trajectories inside the planner would quantify the approximation error for highly stochastic regimes.
- The framework suggests that explicit modeling of information-gathering value can be added to standard MPC without leaving the linear-quadratic setting.
Load-bearing premise
The deterministic surrogate of the stochastic belief evolution defined by the input-dependent Kalman filter is sufficiently accurate to produce effective control plans despite the underlying randomness in states and observations.
What would settle it
A side-by-side run on the same linear system in which the planner uses the deterministic covariance surrogate versus a version that draws full stochastic realizations of the belief trajectory; if the performance gap disappears or reverses under high process noise, the surrogate approximation is the limiting factor.
Figures
read the original abstract
We study finite-horizon quadratic control of linear systems with bilinear observations, in which the control input affects not only the state dynamics but also the partial observations of the state. In this setting, the separation principle can fail because control inputs influence the future quality of state estimates. State estimation requires an input-dependent Kalman filter whose gain and error covariance evolve as functions of the control inputs. To address this challenge, we propose a belief-space model predictive control ($\texttt{B-MPC}$) method that plans directly over both the estimated state and its error covariance. In particular, $\texttt{B-MPC}$ plans with a deterministic surrogate of the belief evolution defined by the input-dependent Kalman filter. Through numerical experiments in two synthetic settings, we show that $\texttt{B-MPC}$ can outperform both the separation-principle controller and its MPC variant in favorable regimes, and that these gains are accompanied by lower estimation covariance and more uncertainty-aware action choices.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies finite-horizon quadratic control of linear systems with bilinear observations, in which control inputs affect both state dynamics and observation quality. It proposes a belief-space model predictive control (B-MPC) method that plans directly over the estimated state and its error covariance using a deterministic surrogate of the input-dependent Kalman filter belief evolution. Numerical experiments in two synthetic settings are used to claim that B-MPC outperforms both the separation-principle controller and its MPC variant in favorable regimes, with accompanying reductions in estimation covariance and more uncertainty-aware actions.
Significance. If the empirical claims hold, the work provides a pragmatic approximation method for dual control settings where the separation principle fails. The deterministic surrogate of input-dependent belief evolution is a reasonable modeling choice that could enable uncertainty-aware planning in applications such as sensor scheduling or active perception.
major comments (1)
- [Numerical experiments] Numerical experiments section: the abstract and description report outperformance in two synthetic settings but supply no details on experiment design, baseline implementations, number of Monte Carlo trials, statistical tests, or the precise parameter regimes tested. The qualification to 'favorable regimes' therefore cannot be evaluated for robustness or risk of post-hoc selection.
Simulated Author's Rebuttal
We thank the referee for their careful reading and constructive feedback on our manuscript. We address the single major comment below and agree that additional details are needed to strengthen the presentation of the numerical results.
read point-by-point responses
-
Referee: Numerical experiments section: the abstract and description report outperformance in two synthetic settings but supply no details on experiment design, baseline implementations, number of Monte Carlo trials, statistical tests, or the precise parameter regimes tested. The qualification to 'favorable regimes' therefore cannot be evaluated for robustness or risk of post-hoc selection.
Authors: We agree that the current numerical experiments section does not provide sufficient detail for readers to assess the robustness of the reported outperformance. In the revised manuscript we will expand the section to include: (i) explicit descriptions of the two synthetic system models, including all parameter values, initial conditions, and horizon lengths; (ii) implementation details for the separation-principle controller and its MPC variant (e.g., how the Kalman filter is run, any approximations used); (iii) the exact number of Monte Carlo trials performed for each reported result; (iv) the statistical measures or tests used to compare controllers; and (v) a clearer delineation of the parameter regimes tested, together with additional results or discussion showing that the favorable regimes were identified a priori rather than through post-hoc selection. These additions will allow the claims to be evaluated more rigorously. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper defines B-MPC directly from the standard input-dependent Kalman filter equations for belief evolution (with deterministic surrogate) combined with quadratic MPC planning over estimated state and covariance. No load-bearing step reduces by construction to a fitted parameter, self-citation chain, or renamed input; the separation-principle baseline and its MPC variant are external comparators, and the numerical experiments in synthetic settings serve as independent empirical validation rather than tautological prediction. The modeling choice is explicitly stated as an approximation whose effectiveness is tested, not assumed by definition.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption System dynamics are linear and observations are bilinear in the control input
- standard math Finite-horizon quadratic cost and Gaussian noise
invented entities (1)
-
Deterministic surrogate of belief evolution
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Explore-then-commit for nonstationary linear bandits with latent dynamics
Sunmook Choi, Yahya Sattar, Yassir Jedra, Maryam Fazel, and Sarah Dean. Explore-then-commit for nonstationary linear bandits with latent dynamics. arXiv preprint arXiv:2510.16208 , 2025
-
[2]
Lars Gr \"u ne and J \"u rgen Pannek. Nonlinear model predictive control. In Nonlinear model predictive control: Theory and algorithms , pages 45--69. Springer, 2016
work page 2016
-
[3]
A new approach to linear filtering and prediction problems
Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. 1960
work page 1960
-
[4]
Champion-level drone racing using deep reinforcement learning
Elia Kaufmann, Leonard Bauersfeld, Antonio Loquercio, Matthias M \"u ller, Vladlen Koltun, and Davide Scaramuzza. Champion-level drone racing using deep reinforcement learning. Nature , 620(7976):982--987, 2023
work page 2023
-
[5]
Model predictive control of bilinear systems as uncertain linear systems
Sahand Hadizadeh Kafash, Justin Koeln, and Justin Ruths. Model predictive control of bilinear systems as uncertain linear systems. In 2022 IEEE Conference on Control Technology and Applications (CCTA) , pages 562--567. IEEE, 2022
work page 2022
-
[6]
Leslie Pack Kaelbling, Michael L. Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence , 101(1--2):99--134, 1998
work page 1998
-
[7]
Diyou Liu and Mohammad Khosravi. System identification for linear dynamics with bilinear observation models: An expectation–maximization approach. In 2024 IEEE 63rd Conference on Decision and Control (CDC) , pages 7190--7195, 2024
work page 2024
-
[8]
Model predictive control: Recent developments and future promise
David Q Mayne. Model predictive control: Recent developments and future promise. Automatica , 50(12):2967--2986, 2014
work page 2014
-
[9]
Keiko Nagami and Mac Schwager. State estimation and belief space planning under epistemic uncertainty for learning-based perception systems. IEEE Robotics and Automation Letters , 9(6):5118--5125, 2024
work page 2024
-
[10]
Christos H. Papadimitriou and John N. Tsitsiklis. The complexity of markov decision processes. Mathematics of Operations Research , 12(3):441--450, 1987
work page 1987
-
[11]
Optimization and control of bilinear systems: theory, algorithms, and applications , volume 11
Panos M Pardalos and Vitaliy A Yatsenko. Optimization and control of bilinear systems: theory, algorithms, and applications , volume 11. Springer Science & Business Media, 2010
work page 2010
-
[12]
J.B. Rawlings, D.Q. Mayne, and M. Diehl. Model Predictive Control: Theory, Computation, and Design . Nob Hill Publishing, LLC, 2024
work page 2024
-
[13]
Sub-optimality of the separation principle for quadratic control from bilinear observations
Yahya Sattar, Sunmook Choi, Yassir Jedra, Maryam Fazel, and Sarah Dean. Sub-optimality of the separation principle for quadratic control from bilinear observations. In 2025 IEEE 64th Conference on Decision and Control (CDC) , pages 3862--3867. IEEE, 2025
work page 2025
-
[14]
Learning linear dynamics from bilinear observations
Yahya Sattar, Yassir Jedra, and Sarah Dean. Learning linear dynamics from bilinear observations. In 2025 American Control Conference (ACC) , pages 3109--3115. IEEE, 2025
work page 2025
-
[15]
Yifan Xie, Julian Berberich, Robin Str \"a sser, and Frank Allg \"o wer. Bilinear data-driven min-max mpc: Designing rational controllers via sum-of-squares optimization. In 2025 IEEE 64th Conference on Decision and Control (CDC) , pages 1042--1047. IEEE, 2025
work page 2025
-
[16]
Optimal sampling-based motion planning in gaussian belief space for minimum sensing navigation, 2023
Vrushabh Zinage, Ali Reza Pedram, and Takashi Tanaka. Optimal sampling-based motion planning in gaussian belief space for minimum sensing navigation, 2023
work page 2023
-
[17]
Belief space planning: A covariance steering approach
Dongliang Zheng, Jack Ridderhof, Panagiotis Tsiotras, and Ali-akbar Agha-mohammadi. Belief space planning: A covariance steering approach. In 2022 International Conference on Robotics and Automation (ICRA) , pages 11051--11057. IEEE, 2022
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.