Control-Channel Informativity for Koopman EDMDc under Behavior-Policy Data
Pith reviewed 2026-05-20 09:41 UTC · model grok-4.3
The pith
The strict positivity of residual input covariance after state projection is necessary and sufficient for finite-sample identifiability of the lifted control-channel block in EDMDc.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the conditional intervention certificate, defined as the residual covariance of inputs after orthogonal projection away from the span of active lifted-state features and realized as the Schur complement of the lifted-state block in the EDMDc information matrix, must be strictly positive to guarantee that the lifted control-channel block is identifiable from finite samples. When this certificate vanishes, there exist distinct lifted models that agree on every collected transition yet produce different predictions under counterfactual inputs. The result is supported by a closed-loop statistical bound that uses predictable regressors and conditionally sub-Gaussian nose
What carries the argument
The conditional intervention certificate, which measures residual input covariance after projection onto the orthogonal complement of the active lifted-state feature span.
If this is right
- Strict positivity of the certificate guarantees unique recovery of the lifted control-channel block from the given finite samples.
- Vanishing certificate allows multiple models to agree on collected transitions while differing on counterfactual inputs.
- Under scalar dithered feedback the residual intervention information grows quadratically with dither amplitude.
- Control-channel estimation error scales inversely with the intervention signal-to-noise ratio.
- State coverage, joint-regression conditioning, and intervention excitation function as complementary rather than interchangeable diagnostics.
Where Pith is reading between the lines
- Behavior policies used for data collection could be deliberately augmented with small persistent excitation to keep the certificate positive and thereby support later control design.
- The same residual-covariance diagnostic may be useful in other off-policy model-learning settings to decide when collected trajectories suffice for intervention effects.
- Online monitoring of the certificate during data acquisition could trigger adaptive policy adjustments that restore identifiability without full re-collection.
- In high-dimensional or partially observed systems the certificate could serve as a practical stopping criterion for data gathering before attempting control synthesis.
Load-bearing premise
Transition noise is conditionally sub-Gaussian and regressors are predictable.
What would settle it
A finite data set generated by a behavior policy in which the computed Schur complement is zero or negative, yet every pair of lifted models that fit the observed transitions produce identical outputs under new input sequences, would disprove necessity and sufficiency.
Figures
read the original abstract
Extended dynamic mode decomposition with control (EDMDc) is often trained from trajectories generated by a behavior policy or a pre-existing feedback controller. Such data can predict the observed behavior accurately while failing to identify how new input commands change the lifted state. This paper studies that failure as a control-channel informativity problem. We introduce a conditional intervention certificate, defined as the residual input covariance after projecting the input data away from the active lifted-state feature span. The certificate is the Schur complement of the lifted-state block in the EDMDc information matrix. We prove that its strict positivity is necessary and sufficient for finite-sample sample- identifiability of the lifted control-channel block. If the certificate vanishes, distinct lifted models agree on every collected transition but disagree under counterfactual inputs. We then give a closed-loop statistical bound using predictable regressors, conditionally sub-Gaussian transition noise, and a regularized Schur complement. A scalar feedback example shows the unavoidable scaling: under dithered feedback, residual intervention information grows quadratically with the dither amplitude and the control-channel error decreases with the inverse intervention signal-to-noise scale. New experiments verify these scalings exactly in a linear system and diagnostically in controlled Duffing and Van der Pol benchmarks. A larger EDMDc acquisition grid further shows that state coverage, joint regression conditioning, and intervention excitation are complementary diagnostics rather than interchangeable performance score.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies control-channel informativity in EDMDc when trajectories are generated under a behavior policy or existing feedback controller. It defines a conditional intervention certificate as the Schur complement of the lifted-state block within the EDMDc Gram matrix and proves that strict positivity of this certificate is necessary and sufficient for finite-sample identifiability of the lifted control-channel parameters. The work further derives a closed-loop statistical bound under predictable regressors and conditionally sub-Gaussian transition noise, illustrates unavoidable scaling with dither amplitude in a scalar feedback example, and validates the scalings on linear and nonlinear benchmark systems.
Significance. If the algebraic characterization and statistical bound hold, the paper supplies a concrete, computable diagnostic for whether behavior-policy data can identify control effects in Koopman models. This is valuable for data-driven control applications where open-loop excitation is unavailable. The direct link to the normal equations of joint least-squares and the explicit necessity construction via an alternative control matrix are clear strengths; the scaling result with dither amplitude offers practical guidance.
major comments (2)
- [Section 3] Section 3 (finite-sample identifiability): The necessity direction constructs an alternative control matrix that agrees on observed transitions when the certificate vanishes. Please confirm that this construction remains valid for arbitrary feature maps and does not implicitly require the lifted state to be a faithful representation of the original dynamics.
- [Section 4] Section 4 (closed-loop statistical bound): The bound is stated for the regularized Schur complement. Clarify whether the regularization parameter must be chosen independently of the data or can be adapted, and whether the resulting high-probability statement still implies practical identifiability when the unregularized certificate is only marginally positive.
minor comments (3)
- Notation: The term 'conditional intervention certificate' is introduced without an explicit symbol; introducing a compact notation (e.g., C_I) would improve readability when the quantity is referenced repeatedly in theorems and experiments.
- Experiments: The Duffing and Van der Pol results are described as 'diagnostic'; adding a quantitative table that reports the certificate value alongside the observed control-channel error for each acquisition grid would make the complementarity claim easier to verify.
- References: The connection to existing persistency-of-excitation conditions in adaptive control or subspace identification is mentioned only briefly; a short paragraph contrasting the Schur-complement certificate with classical rank conditions would help situate the contribution.
Simulated Author's Rebuttal
We thank the referee for the thorough review and the encouraging recommendation for minor revision. The comments help clarify important aspects of the identifiability results and statistical bounds. We address each major comment below.
read point-by-point responses
-
Referee: [Section 3] Section 3 (finite-sample identifiability): The necessity direction constructs an alternative control matrix that agrees on observed transitions when the certificate vanishes. Please confirm that this construction remains valid for arbitrary feature maps and does not implicitly require the lifted state to be a faithful representation of the original dynamics.
Authors: The necessity construction is algebraic and operates solely in the space of lifted features. Given a vanishing Schur complement, we explicitly construct an alternative control matrix B' such that the lifted model with B' produces identical one-step predictions on all observed (lifted-state, input) pairs, yet differs under new inputs. This argument relies only on the linear algebra of the Gram matrix and the definition of the Schur complement; it holds for any feature map and makes no reference to whether the lifted coordinates faithfully embed the original state space. The result concerns identifiability of the lifted control parameters, which is the relevant quantity for subsequent control design. A short clarifying paragraph has been added to Section 3. revision: yes
-
Referee: [Section 4] Section 4 (closed-loop statistical bound): The bound is stated for the regularized Schur complement. Clarify whether the regularization parameter must be chosen independently of the data or can be adapted, and whether the resulting high-probability statement still implies practical identifiability when the unregularized certificate is only marginally positive.
Authors: The regularization parameter is a fixed positive constant selected independently of the data, typically on the order of the noise variance divided by the sample size or a similar a priori quantity. Adaptation to the data is not required for the high-probability statement and would complicate the analysis. When the unregularized certificate is only marginally positive, the bound on the regularized quantity remains valid and implies that the control-channel estimation error is controlled by the inverse of (certificate - λ); the resulting guarantee is correspondingly weaker, which accurately reflects the limited practical identifiability in that regime. We have inserted additional explanatory text after the main theorem in Section 4 to make these distinctions explicit. revision: yes
Circularity Check
Central identifiability claim reduces to algebraic definition of the certificate
specific steps
-
self definitional
[Abstract]
"We introduce a conditional intervention certificate, defined as the residual input covariance after projecting the input data away from the active lifted-state feature span. The certificate is the Schur complement of the lifted-state block in the EDMDc information matrix. We prove that its strict positivity is necessary and sufficient for finite-sample sample-identifiability of the lifted control-channel block."
The certificate is defined to be the Schur complement, which by construction is the precise algebraic condition (full column rank of the residual regressors) that guarantees a unique least-squares solution for the control coefficients in the joint normal equations. The claimed necessity-and-sufficiency proof therefore reduces directly to this definition, with no additional derivation or external content.
full rationale
The paper defines the conditional intervention certificate explicitly as the Schur complement of the lifted-state block in the EDMDc Gram matrix (i.e., residual input covariance after projection onto the state features). It then claims to prove that strict positivity of this quantity is necessary and sufficient for finite-sample identifiability of the control-channel block. This equivalence is exactly the condition for unique solvability of the normal equations in the joint least-squares problem, which holds by standard block-matrix linear algebra once the certificate is so defined. The necessity direction is shown by constructing an alternative control matrix that matches all observed transitions precisely when the residual covariance is singular. No external assumptions, self-citations, or fitted parameters are required for the equivalence; the result is therefore self-definitional rather than independently derived.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Transition noise is conditionally sub-Gaussian
- domain assumption Regressors are predictable
invented entities (1)
-
Conditional intervention certificate
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce a conditional intervention certificate, defined as the residual input covariance after projecting the input data away from the active lifted-state feature span. The certificate is the Schur complement of the lifted-state block in the EDMDc information matrix. We prove that its strict positivity is necessary and sufficient for finite-sample sample-identifiability of the lifted control-channel block.
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
If βN = 0, then there exist a ≠ 0 and h such that a⊤U = h⊤Z. For any c, define Ac = A⋆ − c h⊤, Bc = B⋆ + c a⊤. Then AcZ + BcU = A⋆Z + B⋆U on the collected data.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
A data-driven approximation of the koopman operator: Extending dynamic mode decomposition,
M. O. Williams, I. G. Kevrekidis, and C. W. Rowley, “A data-driven approximation of the koopman operator: Extending dynamic mode decomposition,”Journal of Nonlinear Science, vol. 25, pp. 1307–1346, 2015
work page 2015
-
[2]
Dynamic mode decomposition with control,
J. L. Proctor, S. L. Brunton, and J. N. Kutz, “Dynamic mode decomposition with control,”SIAM Journal on Applied Dynamical Systems, vol. 15, no. 1, pp. 142–161, 2016
work page 2016
-
[3]
Linear predictors for nonlinear dynamical systems: Koopman operator meets model predictive control,
M. Korda and I. Mezi ´c, “Linear predictors for nonlinear dynamical systems: Koopman operator meets model predictive control,”Automatica, vol. 93, pp. 149–160, 2018
work page 2018
-
[4]
Modern koopman theory for dynamical systems,
S. L. Brunton, M. Budiši ´c, E. Kaiser, and J. N. Kutz, “Modern koopman theory for dynamical systems,”SIAM Review, vol. 64, no. 2, pp. 229–340, 2022. 13
work page 2022
-
[5]
Active learning of dynamics for data-driven control using koopman operators,
I. Abraham and T. D. Murphey, “Active learning of dynamics for data-driven control using koopman operators,”IEEE Transactions on Robotics, vol. 35, no. 5, pp. 1071–1083, 2019
work page 2019
-
[6]
Koopman operators for generalized persistence of excitation conditions for nonlinear systems,
N. Boddupalli, A. Hasnain, S. P. Nandanoori, and E. Yeung, “Koopman operators for generalized persistence of excitation conditions for nonlinear systems,” inProceedings of the IEEE 58th Conference on Decision and Control (CDC). IEEE, 2019, pp. 8106–8111
work page 2019
-
[7]
Willems’ fundamental lemma for nonlinear systems with koopman linear embedding,
X. Shang, J. Cortés, and Y . Zheng, “Willems’ fundamental lemma for nonlinear systems with koopman linear embedding,”IEEE Control Systems Letters, 2024
work page 2024
-
[8]
Data-driven mpc with stability guarantees using extended dynamic mode decomposition,
L. Bold, L. Grüne, M. Schaller, and K. Worthmann, “Data-driven mpc with stability guarantees using extended dynamic mode decomposition,”IEEE Transactions on Automatic Control, vol. 70, no. 1, pp. 534–541, 2025
work page 2025
-
[9]
Koopman-based feedback design with stability guarantees,
R. Strässer, M. Schaller, K. Worthmann, J. Berberich, and F. Allgöwer, “Koopman-based feedback design with stability guarantees,”IEEE Transactions on Automatic Control, vol. 70, no. 1, pp. 355–370, 2025
work page 2025
-
[10]
Error analysis of kernel edmd for prediction and control in the koopman framework,
F. M. Philipp, M. Schaller, K. Worthmann, S. Peitz, and F. Nüske, “Error analysis of kernel edmd for prediction and control in the koopman framework,”Journal of Nonlinear Science, vol. 35, p. 92, 2025
work page 2025
-
[11]
Kernel-based koopman approximants for control: Flexible sampling, error analysis, and stability,
L. Bold, F. M. Philipp, M. Schaller, and K. Worthmann, “Kernel-based koopman approximants for control: Flexible sampling, error analysis, and stability,”SIAM Journal on Control and Optimization, vol. 63, no. 6, pp. 4044–4071, 2025
work page 2025
-
[12]
Persistent excitation in adaptive systems,
K. S. Narendra and A. M. Annaswamy, “Persistent excitation in adaptive systems,”International Journal of Control, vol. 45, no. 1, pp. 127–160, 1987
work page 1987
-
[13]
A note on persistency of excitation,
J. C. Willems, P. Rapisarda, I. Markovsky, and B. L. De Moor, “A note on persistency of excitation,” Systems & Control Letters, vol. 54, no. 4, pp. 325–329, 2005
work page 2005
-
[14]
From experiment design to closed-loop control,
H. Hjalmarsson, “From experiment design to closed-loop control,”Automatica, vol. 41, no. 3, pp. 393–438, 2005
work page 2005
-
[15]
D-optimal input design for nonlinear fir-type systems: A dispersion-based approach,
A. De Cock, M. Gevers, and J. Schoukens, “D-optimal input design for nonlinear fir-type systems: A dispersion-based approach,”Automatica, vol. 73, pp. 88–100, 2016
work page 2016
-
[16]
Trajectory synthesis for fisher information maximization,
A. D. Wilson, J. A. Schultz, and T. D. Murphey, “Trajectory synthesis for fisher information maximization,”IEEE Transactions on Robotics, vol. 30, no. 6, pp. 1358–1370, 2014
work page 2014
-
[17]
Data informativity: A new perspective on data-driven analysis and control,
H. J. van Waarde, J. Eising, H. L. Trentelman, and M. K. Camlibel, “Data informativity: A new perspective on data-driven analysis and control,”IEEE Transactions on Automatic Control, vol. 65, no. 11, pp. 4753–4768, 2020
work page 2020
-
[18]
Optimal excitation trajectories for mechanical systems 14 identification,
T. Lee, B. D. Lee, and F. C. Park, “Optimal excitation trajectories for mechanical systems 14 identification,”Automatica, vol. 131, p. 109773, 2021
work page 2021
-
[19]
Space-filling input design for nonlinear state-space identification,
M. Kiss, R. Tóth, and M. Schoukens, “Space-filling input design for nonlinear state-space identification,”IFAC-PapersOnLine, vol. 58, no. 15, pp. 562–567, 2024
work page 2024
-
[20]
V . Smits and O. Nelles, “Space-filling optimized excitation signals for nonlinear system identification of dynamic processes of a diesel engine,”Control Engineering Practice, vol. 144, p. 105821, 2024
work page 2024
-
[21]
On space-filling input design for nonlinear dynamic model learning: A gaussian process approach,
Y . Liu, M. Kiss, R. Tóth, and M. Schoukens, “On space-filling input design for nonlinear dynamic model learning: A gaussian process approach,”IEEE Control Systems Letters, vol. 9, pp. 1868–1873, 2025
work page 2025
-
[22]
Active learning-based model predictive coverage control,
R. Rickenbach, J. Köhler, A. Scampicchio, M. N. Zeilinger, and A. Carron, “Active learning-based model predictive coverage control,”IEEE Transactions on Automatic Control, vol. 69, no. 9, pp. 5931–5946, 2024
work page 2024
-
[23]
Improved algorithms for linear stochastic bandits,
Y . Abbasi-Yadkori, D. Pál, and C. Szepesvári, “Improved algorithms for linear stochastic bandits,” inAdvances in Neural Information Processing Systems, vol. 24, 2011, pp. 2312–2320
work page 2011
-
[24]
Learning without mixing: Towards a sharp analysis of linear system identification,
M. Simchowitz, H. Mania, S. Tu, M. I. Jordan, and B. Recht, “Learning without mixing: Towards a sharp analysis of linear system identification,” inProceedings of the 31st Conference on Learning Theory, ser. Proceedings of Machine Learning Research, vol. 75. PMLR, 2018, pp. 439–473
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.