A Measure-Theoretic Formulation of Behavioral Systems
Pith reviewed 2026-05-09 14:10 UTC · model grok-4.3
The pith
Representing a dynamical system as the set of all probability measures on its admissible trajectories yields a convex and weakly closed behavior even for nonlinear deterministic dynamics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A finite-horizon dynamical system is identified with the set of all Borel probability measures supported on its admissible trajectories. For deterministic systems this set is convex and weakly closed even when the underlying dynamics are nonlinear, because any convex combination of measures supported on admissible trajectories remains supported on admissible trajectories. Its extreme points are precisely the Dirac measures concentrated on individual admissible trajectories, so the original deterministic behavior is recovered as the extremal skeleton of the measure-valued object.
What carries the argument
The behavioral-measure set: the collection of all Borel probability measures whose support is contained in the set of admissible trajectories.
Load-bearing premise
The space of trajectories can be equipped with a Borel sigma-algebra such that the collection of probability measures supported on admissible trajectories is weakly closed and its extreme points are exactly the Dirac measures on single admissible trajectories.
What would settle it
An explicit pair of admissible trajectory distributions whose convex combination is supported only on trajectories that violate the system dynamics would falsify the claimed convexity.
Figures
read the original abstract
In Willems' behavioral systems theory, a dynamical system is identified with the set of all trajectories compatible with its laws of motion. In the linear time-invariant setting this trajectory set is a linear subspace, and its algebraic structure underpins the Fundamental Lemma: a single persistently exciting data trajectory generates the entire finite-horizon behavior. For nonlinear or stochastic systems, however, the admissible trajectory set is generally nonconvex, obstructing direct optimization over the behavior. In this paper, we lift the behavioral viewpoint from trajectories to probability measures on trajectories by representing a finite-horizon dynamical system with the set of all Borel probability measures supported on its admissible trajectories. For deterministic systems, this behavioral-measure set is convex and weakly closed even when the dynamics are nonlinear, because convex combinations of trajectory distributions remain dynamically admissible even when convex combinations of trajectories do not. Its extreme points are precisely the Dirac masses on individual admissible trajectories, so the classical deterministic theory is embedded as the extremal skeleton of the richer measure-valued object. On this foundation we establish two core deterministic results and outline a stochastic extension based on history-conditional kernel consistency.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper lifts Willems' behavioral systems theory from trajectory sets to the set of all Borel probability measures supported on admissible trajectories. For deterministic finite-horizon systems the behavioral-measure set is shown to be convex and weakly closed regardless of linearity of the dynamics, because convex combinations and weak limits preserve support inclusion in the admissible set B. The extreme points are precisely the Dirac measures on individual trajectories in B, recovering the classical deterministic theory. Two core deterministic results are established on this foundation and a stochastic extension via history-conditional kernel consistency is outlined.
Significance. If the derivations hold, the construction supplies a convex set in measure space for optimization over behaviors even when the underlying trajectory set is nonconvex, which is a direct response to a known obstruction in nonlinear behavioral control. The embedding of the deterministic case as the extremal skeleton is a clean application of standard facts on the weak topology and support. Credit is due for the parameter-free character of the lift and for relying only on classical measure-theoretic properties rather than additional dynamical structure.
major comments (2)
- [§3, Theorem 3.1] §3, Theorem 3.1 (weak closure): the argument that weak limits preserve support inclusion in B requires the trajectory space to be Polish (or at least completely regular) so that the support map is upper semicontinuous; the manuscript should state the precise topological assumptions on the trajectory space to make this rigorous for general nonlinear finite-horizon dynamics.
- [§4, Proposition 4.2] §4, Proposition 4.2 (extreme points): while the claim that extreme points are exactly the Diracs follows from the definition of extremality in the simplex, the proof should explicitly invoke the relevant fact from convex analysis (e.g., that any non-Dirac measure is a nontrivial convex combination) rather than leaving it implicit.
minor comments (3)
- [Definition 2.3] Definition 2.3: the notation for the behavioral-measure set could be introduced with an explicit symbol (e.g., M(B)) to improve readability when it is used repeatedly in later sections.
- [Introduction] Introduction, paragraph 3: the discussion of the Fundamental Lemma would benefit from a direct citation to Willems' original work rather than only secondary references.
- [§6] The stochastic extension in §6 is described as an outline; if the authors intend it as a contribution, a brief statement of the precise consistency conditions would help readers assess its scope.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our manuscript and for the constructive comments. We address each major comment below and will incorporate the requested clarifications in the revised version.
read point-by-point responses
-
Referee: [§3, Theorem 3.1] §3, Theorem 3.1 (weak closure): the argument that weak limits preserve support inclusion in B requires the trajectory space to be Polish (or at least completely regular) so that the support map is upper semicontinuous; the manuscript should state the precise topological assumptions on the trajectory space to make this rigorous for general nonlinear finite-horizon dynamics.
Authors: We agree that the topological assumptions should be stated explicitly. In the revised manuscript we will add, at the beginning of Section 3, the standing assumption that the trajectory space is a Polish space (complete separable metric space). This is the natural setting for finite-horizon behaviors whose trajectories take values in Euclidean space. Under this hypothesis the support map is upper semicontinuous with respect to the weak topology, so any weak limit of measures supported on the closed admissible set B remains supported on B. The proof of Theorem 3.1 is therefore rigorous once the assumption is recorded; no change to the argument itself is required. revision: yes
-
Referee: [§4, Proposition 4.2] §4, Proposition 4.2 (extreme points): while the claim that extreme points are exactly the Diracs follows from the definition of extremality in the simplex, the proof should explicitly invoke the relevant fact from convex analysis (e.g., that any non-Dirac measure is a nontrivial convex combination) rather than leaving it implicit.
Authors: We thank the referee for this suggestion. In the revised manuscript we will make the invocation explicit in the proof of Proposition 4.2 by adding the standard fact that the extreme points of the simplex of probability measures are precisely the Dirac measures. Equivalently, any non-Dirac measure μ admits a nontrivial convex decomposition μ = λν₁ + (1-λ)ν₂ with 0 < λ < 1 and ν₁ ≠ ν₂ (obtained, for instance, by splitting the support into two sets of positive μ-measure). This renders the identification of the extreme points with the Dirac measures on admissible trajectories fully self-contained. revision: yes
Circularity Check
No significant circularity; derivation relies on standard measure theory
full rationale
The paper defines the behavioral-measure set as the collection of all Borel probability measures whose support lies inside the admissible trajectory set B. Convexity and weak closure then follow immediately from the definition of support (any convex combination of measures supported on B remains supported on B) and the fact that weak limits preserve support inclusion when B is closed in a Polish space. Extreme points being the Dirac measures is likewise a direct consequence of the geometry of the probability simplex. No equations reduce a claimed result to a fitted parameter or prior self-citation; the argument invokes only textbook facts about weak topology and extremal points. The construction is therefore self-contained against external mathematical benchmarks and receives the default non-circularity finding.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math The trajectory space is equipped with a Borel sigma-algebra and the weak topology on probability measures is well-defined.
invented entities (1)
-
Behavioral-measure set
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Paradigms and puzzles in the theory of dynamical systems,
J. C. Willems, “Paradigms and puzzles in the theory of dynamical systems,”IEEE Transactions on Automatic Control, vol. 36, no. 3, pp. 259–294, 1991
work page 1991
-
[2]
A note on persistency of excitation,
J. C. Willems, P. Rapisarda, I. Markovsky, and B. L. M. D. Moor, “A note on persistency of excitation,”Systems & Control Letters, vol. 54, no. 4, pp. 325–329, 2005
work page 2005
-
[3]
Data-enabled predictive control: In the shallows of the data-driven sea,
J. Coulson, J. Lygeros, and F. D¨ orfler, “Data-enabled predictive control: In the shallows of the data-driven sea,” inProc. IEEE Conference on Decision and Control (CDC), 2019, pp. 6309–6314
work page 2019
-
[4]
Data-driven model predictive control with stability and ro- bustness guarantees,
J. Berberich and F. Allg¨ ower, “Data-driven model predictive control with stability and ro- bustness guarantees,”IEEE Transactions on Automatic Control, vol. 66, no. 4, pp. 1702–1717, 2021
work page 2021
-
[5]
Data-driven simulation of linear time-invariant systems,
I. Markovsky and J. C. Willems, “Data-driven simulation of linear time-invariant systems,” Automatica, vol. 44, no. 12, pp. 3011–3017, 2008
work page 2008
-
[6]
Data informativity: A new perspective on data-driven analysis and control,
H. J. van Waarde, J. Eising, H. L. Trentelman, and M. K. Camlibel, “Data informativity: A new perspective on data-driven analysis and control,”IEEE Transactions on Automatic Control, vol. 65, no. 11, pp. 4753–4768, 2020
work page 2020
-
[7]
An informativity approach to the data- driven algebraic regulator problem,
H. L. Trentelman, H. van Waarde, and K. Camlibel, “An informativity approach to the data- driven algebraic regulator problem,”IEEE Transactions on Automatic Control, vol. 67, no. 11, pp. 6227–6233, 2022. 27
work page 2022
-
[8]
The informativity approach: To data-driven analysis and control,
H. J. van Waarde, J. Eising, M. K. Camlibel, and H. L. Trentelman, “The informativity approach: To data-driven analysis and control,”IEEE Control Systems Magazine, vol. 43, no. 6, pp. 32–66, 2023
work page 2023
-
[9]
T. Faulwasser, R. Ou, G. Pan, P. Schmitz, and K. Worthmann, “Behavioral theory for stochastic systems? A data-driven journey from Willems to Wiener and back again,”Annual Reviews in Control, vol. 55, pp. 92–117, 2023
work page 2023
-
[10]
Convex duality and nonlinear optimal control,
R. B. Vinter, “Convex duality and nonlinear optimal control,”IEEE Transactions on Automatic Control, vol. 38, no. 5, pp. 754–765, 1993
work page 1993
-
[11]
A semidefinite programming approach to the generalized moment problem,
J. B. Lasserre, “A semidefinite programming approach to the generalized moment problem,” Mathematical Programming, vol. 112, no. 1, pp. 65–92, 2008
work page 2008
-
[12]
London, UK: Imperial College Press, 2010
——,Moments, Positive Polynomials and Their Applications. London, UK: Imperial College Press, 2010
work page 2010
-
[13]
Nonlinear optimal control synthesis via occupation measures and LMI relaxations,
D. Henrion, J. B. Lasserre, and C. Savorgnan, “Nonlinear optimal control synthesis via occupation measures and LMI relaxations,”SIAM Journal on Control and Optimization, vol. 47, no. 4, pp. 1643–1666, 2008
work page 2008
-
[14]
Convex computation of the region of attraction of polynomial control systems,
D. Henrion and M. Korda, “Convex computation of the region of attraction of polynomial control systems,”IEEE Transactions on Automatic Control, vol. 59, no. 2, pp. 297–312, 2014
work page 2014
-
[15]
Linear conic formulations of nonlinear stochastic control,
M. Korda and D. Henrion, “Linear conic formulations of nonlinear stochastic control,”SIAM Journal on Control and Optimization, vol. 58, no. 4, pp. 2334–2362, 2020
work page 2020
-
[16]
L. C. Young,Lectures on the Calculus of Variations and Optimal Control Theory. Philadelphia, PA: W. B. Saunders, 1969
work page 1969
-
[17]
Warga,Optimal Control of Differential and Functional Equations
J. Warga,Optimal Control of Differential and Functional Equations. New York, NY: Academic Press, 1972
work page 1972
-
[18]
J. C. Willems, “Open stochastic systems,”IEEE Transactions on Automatic Control, vol. 58, no. 2, pp. 406–421, 2013
work page 2013
-
[19]
D. P. Bertsekas,Dynamic Programming and Optimal Control, 4th ed. Belmont, MA: Athena Scientific, 2012, vol. 1
work page 2012
-
[20]
Billingsley,Probability and Measure, 3rd ed
P. Billingsley,Probability and Measure, 3rd ed. New York, NY: John Wiley & Sons, 1995
work page 1995
-
[21]
New York, NY: John Wiley & Sons, 1999
——,Convergence of Probability Measures, 2nd ed. New York, NY: John Wiley & Sons, 1999
work page 1999
-
[22]
O. Hern´ andez-Lerma and J. B. Lasserre,Discrete-Time Markov Control Processes: Basic Optimality Criteria. New York, NY: Springer, 1996. 28
work page 1996
-
[23]
Altman,Constrained Markov Decision Processes, ser
E. Altman,Constrained Markov Decision Processes, ser. Stochastic Modeling. Boca Raton, FL: Chapman & Hall/CRC, 1999, vol. 7
work page 1999
-
[24]
M. L. Puterman,Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York, NY: John Wiley & Sons, 1994
work page 1994
-
[25]
C. Berge,Topological Spaces. Edinburgh: Oliver and Boyd, 1963
work page 1963
-
[26]
D. P. Bertsekas and S. E. Shreve,Stochastic Optimal Control: The Discrete-Time Case. Belmont, MA: Athena Scientific, 1996
work page 1996
-
[27]
J. W. Polderman and J. C. Willems,Introduction to Mathematical Systems Theory: A Behavioral Approach. Springer, 1998
work page 1998
-
[28]
W. H. Fleming and H. M. Soner,Controlled Markov Processes and Viscosity Solutions. New York, NY: Springer, 2006. 29
work page 2006
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.