pith. sign in

arxiv: 2605.16584 · v1 · pith:OZQDMMUFnew · submitted 2026-05-15 · 📡 eess.SY · cs.SY

Provably Efficient Sensor Allocation for Unknown High-dimensional Systems with Limited Sensing

Pith reviewed 2026-05-20 15:41 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords sensor allocationsystem identificationlinear time-invariant systemsobservabilityhigh-dimensional systemslimited sensingnon-asymptotic guarantees
0
0 comments X

The pith

Multiple partial trajectories let a two-stage method learn near-optimal sensor placements for unknown linear systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tries to establish that one can learn sensor allocations ensuring observability for high-dimensional linear systems whose dynamics are unknown, while using only a small number of sensors. It combines a new system identification step that merges data from several independent trajectories—each seeing a different subset of states—with a standard sensor selection procedure run on the resulting estimates. A sympathetic reader would care because prior approaches either require impractically many sensors or start from the assumption that a good allocation is already known in advance. If the non-asymptotic bounds hold, the result is that full-state reconstruction becomes feasible with far fewer measurements than the system dimension.

Core claim

For an unknown linear time-invariant system, information pooled across multiple trajectories each observing a distinct subset of state coordinates suffices to identify the dynamics accurately enough that a sensor allocation procedure applied to the identified model produces an observable system using a near-optimal number of sensors. The guarantees are non-asymptotic and hold with high probability when sensors can be placed on any coordinate; the same approach extends to the case where some coordinates are unavailable for sensing.

What carries the argument

The two-stage framework that first performs system identification by integrating multiple partial-observation trajectories and then applies sensor allocation to the learned parameters.

If this is right

  • A near-optimal number of sensors suffices to guarantee observability of the unknown system.
  • The approach requires no initial observable sensor allocation to be provided in advance.
  • Non-asymptotic performance bounds hold for high-dimensional systems.
  • The guarantees extend to settings where some state coordinates cannot receive sensors.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same identification-then-allocation pattern could be tested on nonlinear systems approximated locally by linear models.
  • In applications such as power-grid monitoring, the method might translate into substantially lower sensor hardware costs once validated on real data.
  • Online variants could update the allocation as new trajectories become available without restarting the identification step.

Load-bearing premise

The system must be linear and time-invariant, and several independent trajectories must be available, each observing a different subset of the state coordinates.

What would settle it

A concrete counterexample would be a known linear system on which the method selects a sensor set that leaves at least one unstable mode unobservable, even though a smaller known-optimal set achieves observability.

Figures

Figures reproduced from arXiv: 2605.16584 by Derya Cansever, Na Li, Yuyang Zhang.

Figure 1
Figure 1. Figure 1: Simulation results for Model 1 (top row, block-cyclic system) and Model 2 (bottom row, thermal dynamics). Panels (a,c) plot Markov parameter estimation errors against trajectory length T on a logarithmic y-axis. Panels (b) and (d) visualize the learned sensor allocation matrix when K = 16 or K = 12, respectively. The bright squares mark active (sensor, state-coordinate) pairs. (inputs) of the 20 zones. Dyn… view at source ↗
read the original abstract

This paper focuses on learning efficient sensor allocations that ensure observability of unknown high-dimensional linear systems using only a small number of sensors. Existing methods either require an impractically large number of sensors or assume access to an observable allocation in advance. We propose a two-stage framework that overcomes these limitations: first, a novel system identification algorithm integrates information from multiple trajectories, each observing different subsets of state coordinates; then, a classic sensor allocation method is adapted to operate on the learned system parameters. Our non-asymptotic guarantees show that the proposed approach learns a sensor allocation with a near-optimal number of sensors when sensors can be allocated on any state coordinate. We further extend the results to settings with inaccessible state coordinates that are unavailable for sensor allocation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a two-stage framework for sensor allocation in unknown high-dimensional LTI systems. Stage 1 develops a novel identification procedure that fuses data from multiple independent trajectories, each observing a distinct subset of state coordinates, to produce estimates  and Ĉ. Stage 2 adapts a classic combinatorial sensor-selection algorithm to these estimates and returns a set S whose cardinality is near the minimal observability index. Non-asymptotic finite-sample guarantees are claimed for the near-optimality of |S| when sensors may be placed on any coordinate; an extension to inaccessible coordinates is also presented.

Significance. If the claimed non-asymptotic guarantees can be made rigorous, the work would supply a concrete, data-driven route to near-minimal sensor placement for high-dimensional systems without presupposing an observable configuration. The multi-trajectory partial-observation identification step and the explicit finite-sample analysis are genuine strengths that address practical constraints in large-scale control applications.

major comments (2)
  1. [§4.3, Theorem 4.2] §4.3, Theorem 4.2: The non-asymptotic identification bounds (Theorem 3.1) control ||Â−A|| and ||Ĉ−C|| at rate O(1/√T) under the stated assumptions, yet no explicit propagation argument is given showing that these perturbations preserve the rank of the observability matrix evaluated at the selected S for the true (A,C). In high dimension even small perturbations can change the rank, so the claimed probabilistic guarantee that the returned S renders the true system observable does not follow from the stated bounds.
  2. [§5.1, Algorithm 2] §5.1, Algorithm 2: The sensor-selection routine is described as operating on the estimated pair (Â,Ĉ), but the analysis does not quantify how the combinatorial search tolerates the identification error while still returning a set whose size is within a constant factor of the true minimal observability index. The near-optimality claim therefore rests on an unproven continuity property of the rank condition under matrix perturbations.
minor comments (2)
  1. [§2] Notation for the observability index and the minimal sensor cardinality is introduced without a dedicated definition paragraph; a short table summarizing the symbols would improve readability.
  2. [§4] The assumption that the system is controllable is used in the identification analysis but is not restated in the sensor-allocation section; a brief reminder would help readers track the conditions under which the guarantees hold.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thorough review and valuable feedback on our work. The comments correctly identify areas where the finite-sample analysis can be strengthened to make the guarantees fully rigorous. We provide point-by-point responses below and will incorporate the necessary additions in the revised manuscript.

read point-by-point responses
  1. Referee: §4.3, Theorem 4.2: The non-asymptotic identification bounds (Theorem 3.1) control ||Â−A|| and ||Ĉ−C|| at rate O(1/√T) under the stated assumptions, yet no explicit propagation argument is given showing that these perturbations preserve the rank of the observability matrix evaluated at the selected S for the true (A,C). In high dimension even small perturbations can change the rank, so the claimed probabilistic guarantee that the returned S renders the true system observable does not follow from the stated bounds.

    Authors: We appreciate the referee highlighting this important point regarding the propagation of identification errors to the observability rank. While the identification error bounds are provided in Theorem 3.1, the explicit argument connecting them to the preservation of the rank condition for the selected sensor set was not detailed in the current version. In the revised manuscript, we will introduce a perturbation analysis for the observability matrix. Specifically, we will derive a bound showing that if the estimation errors ||Â - A|| and ||Ĉ - C|| are sufficiently small (which occurs with high probability for large enough T), and given that the true observability matrix for the minimal set has a positive minimal singular value, the rank will be preserved. This will be added as a supporting lemma in Section 4.3 to rigorously justify the probabilistic guarantee in Theorem 4.2. revision: yes

  2. Referee: §5.1, Algorithm 2: The sensor-selection routine is described as operating on the estimated pair (Â,Ĉ), but the analysis does not quantify how the combinatorial search tolerates the identification error while still returning a set whose size is within a constant factor of the true minimal observability index. The near-optimality claim therefore rests on an unproven continuity property of the rank condition under matrix perturbations.

    Authors: We agree that quantifying the robustness of the sensor-selection procedure to estimation errors is crucial for the near-optimality claim. The current analysis assumes the estimates are close enough but does not provide explicit bounds on how the combinatorial optimization behaves under perturbations. In the revision, we will add a continuity result for the minimal observability index under small matrix perturbations, showing that the selected set S from the estimated system will have cardinality at most a constant factor larger than the true minimal one, with high probability. This will involve analyzing the output of the combinatorial search algorithm under bounded perturbations and will be incorporated into Section 5.1. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper describes a two-stage procedure: a novel identification algorithm that integrates multiple partial-observation trajectories to produce estimates of the unknown LTI system, followed by adaptation of a classic sensor-allocation routine applied to those estimates. Non-asymptotic guarantees are asserted for the end-to-end result. No quoted step reduces a claimed prediction or uniqueness result to a fitted parameter, self-citation, or definitional tautology; the identification and allocation stages remain distinct and rely on external observability concepts rather than importing load-bearing premises from the authors' prior work.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Only abstract available so ledger is minimal and based solely on explicit statements; full paper may contain additional fitted parameters or assumptions.

axioms (2)
  • domain assumption The system is linear and time-invariant.
    Central focus of the paper on linear systems with observability.
  • domain assumption Multiple trajectories with varying sensor subsets are available for data collection.
    Required for the first-stage identification algorithm.

pith-pipeline@v0.9.0 · 5652 in / 1232 out tokens · 45495 ms · 2026-05-20T15:41:14.512034+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages · 1 internal anchor

  1. [1]

    A review of active probing- based system identification techniques with applications in power sys- tems,

    R. Chakraborty, H. Jain, and G.-S. Seo, “A review of active probing- based system identification techniques with applications in power sys- tems,”International Journal of Electrical Power & Energy Systems, vol. 140, p. 108008, 2022

  2. [2]

    Principles of large scale numerical weather prediction,

    N. Phillips, “Principles of large scale numerical weather prediction,” inDynamic Meteorology: Lectures Delivered at the Summer School of Space Physics of the Centre National D’Etudes Spatiales, Held at Lannion, France, August–September 1970. Springer, 1973, pp. 1–96

  3. [3]

    Decentralized temperature control via hvac systems in energy efficient buildings: An approximate solution procedure,

    X. Zhang, W. Shi, X. Li, B. Yan, A. Malkawi, and N. Li, “Decentralized temperature control via hvac systems in energy efficient buildings: An approximate solution procedure,” in2016 IEEE Global Conference on Signal and Information Processing. IEEE, 2016, pp. 936–940

  4. [4]

    Network neuroscience,

    D. S. Bassett and O. Sporns, “Network neuroscience,”Nature neuro- science, vol. 20, no. 3, pp. 353–364, 2017

  5. [5]

    J. C. Doyle, B. A. Francis, and A. R. Tannenbaum,Feedback control theory. Courier Corporation, 2013

  6. [6]

    Failure detection in linear systems

    H. L. Jones, “Failure detection in linear systems.” Ph.D. dissertation, Massachusetts Institute of Technology, 1973

  7. [7]

    Pmu configuration for system dynamic performance measurement in large, multiarea power systems,

    I. Kamwa and R. Grondin, “Pmu configuration for system dynamic performance measurement in large, multiarea power systems,”IEEE Transactions on Power Systems, vol. 17, no. 2, pp. 385–394, 2002. 16 IEEE TRANSACTIONS AND JOURNALS TEMPLA TE

  8. [8]

    Learning-based sensor selection with guaran- teed performance bounds,

    R. Vafaee and M. Siami, “Learning-based sensor selection with guaran- teed performance bounds,” in2022 American Control Conference (ACC). IEEE, 2022, pp. 1459–1465

  9. [9]

    Minimal controllability problems,

    A. Olshevsky, “Minimal controllability problems,”IEEE Transactions on Control of Network Systems, vol. 1, no. 3, pp. 249–258, 2014

  10. [10]

    Submodularity in input node selection for networked linear systems: Efficient algorithms for performance and controllability,

    A. Clark, B. Alomair, L. Bushnell, and R. Poovendran, “Submodularity in input node selection for networked linear systems: Efficient algorithms for performance and controllability,”IEEE Control Systems Magazine, vol. 37, no. 6, pp. 52–74, 2017

  11. [11]

    Near optimal finite time identification of arbitrary linear dynamical systems,

    T. Sarkar and A. Rakhlin, “Near optimal finite time identification of arbitrary linear dynamical systems,” inInternational Conference on Machine Learning. PMLR, 2019, pp. 5610–5618

  12. [12]

    Learning without mixing: Towards a sharp analysis of linear system identifica- tion,

    M. Simchowitz, H. Mania, S. Tu, M. I. Jordan, and B. Recht, “Learning without mixing: Towards a sharp analysis of linear system identifica- tion,” inConference On Learning Theory. PMLR, 2018, pp. 439–473

  13. [13]

    Learning with little mixing,

    I. Ziemann and S. Tu, “Learning with little mixing,”Advances in Neural Information Processing Systems, vol. 35, pp. 4626–4637, 2022

  14. [14]

    Non-asymptotic and accurate learning of nonlinear dynamical systems,

    Y . Sattar and S. Oymak, “Non-asymptotic and accurate learning of nonlinear dynamical systems,”Journal of Machine Learning Research, vol. 23, no. 140, pp. 1–49, 2022

  15. [15]

    Learning nonlinear dynamical systems from a single trajectory,

    D. Foster, T. Sarkar, and A. Rakhlin, “Learning nonlinear dynamical systems from a single trajectory,” inLearning for Dynamics and Control. PMLR, 2020, pp. 851–861

  16. [16]

    Efficient learning of generalized linear and single index models with isotonic regression,

    S. M. Kakade, V . Kanade, O. Shamir, and A. Kalai, “Efficient learning of generalized linear and single index models with isotonic regression,” Advances in Neural Information Processing Systems, vol. 24, 2011

  17. [17]

    Non-asymptotic identification of lti systems from a single trajectory,

    S. Oymak and N. Ozay, “Non-asymptotic identification of lti systems from a single trajectory,” in2019 American control conference (ACC). IEEE, 2019, pp. 5655–5661

  18. [18]

    Finite sample analysis of stochastic system identification,

    A. Tsiamis and G. J. Pappas, “Finite sample analysis of stochastic system identification,” in2019 IEEE 58th Conference on Decision and Control (CDC). IEEE, 2019, pp. 3648–3654

  19. [19]

    Finite time lti system identification,

    T. Sarkar, A. Rakhlin, and M. A. Dahleh, “Finite time lti system identification,”Journal of Machine Learning Research, vol. 22, no. 26, pp. 1–61, 2021

  20. [20]

    A framework for optimal actuator/sensor selection in a control system,

    A. Argha, S. W. Su, A. Savkin, and B. Celler, “A framework for optimal actuator/sensor selection in a control system,”International Journal of Control, vol. 92, no. 2, pp. 242–260, 2019

  21. [21]

    On submodularity and controllability in complex dynamical networks,

    T. H. Summers, F. L. Cortesi, and J. Lygeros, “On submodularity and controllability in complex dynamical networks,”IEEE Transactions on Control of Network Systems, vol. 3, no. 1, pp. 91–101, 2015

  22. [22]

    A deep reinforcement learning approach to sensor placement under uncertainty,

    A. Jabini and E. A. Johnson, “A deep reinforcement learning approach to sensor placement under uncertainty,”IFAC-PapersOnLine, vol. 55, no. 27, pp. 178–183, 2022

  23. [23]

    Optimal sensor placement methodology for identification with unmeasured excitation,

    K.-V . Yuen, L. S. Katafygiotis, C. Papadimitriou, and N. C. Micklebor- ough, “Optimal sensor placement methodology for identification with unmeasured excitation,”J. Dyn. Sys., Meas., Control, vol. 123, no. 4, pp. 677–686, 2001

  24. [24]

    Reinforcement learning-based optimal sensor placement for spatiotemporal modeling,

    Z. Wang, H.-X. Li, and C. Chen, “Reinforcement learning-based optimal sensor placement for spatiotemporal modeling,”IEEE transactions on cybernetics, vol. 50, no. 6, pp. 2861–2871, 2019

  25. [25]

    A machine learning approach for constrained sensor placement,

    K. Kasper, L. Mathelin, and H. Abou-Kandil, “A machine learning approach for constrained sensor placement,” in2015 American Control Conference (ACC). IEEE, 2015, pp. 4479–4484

  26. [26]

    Input-output data-driven sensor selection,

    F. Fotiadis and K. G. Vamvoudakis, “Input-output data-driven sensor selection,” in2024 IEEE 63rd Conference on Decision and Control (CDC). IEEE, 2024, pp. 4506–4511

  27. [27]

    System identification,

    L. Ljung, “System identification,” inSignal analysis and prediction. Springer, 1998, pp. 163–173

  28. [28]

    K. J. Keesman,System identification: an introduction. Springer Science & Business Media, 2011

  29. [29]

    Statistical learning theory for control: A finite-sample perspective,

    A. Tsiamis, I. Ziemann, N. Matni, and G. J. Pappas, “Statistical learning theory for control: A finite-sample perspective,”IEEE Control Systems Magazine, vol. 43, no. 6, pp. 67–97, 2023

  30. [30]

    A tutorial on the non-asymptotic theory of system identification,

    I. Ziemann, A. Tsiamis, B. Lee, Y . Jedra, N. Matni, and G. J. Pappas, “A tutorial on the non-asymptotic theory of system identification,” in 2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 2023, pp. 8921–8939

  31. [31]

    Finite time identification in unstable linear systems,

    M. K. S. Faradonbeh, A. Tewari, and G. Michailidis, “Finite time identification in unstable linear systems,”Automatica, vol. 96, pp. 342– 353, 2018

  32. [32]

    System identification: A machine learning perspective,

    A. Chiuso and G. Pillonetto, “System identification: A machine learning perspective,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 2, no. 1, pp. 281–304, 2019

  33. [33]

    Discovering governing equations from data by sparse identification of nonlinear dynamical systems,

    S. L. Brunton, J. L. Proctor, and J. N. Kutz, “Discovering governing equations from data by sparse identification of nonlinear dynamical systems,”Proceedings of the national academy of sciences, vol. 113, no. 15, pp. 3932–3937, 2016

  34. [34]

    Neuropixels 2.0: A miniaturized high-density probe for stable, long-term brain recordings,

    N. A. Steinmetz, C. Aydin, A. Lebedeva, M. Okun, M. Pachitariu, M. Bauza, M. Beau, J. Bhagat, C. B ¨ohm, M. Brouxet al., “Neuropixels 2.0: A miniaturized high-density probe for stable, long-term brain recordings,”Science, vol. 372, no. 6539, p. eabf4588, 2021

  35. [35]

    Observability and synchronization of neuron models,

    L. A. Aguirre, L. L. Portes, and C. Letellier, “Observability and synchronization of neuron models,”Chaos: An Interdisciplinary Journal of Nonlinear Science, vol. 27, no. 10, 2017

  36. [36]

    Observability for optimal sensor locations in data assimilation,

    S. King, W. Kang, and L. Xu, “Observability for optimal sensor locations in data assimilation,”International Journal of Dynamics and Control, vol. 3, no. 4, pp. 416–424, 2015

  37. [37]

    Learning low-dimensional latent dynamics from high-dimensional observations: Non-asymptotics and lower bounds,

    Y . Zhang, S. Talebi, and N. Li, “Learning low-dimensional latent dynamics from high-dimensional observations: Non-asymptotics and lower bounds,”arXiv preprint arXiv:2405.06089, 2024

  38. [38]

    Suprema of chaos processes and the restricted isometry property,

    F. Krahmer, S. Mendelson, and H. Rauhut, “Suprema of chaos processes and the restricted isometry property,”Communications on Pure and Applied Mathematics, vol. 67, no. 11, pp. 1877–1904, 2014

  39. [39]

    Improved algorithms for linear stochastic bandits,

    Y . Abbasi-Yadkori, D. P´al, and C. Szepesv ´ari, “Improved algorithms for linear stochastic bandits,”Advances in neural information processing systems, vol. 24, 2011

  40. [40]

    Perturbation theory for pseudo-inverses,

    P.- ˚A. Wedin, “Perturbation theory for pseudo-inverses,”BIT Numerical Mathematics, vol. 13, pp. 217–232, 1973

  41. [41]

    An analysis of the greedy algorithm for the submodular set covering problem,

    L. A. Wolsey, “An analysis of the greedy algorithm for the submodular set covering problem,”Combinatorica, vol. 2, no. 4, pp. 385–393, 1982

  42. [42]

    Distributed reinforcement learning for decentralized linear quadratic control: A derivative-free policy optimization approach,

    Y . Li, Y . Tang, R. Zhang, and N. Li, “Distributed reinforcement learning for decentralized linear quadratic control: A derivative-free policy optimization approach,”IEEE Transactions on Automatic Control, vol. 67, no. 12, pp. 6429–6444, 2021

  43. [43]

    Introduction to the non-asymptotic analysis of random matrices

    R. Vershynin, “Introduction to the non-asymptotic analysis of random matrices,”arXiv preprint arXiv:1011.3027, 2010