Optimal Control of a Mesoscopic Information Engine
Pith reviewed 2026-05-13 23:25 UTC · model grok-4.3
The pith
LQG dynamics decouple optimal measurement scheduling from trap-position feedback in a mesoscopic information engine.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By casting the engine inside the POMDP framework, the underlying LQG dynamics decouple the optimal measurement and driving protocols. The optimal feedback law for trap placement recovers the discontinuous Schmiedl-Seifert protocol in the open-loop limit and extends it to arbitrary measurement schedules. For a binary on/off sensor the paper derives the optimal measurement times, identifies deadline-induced blindness, and obtains exact periodic schedules in the infinite-horizon limit together with the macroscopic velocity envelopes beyond which the engine becomes net dissipative.
What carries the argument
Linear-Quadratic-Gaussian (LQG) dynamics inside the Partially Observable Markov Decision Process (POMDP) formulation, which separate the choice of measurement instants from the continuous feedback law that sets trap position.
If this is right
- The optimal driving protocol reduces exactly to the discontinuous Schmiedl-Seifert protocol when measurements are absent.
- Optimal finite-horizon measurement schedules exist for any fixed measurement cost and any deadline.
- Deadline-induced blindness appears: all measurements cease as the deadline is approached regardless of cost.
- Exact periodic measurement schedules are obtained in the infinite-horizon steady state as a function of cost.
- Macroscopic velocity envelopes mark the boundary between net work extraction and net dissipation.
Where Pith is reading between the lines
- The same separation may let designers add variable-precision sensors without re-optimizing the entire feedback law.
- Bounds on net gain supply a practical design rule for choosing trap stiffness and measurement rate in viscous environments.
- The decoupling suggests that similar LQG information engines could be built with non-binary sensors without losing analytic tractability.
- Testing the predicted blindness near deadlines in a real optical-trap setup would directly probe the finite-time theory.
Load-bearing premise
The particle obeys linear-quadratic-Gaussian dynamics under the overdamped approximation and measurements are binary with a fixed per-use cost.
What would settle it
Numerical simulation or experiment on an overdamped particle showing that the optimal measurement schedule changes when the feedback gain for trap position is altered would falsify the claimed decoupling.
Figures
read the original abstract
We analytically solve the finite-time control problem of driving an overdamped particle via an optical trap under costly measurement. By formulating this mesoscopic information engine within the Partially Observable Markov Decision Process (POMDP) framework, we demonstrate that the underlying Linear-Quadratic-Gaussian (LQG) dynamics decouple the optimal measurement and driving protocols. We derive the optimal feedback control law for the trap placement, which recovers the discontinuous Schmiedl-Seifert driving protocol in the open-loop limit and extends it to any measurement scheduling. For a costly, binary (on/off) sensor, we evaluate the optimal measurement protocol and derive physical bounds on the maximum net gain that can be extracted from thermal fluctuations. We show the emergence of deadline-induced blindness, a phenomenon where all measurements cease as the deadline approaches regardless of their cost. Taking the infinite-horizon limit, we find the exact periodic measurement schedules for the steady state as a function of the measurement cost $C$ and derive the macroscopic velocity envelopes beyond which viscous drag forces the engine into a net-dissipative regime. Finally, we generalize the results to a variable-precision sensor.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper analytically solves the finite-time optimal control problem for an overdamped colloidal particle driven by an optical trap under costly binary measurements. Formulating the system as a POMDP under LQG dynamics, it claims that the optimal measurement and driving protocols decouple, derives the feedback law for trap placement (recovering the discontinuous Schmiedl-Seifert protocol in the open-loop limit), evaluates optimal measurement schedules with net-gain bounds, identifies deadline-induced blindness, obtains exact periodic schedules in the infinite-horizon limit as a function of measurement cost C, derives velocity envelopes for net-dissipative regimes, and generalizes to variable-precision sensors.
Significance. If the decoupling holds rigorously, the work offers a valuable analytical framework for optimal operation of mesoscopic information engines, extending known protocols with exact results and physical bounds that could inform experiments in stochastic thermodynamics. The recovery of the Schmiedl-Seifert limit and derivation of velocity envelopes add concrete physical insight.
major comments (2)
- [Section 3 (derivation of optimal feedback control law)] The central decoupling claim (that LQG dynamics separate optimal measurement and driving protocols for discrete on/off measurements with fixed cost C) is load-bearing but requires explicit verification: the dynamic-programming recursion for the value function must be shown to factorize independently of the measurement schedule, as the belief-state update with additive cost typically couples mean/variance evolution to the horizon in a non-separable manner (standard LQG separation applies to continuous observations).
- [Section 6 (infinite-horizon limit)] The macroscopic velocity envelopes (beyond which viscous drag renders the engine net-dissipative) rest on the overdamped LQG approximation; their quantitative range of validity should be bounded against the point where inertial or nonlinear effects invalidate the linear dynamics assumption.
minor comments (3)
- [Abstract] The term 'deadline-induced blindness' is introduced in the abstract without definition; a one-sentence clarification on first use would improve accessibility.
- [Section 2] Belief-state notation (mean and variance) should be introduced with explicit symbols in the methods section before its repeated use in the POMDP formulation.
- A small number of equation cross-references in the text appear inconsistent with the numbering in the displayed equations; these should be checked for accuracy.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We address the two major comments point by point below, providing clarifications and indicating the revisions we will implement.
read point-by-point responses
-
Referee: [Section 3 (derivation of optimal feedback control law)] The central decoupling claim (that LQG dynamics separate optimal measurement and driving protocols for discrete on/off measurements with fixed cost C) is load-bearing but requires explicit verification: the dynamic-programming recursion for the value function must be shown to factorize independently of the measurement schedule, as the belief-state update with additive cost typically couples mean/variance evolution to the horizon in a non-separable manner (standard LQG separation applies to continuous observations).
Authors: We appreciate the referee's emphasis on making the separation explicit. In the LQG-POMDP setting with quadratic control costs and additive measurement penalties independent of state, the Bellman recursion factors such that the optimal feedback law for trap position depends only on the current belief mean (recovering the Schmiedl-Seifert form in the open-loop case) while the measurement schedule is optimized separately via the expected reduction in variance cost. The variance evolution enters only through an additive term in the value function that does not affect the argmin over control. To address the request for verification, we will insert a short appendix deriving the factorization of the dynamic-programming equation for the binary-measurement case, confirming independence from the horizon-dependent measurement policy. revision: yes
-
Referee: [Section 6 (infinite-horizon limit)] The macroscopic velocity envelopes (beyond which viscous drag renders the engine net-dissipative) rest on the overdamped LQG approximation; their quantitative range of validity should be bounded against the point where inertial or nonlinear effects invalidate the linear dynamics assumption.
Authors: We agree that bounding the regime of validity strengthens the physical interpretation. The envelopes are derived under the standard overdamped colloidal-particle model used throughout the literature on stochastic thermodynamics. In the revision we will add a paragraph in Section 6 that estimates the inertial relaxation time (m/γ) relative to the control and measurement intervals for representative experimental parameters (micron-scale beads in aqueous solution), showing that the envelopes lie well within the overdamped regime for typical trap stiffnesses and viscosities. We will also note the velocity scale at which trap anharmonicity or inertia would require extension beyond the linear model. revision: yes
Circularity Check
No significant circularity; derivations follow from standard LQG separation under explicit model assumptions.
full rationale
The paper formulates the mesoscopic engine as a POMDP with linear dynamics and quadratic costs, then applies the LQG separation principle to decouple measurement and control. This yields analytic optimal feedback laws that recover the known Schmiedl-Seifert protocol in the open-loop limit. No equation reduces a claimed prediction to a fitted parameter by construction, no uniqueness theorem is imported from self-citation, and no ansatz is smuggled via prior work. The binary measurement cost and deadline effects are derived directly from the value function under the stated overdamped approximation; the results remain falsifiable against the model equations without circular reduction.
Axiom & Free-Parameter Ledger
free parameters (1)
- measurement cost C
axioms (2)
- domain assumption Overdamped Langevin dynamics for the particle
- domain assumption Linear-Quadratic-Gaussian control structure
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the underlying Linear-Quadratic-Gaussian (LQG) dynamics decouple the optimal measurement and driving protocols... Riccati recurrence... Pn = −κ [n(1−α)/(1+α+n(1−α))]
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Jarzynski, Physical review letters78, 2690 (1997)
C. Jarzynski, Physical review letters78, 2690 (1997)
work page 1997
-
[2]
G. E. Crooks, Physical Review E60, 2721 (1999)
work page 1999
-
[3]
Seifert, Reports on progress in physics75, 126001 (2012)
U. Seifert, Reports on progress in physics75, 126001 (2012)
work page 2012
- [4]
- [5]
- [6]
- [7]
-
[8]
J. C. Maxwell,Theory of Heat(Longmans, Green, and Company, 1871)
-
[9]
Szilard, Zeitschrift f¨ ur Physik53, 840 (1929)
L. Szilard, Zeitschrift f¨ ur Physik53, 840 (1929)
work page 1929
-
[10]
Landauer, IBM journal of research and development 5, 183 (1961)
R. Landauer, IBM journal of research and development 5, 183 (1961)
work page 1961
-
[11]
J. M. Parrondo, J. M. Horowitz, and T. Sagawa, Nature physics11, 131 (2015)
work page 2015
- [12]
-
[13]
A. B´ erut, A. Arakelyan, A. Petrosyan, S. Ciliberto, R. Dillenschneider, and E. Lutz, Nature483, 187 (2012)
work page 2012
-
[14]
I. A. Mart´ ınez,´E. Rold´ an, L. Dinis, and R. A. Rica, Soft matter13, 22 (2017)
work page 2017
- [15]
-
[16]
T. K. Saha, J. N. Lucero, J. Ehrich, D. A. Sivak, and J. Bechhoefer, Proceedings of the National Academy of Sciences118, e2023356118 (2021)
work page 2021
-
[17]
T. K. Saha, J. Ehrich, M. Gavrilov, S. Still, D. A. Sivak, and J. Bechhoefer, Physical Review Letters131, 057101 (2023)
work page 2023
- [18]
-
[19]
J. Alvarado, E. G. Teich, D. A. Sivak, and J. Bechhoefer, Annual Review of Condensed Matter Physics17(2026)
work page 2026
- [20]
-
[21]
T. Kamijima, A. Takatsu, K. Funo, and T. Sagawa, Phys- ical Review Research7, 023159 (2025)
work page 2025
- [22]
- [23]
- [24]
- [25]
-
[26]
R. Garcia-Millan, J. Sch¨ uttler, M. E. Cates, and S. A. Loos, Physical Review Letters135, 088301 (2025)
work page 2025
-
[27]
J. Sch¨ uttler, R. Garcia-Millan, M. E. Cates, and S. A. Loos, Physical Review E112, 024119 (2025)
work page 2025
-
[28]
Whitelam, Physical Review X13, 021005 (2023)
S. Whitelam, Physical Review X13, 021005 (2023)
work page 2023
-
[29]
Bechhoefer, New Journal of Physics17, 075003 (2015)
J. Bechhoefer, New Journal of Physics17, 075003 (2015)
work page 2015
-
[30]
M. Biehl and N. Virgo, inInternational Workshop on Active Inference(Springer, 2022) pp. 16–31
work page 2022
- [31]
-
[32]
B. D. Anderson and J. B. Moore,Optimal control: linear quadratic methods(Courier Corporation, 2007)
work page 2007
-
[33]
Bechhoefer,Control theory for physicists(Cambridge University Press, 2021)
J. Bechhoefer,Control theory for physicists(Cambridge University Press, 2021)
work page 2021
-
[34]
K. J. ˚Astr¨ om, Journal of mathematical analysis and ap- plications10, 174 (1965)
work page 1965
-
[35]
L. P. Kaelbling, M. L. Littman, and A. R. Cassandra, Artificial intelligence101, 99 (1998)
work page 1998
-
[36]
G. E. Uhlenbeck and L. S. Ornstein, Physical Review36, 823 (1930)
work page 1930
-
[37]
D. T. Gillespie, Physical review E54, 2084 (1996)
work page 2084
-
[38]
Bellman,Dynamic Programming(Princeton Univer- sity Press, Princeton, NJ, 1957)
R. Bellman,Dynamic Programming(Princeton Univer- sity Press, Princeton, NJ, 1957)
work page 1957
-
[39]
Bertsekas,Dynamic programming and optimal control: Volume I, Vol
D. Bertsekas,Dynamic programming and optimal control: Volume I, Vol. 4 (Athena scientific, 2012)
work page 2012
-
[40]
H. A. Simon, Econometrica, Journal of the Econometric Society24, 74 (1956)
work page 1956
-
[41]
Y. Bar-Shalom and E. Tse, IEEE Transactions on Auto- matic Control19, 494 (1974)
work page 1974
- [42]
-
[43]
R. M. Corless, G. H. Gonnet, D. E. Hare, D. J. Jeffrey, and D. E. Knuth, Advances in Computational mathe- matics5, 329 (1996)
work page 1996
-
[44]
Kalman, Journal of Basic Engineering82, 35 (1960)
R. Kalman, Journal of Basic Engineering82, 35 (1960)
work page 1960
-
[45]
L. L. Bonilla, Physical Review E100, 10.1103/phys- reve.100.022601 (2019)
- [46]
-
[47]
L. Cocconi, J. Knight, and C. Roberts, Physical Review Letters131, 188301 (2023)
work page 2023
-
[48]
C. Casert and S. Whitelam, Nature Communications15, 9128 (2024). A. Decoupling and Dimensional Reduction of the Cost Function To distinguish the state transitions within a sin- gle step, we denote the prior state before measure- ment as (µ−,Σ −), the posterior state after measurement as (µ +,Σ +), and the prior state at the next step as (µ′−,Σ ′−). In the...
work page 2024
-
[49]
The expected thermodynamic work of instanta- neously translating a harmonic trap is indepen- dent of the spatial variance. Since the trap stiff- nessκis constant, theE[x 2] terms cancel exactly: Ex[W] =−κµ +λ′ + 1 2 κ(λ′)2 +κµ +λ− 1 2 κλ2
-
[50]
The transition of the prior mean is deterministic and invariant to the measurement uncertainty Σ +: µ′− =αµ + + (1−α)λ ′
-
[51]
The information dynamics Σ ′− =α 2Σ+ + Σeq are deterministic and independent of the physical trap placementλ ′. Since Σ′− is independent ofλ ′, the future informational costg n−1(Σ′−) acts as a constant with respect to the in- ner minimization overλ ′ and factors out. We define the 9 remaining inner minimization as the intermediate physi- cal cost ˜Jn(µ+,...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.