pith. sign in

arxiv: 2606.10130 · v1 · pith:JA7YGSZNnew · submitted 2026-06-08 · ⚛️ physics.optics · cs.LG· physics.data-an

Effective Training Principles of Physical Reservoirs

Pith reviewed 2026-06-27 15:09 UTC · model grok-4.3

classification ⚛️ physics.optics cs.LGphysics.data-an
keywords reservoir computingphysical reservoirsextreme learning machineoutput pruningregularizationoptical computingnonlinear dynamicsoverfitting mitigation
0
0 comments X

The pith

Selecting readouts across the full output spectrum plus L1 or L2 regularization reduces overfitting and raises accuracy in physical reservoir computers.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests ways to train physical reservoir computers without overfitting by pruning the reservoir outputs and adding regularization. It compares search-based pruning methods against a variance-based filter and shows that keeping readouts distributed across the entire output spectrum helps most when training is non-iterative. L1 and L2 penalties are shown to improve results on strongly nonlinear benchmarks such as the Spiral task. All experiments use a fiber-optical extreme learning machine, yet the authors present the pruning and regularization steps as general tools for any physical reservoir.

Core claim

Enforcing readout selection across the full output spectrum improves performance, especially for non-iterative methods; L1 and L2 regularization significantly enhance performance on highly nonlinear tasks such as the Spiral Benchmark. These gains are obtained by comparing loss-minimizing search methods (Equal Search, Branch and Bound) against statistical filtering (Variance Filter) and random pruning, all applied to the output layer of a nonlinear fiber-optical extreme learning machine.

What carries the argument

Output pruning (Equal Search, Branch and Bound, Variance Filter) combined with L1 (LASSO) and L2 (ridge) regularization applied to the readout weights of the physical reservoir.

If this is right

  • Informed output sampling becomes essential once the latent space of the reservoir shrinks.
  • Regularization yields the largest lift precisely on tasks whose target function is highly nonlinear.
  • Non-iterative training methods gain more from full-spectrum readout selection than iterative ones.
  • Both pruning and regularization lower the computational cost of the training phase.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the same pruning statistics hold across different physical media, a single set of selection rules could be ported between optical, electronic, and mechanical reservoirs.
  • The variance-filter approach may interact with the specific spectrum of the physical nonlinearity, suggesting a platform-dependent tuning step that the paper leaves open.
  • Extending the full-spectrum constraint to multi-layer or deep physical reservoirs could further reduce the need for iterative retraining.

Load-bearing premise

That the fiber-optical extreme learning machine used for experiments is representative of physical reservoirs in general and that the observed gains from pruning and regularization will transfer to other physical substrates without platform-specific retuning.

What would settle it

A controlled test on a second physical reservoir (for example a photonic or spin-wave device) in which the same pruning rules and regularization strengths produce no accuracy gain or a clear drop on the Spiral Benchmark.

Figures

Figures reproduced from arXiv: 2606.10130 by Bennet Fischer, Glitta R. Cheeran, Juliane Heim, Mario Chemnitz, Mehmet M\"uft\"uoglu, Sobhi Saeed.

Figure 1
Figure 1. Figure 1: Physical reservoir computing diagram. a) Experimental realization of physical [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Equal Search (ES) performance comparison on Handwritten MNIST images [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Illustration of Branch&Bound (BB) algorithm on Handwritten MNIST images classification. (a) Testing and training accuracies and cost over iterations with pruning size is 15 in each step. (b) Bins selected by BB, shown on the average spectrum for 10, 50, and 100 bins. (c) Approximaiton of pruning with respect to the search space. The dashed lines are 10% (blue) and 20% (orange) probabilities. The solid line… view at source ↗
Figure 4
Figure 4. Figure 4: Comparison of different Variance Filter (VF) search approaches (testing on [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Performance of combined Variance Filter (VF) search approaches for handwritten [PITH_FULL_IMAGE:figures/full_fig_p012_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Performance of Random Search (RS) vs. optimized search methods on [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Impact of regularization on performance for a highly nonlinear task (4-spirals [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
read the original abstract

Reservoir computers benefit from the inherent complexity of optical phenomena, which provide rich, often nonlinear dynamics. However, training directly on the reservoir's output renders the system prone to overfitting and computationally inefficient during the training phase. In this work, we investigate strategies to mitigate overfitting and reduce computational overhead through output pruning and regularization. We compare loss-minimizing search methods (Equal Search and Branch and Bound) against an output-oriented statistical filtering approach (Variance Filter) and random pruning, highlighting advantages and disadvantages of each approach and the overall importance of informed reservoir output sampling, particularly for a shrinking latent space. We further demonstrate that enforcing readout selection across the full output spectrum improves performance, especially for non-iterative methods. Additionally, we examine L1 and L2 regularization techniques (LASSO and ridge regression), both of which significantly enhance performance on highly nonlinear tasks such as the Spiral Benchmark. While our methods are of general use, results are obtained from and discussed exemplarily for a nonlinear fiber-optical extreme learning machine. Overall, this study provides a deep analysis of the reservoirs' hidden-layer filtering mechanisms and the output-layer training, enabling optimized performance in physical reservoir computing systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript examines output pruning strategies (Equal Search, Branch and Bound, Variance Filter, and random pruning) and L1/L2 regularization (LASSO and ridge regression) to reduce overfitting and computational cost when training physical reservoir computers. Using a nonlinear fiber-optical extreme learning machine as the experimental platform, it reports that informed readout selection across the full output spectrum benefits non-iterative methods and that regularization improves performance on nonlinear benchmarks such as the Spiral task; the authors state that the methods are of general use while noting the results are obtained exemplarily on this substrate.

Significance. If the reported performance gains are reproducible and the pruning/regularization benefits prove transferable, the work would supply concrete, practical guidelines for training physical reservoirs that could reduce overfitting and training overhead in optical and other hardware implementations.

major comments (2)
  1. [Abstract] Abstract: the central claim that the described methods 'significantly enhance performance' and are 'of general use' is presented without any quantitative metrics, error bars, dataset sizes, or baseline comparisons, rendering the empirical contribution impossible to evaluate from the provided text.
  2. [Abstract] Abstract and closing paragraph: the assertion that the pruning and regularization principles are 'of general use' for physical reservoirs rests on results from a single fiber-optical ELM substrate; no cross-platform experiments, no ablation of substrate-specific dynamics (e.g., fiber nonlinearity versus other photonic or electronic reservoirs), and no theoretical derivation establishing platform independence are supplied, leaving transferability as an untested assumption that directly supports the broad applicability claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below and will revise the abstract accordingly to improve clarity and precision.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the described methods 'significantly enhance performance' and are 'of general use' is presented without any quantitative metrics, error bars, dataset sizes, or baseline comparisons, rendering the empirical contribution impossible to evaluate from the provided text.

    Authors: We agree that the abstract would benefit from quantitative support. In the revised manuscript we will add specific metrics (e.g., accuracy improvements on the Spiral benchmark with LASSO/ridge regularization versus unregularized baselines, including standard deviations across repeated trials) together with dataset sizes and brief baseline comparisons. This will allow readers to evaluate the empirical contribution directly from the abstract. revision: yes

  2. Referee: [Abstract] Abstract and closing paragraph: the assertion that the pruning and regularization principles are 'of general use' for physical reservoirs rests on results from a single fiber-optical ELM substrate; no cross-platform experiments, no ablation of substrate-specific dynamics (e.g., fiber nonlinearity versus other photonic or electronic reservoirs), and no theoretical derivation establishing platform independence are supplied, leaving transferability as an untested assumption that directly supports the broad applicability claim.

    Authors: The manuscript already qualifies the claim by stating that 'results are obtained from and discussed exemplarily for a nonlinear fiber-optical extreme learning machine.' We acknowledge that no cross-platform experiments or theoretical derivation of platform independence are provided, as the work focuses on training principles rather than exhaustive validation across substrates. In revision we will further tone down the phrasing in the abstract and closing paragraph to 'principles expected to be of general use for physical reservoir computers, demonstrated here on a fiber-optical ELM' to avoid implying untested universality while retaining the practical motivation. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical methods with no derivations or self-referential reductions

full rationale

The manuscript applies standard statistical techniques (pruning via Equal Search, Branch and Bound, Variance Filter; L1/L2 regularization) to experimental data from a single fiber-optical ELM. No equations, derivations, or parameter-fitting steps are described that would reduce any claim to its own inputs by construction. The statement that methods are 'of general use' is an interpretive claim, not a load-bearing derivation that collapses into fitted values or self-citations. The paper is self-contained as an empirical study without the circular patterns enumerated in the analysis criteria.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The work relies on standard assumptions of reservoir computing (fixed random hidden layer, linear readout) that are not detailed here.

pith-pipeline@v0.9.1-grok · 5754 in / 1092 out tokens · 16859 ms · 2026-06-27T15:09:00.967147+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

33 extracted references · 1 canonical work pages

  1. [1]

    The unified Reservoir Computing concept and its digital hardware implementations,

    Verstraeten, David and Schrauwen, Benjamin and D’Haene, Michiel and Stroobandt, Dirk, “The unified Reservoir Computing concept and its digital hardware implementations,” inProceedings of the 2006 EPFL LATSIS Symposium, (2006), pp. 139–140

  2. [2]

    Reservoir computing approaches to recurrent neural network training,

    M. Lukoševičius and H. Jaeger, “Reservoir computing approaches to recurrent neural network training,” Comput. Sci. Rev.3, 127–149

  3. [3]

    Real-time computing without stable states: A new framework for neural computation based on perturbations,

    W. Maass, T. Natschläger, and H. Markram, “Real-time computing without stable states: A new framework for neural computation based on perturbations,” Neural Comput.14, 2531–2560 (2002)

  4. [4]

    Recent advances in physical reservoir computing: A review,

    G. Tanaka, T. Yamane, J. B. Héroux,et al., “Recent advances in physical reservoir computing: A review,” Neural Networks115, 100–123

  5. [5]

    Extreme learning machine: Theory and applications,

    G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew, “Extreme learning machine: Theory and applications,” Neurocomputing 70, 489–501 (2006). Neural Networks

  6. [6]

    Robust forecasting using predictive generalized synchronization in reservoir computing,

    J. A. Platt, A. Wong, R. Clark,et al., “Robust forecasting using predictive generalized synchronization in reservoir computing,” Chaos: An Interdiscip. J. Nonlinear Sci.31, 123118 (2021)

  7. [7]

    Optimizing memory in reservoir computers,

    T. L. Carroll, “Optimizing memory in reservoir computers,” Chaos: An Interdiscip. J. Nonlinear Sci.32, 023123 (2022)

  8. [9]

    Embedding theory of reservoir computing and reducing reservoir network using time delays,

    X.-Y. Duan, X. Ying, S.-Y. Leng,et al., “Embedding theory of reservoir computing and reducing reservoir network using time delays,” Phys. Rev. Res.5, L022041 (2023)

  9. [10]

    Stable output feedback in reservoir computing using ridge regression,

    F. Wyffels, B. Schrauwen, and D. Stroobandt, “Stable output feedback in reservoir computing using ridge regression,” inArtificialNeuralNetworks-ICANN2008,V.Kůrková,R.Neruda,andJ.Koutník,eds.(SpringerBerlinHeidelberg, Berlin, Heidelberg, 2008), pp. 808–817

  10. [11]

    Photonic extreme learning machine by free-space optical propagation,

    D. Pierangeli, G. Marcucci, and C. Conti, “Photonic extreme learning machine by free-space optical propagation,” Photon. Res.9, 1446

  11. [12]

    All-optical reservoir computing,

    F. Duport, B. Schneider, A. Smerieri,et al., “All-optical reservoir computing,” Opt. Express20, 22783–22795 (2012)

  12. [13]

    Multiplexed networks: reservoir computing with virtual and real nodes,

    A. Röhm and K. Lüdge, “Multiplexed networks: reservoir computing with virtual and real nodes,” J. Phys. Commun. 2, 085007

  13. [14]

    Reconfigurable semiconductor laser networks based on diffractive coupling,

    D. Brunner and I. Fischer, “Reconfigurable semiconductor laser networks based on diffractive coupling,” Opt. Lett. 40, 3854

  14. [15]

    Neuromorphiccomputingviafission-basedbroadbandfrequencygeneration,

    B.Fischer,M.Chemnitz,Y.Zhu,etal.,“Neuromorphiccomputingviafission-basedbroadbandfrequencygeneration,” Adv. Sci.10, 2303835 (2023)

  15. [16]

    Principlesandmetricsofextremelearningmachinesusingahighlynonlinear fiber,

    M.Hary,D.Brunner,L.Leybov,etal.,“Principlesandmetricsofextremelearningmachinesusingahighlynonlinear fiber,” Nanophotonics14, 2733–2748 (2025)

  16. [17]

    Principlesandmetricsofextremelearningmachinesusingahighlynonlinear fiber,

    M.Hary,D.Brunner,L.Leybov,etal.,“Principlesandmetricsofextremelearningmachinesusingahighlynonlinear fiber,” Nanophotonics14, 2733–2748

  17. [18]

    Robust regularized extreme learning machine for regression using iteratively reweighted least squares,

    K. Chen, Q. Lv, Y. Lu, and Y. Dou, “Robust regularized extreme learning machine for regression using iteratively reweighted least squares,” Neurocomputing230, 345–358 (2017)

  18. [19]

    Pruning and regularization in reservoir computing,

    X. Dutoit, B. Schrauwen, J. Van Campenhout,et al., “Pruning and regularization in reservoir computing,” Neurocom- puting72, 1534–1546

  19. [20]

    Adding filters to improve reservoir computer performance,

    T. Carroll, “Adding filters to improve reservoir computer performance,” Phys. D: Nonlinear Phenom.416, 132798 (2021)

  20. [21]

    Nonlinear inference capacity of fiber-optical extreme learning machines,

    S. Saeed, M. Müftüoğlu, G. R. Cheeran,et al., “Nonlinear inference capacity of fiber-optical extreme learning machines,” Nanophotonics14, 2749–2760 (2025)

  21. [22]

    On the partition of numbers,

    G. B. Mathews, “On the partition of numbers,” Proc. Lond. Math. Soc.s1-28, 486–490 (1896)

  22. [23]

    An automatic method of solving discrete programming problems,

    A. H. Land and A. G. Doig, “An automatic method of solving discrete programming problems,” Econometrica28, 497–520 (1960)

  23. [24]

    Machine-aided near-transform-limited pulse compression in fully fiber-interconnected systems for efficient spectral broadening,

    B. Fischer, M. Müftüoglu, and M. Chemnitz, “Machine-aided near-transform-limited pulse compression in fully fiber-interconnected systems for efficient spectral broadening,” inNonlinear Optics and its Applications 2024,vol. 13004 J. M. Dudley, A. C. Peacock, B. Stiller, and G. Tissoni, eds., International Society for Optics and Photonics (SPIE, 2024), p. 1300402

  24. [25]

    Adaptive control of pulse phase in a chirped-pulse amplifier,

    A. Efimov, M. D. Moores, N. M. Beach,et al., “Adaptive control of pulse phase in a chirped-pulse amplifier,” Opt. Lett.23, 1915–1917 (1998)

  25. [26]

    Real-time reservoir computing network-based systems for detection tasks on visual contents,

    A. Jalalvand, G. Van Wallendael, and R. Van De Walle, “Real-time reservoir computing network-based systems for detection tasks on visual contents,” in2015 7th International Conference on Computational Intelligence, Communication Systems and Networks,(IEEE), pp. 146–151

  26. [27]

    A multicriteria approach to find predictive and sparse models with stable feature selection for high-dimensional data,

    A. Bommert, J. Rahnenführer, and M. Lang, “A multicriteria approach to find predictive and sparse models with stable feature selection for high-dimensional data,” Comput. Math. Methods Med.2017, 1–18

  27. [28]

    Sensitivity-guided framework for pruned and quantized reservoir computing accelerators,

    A. Jafari, M. Taheri, H. G. Mohammadi,et al., “Sensitivity-guided framework for pruned and quantized reservoir computing accelerators,”

  28. [29]

    A comprehensive study of random forest for short-term load forecasting,

    G. Dudek, “A comprehensive study of random forest for short-term load forecasting,” Energies15, 7547

  29. [30]

    Random forest-driven photonic reservoir computing model for signal modulation recognition,

    L. Zheng, S. Wen, P. Zhang,et al., “Random forest-driven photonic reservoir computing model for signal modulation recognition,” in2025 Photonics Global Conference (PGC),(IEEE), pp. 1–3

  30. [31]

    SVM–ELM: Pruning of extreme learning machine with support vector machines for regression,

    S. F. Mahmood, M. H. Marhaban, F. Z. Rokhani,et al., “SVM–ELM: Pruning of extreme learning machine with support vector machines for regression,” J. Intell. Syst.25, 555–566

  31. [32]

    OP-ELM: Optimally pruned extreme learning machine,

    Yoan Miche, A. Sorjamaa, P. Bas,et al., “OP-ELM: Optimally pruned extreme learning machine,” IEEE Trans. Neural Netw.21, 158–162

  32. [33]

    Novel and efficient randomized algorithms for feature selection,

    Z. Wang, X. Xiao, and S. Rajasekaran, “Novel and efficient randomized algorithms for feature selection,” Big Data Min. Anal.3, 208–224

  33. [34]

    Hoerl and Robert W

    A. E. Hoerl and R. W. Kennard, “Ridge regression: Biased estimation for nonorthogonal problems,” Technometrics 12, 55–67. _eprint: https://doi.org/10.1080/00401706.1970.10488634