Mathematical Reasoning Enhanced LLM for Formula Derivation: A Case Study on Fiber NLI Modellin
Pith reviewed 2026-05-15 09:05 UTC · model grok-4.3
The pith
Structured prompts let an LLM reconstruct known fiber interference formulas and derive a new approximation for multi-span C and C+L transmissions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Guiding an LLM with structured prompts reconstructs the known closed-form ISRS GN expressions and yields a novel approximation tailored for multi-span C and C+L band transmissions; the derived model produces central-channel GSNRs nearly identical to baseline models, with mean absolute error across all channels and spans below 0.109 dB.
What carries the argument
Structured prompts that break the derivation into sequential symbolic steps for manipulating physical equations of nonlinear interference.
If this is right
- The new approximation supplies a low-complexity closed-form tool for estimating GSNR in C and C+L band networks.
- Physical consistency of the LLM output is verified by direct numerical match to established models.
- The prompting workflow can be reused for other closed-form derivations in optical channel modeling.
- Mean absolute error remains below 0.109 dB for both central and edge channels across multiple spans.
Where Pith is reading between the lines
- The same prompting structure could be tested on formula derivation tasks in related fields such as wireless nonlinear distortion modeling.
- If the approach succeeds on problems absent from training data, it would strengthen the case that LLMs can generate original physical expressions.
- Network planning software might incorporate the derived approximation to speed up multi-band link design iterations.
Load-bearing premise
The structured prompts cause the LLM to perform genuine step-by-step symbolic reasoning rather than simply recalling patterns already present in its training data.
What would settle it
Apply the same prompting procedure to derive an expression for a fiber link whose span count, band allocation, or power levels lie outside the numerical validation set and compare its GSNR predictions against independent split-step Fourier simulations.
Figures
read the original abstract
Recent advances in large language models (LLMs) have demonstrated strong capabilities in code generation and text synthesis, yet their potential for symbolic physical reasoning in domain-specific scientific problems remains underexplored. We present a mathematical reasoning enhanced generative AI approach for optical communication formula derivation, focusing on the fiber nonlinear interference modelling. By guiding an LLM with structured prompts, we successfully reconstructed the known closed-form ISRS GN expressions and further derived a novel approximation tailored for multi-span C and C+L band transmissions. Numerical validations show that the LLM-derived model produces central-channel GSNRs nearly identical to baseline models, with mean absolute error across all channels and spans below 0.109 dB, demonstrating both physical consistency and practical accuracy.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a generative AI method that uses structured prompts to guide an LLM in performing symbolic mathematical reasoning for fiber nonlinear interference (NLI) modeling. It claims to have reconstructed the known closed-form ISRS GN expressions and derived a novel approximation specifically tailored for multi-span C and C+L band transmissions. Numerical validation is reported to show that the LLM-derived model yields central-channel GSNR values nearly identical to baseline models, with mean absolute error below 0.109 dB across all channels and spans.
Significance. If the central claim holds and the LLM performs genuine first-principles symbolic derivation rather than statistical completion, the work would demonstrate a viable path for using LLMs to accelerate formula derivation in optical communications, potentially enabling faster exploration of complex physical models with reduced manual effort. The reported sub-0.1 dB accuracy on GSNR predictions indicates practical relevance for system design in wideband fiber links.
major comments (2)
- [Methods / Prompt Design] The manuscript provides no explicit structured prompts, no sequence of intermediate symbolic expressions produced by the LLM, and no verification that the derived approximation reduces to the known ISRS GN model in appropriate limits (e.g., single-span or narrowband cases). Without these, the numerical MAE < 0.109 dB on central-channel GSNRs alone cannot establish that a derivation from physical principles occurred rather than pattern-matching or interpolation within the tested regime.
- [Results] §4 (Numerical Validation): the reported error metric is restricted to central-channel GSNRs; no results are shown for edge channels or full-band GSNR distributions, which are critical for assessing whether the novel multi-span C/C+L approximation maintains physical consistency outside the central-channel validation set.
minor comments (2)
- [Abstract] The abstract states that the LLM 'reconstructed the known closed-form ISRS GN expressions' but does not name the specific LLM or provide the prompt template, hindering reproducibility.
- [Derivation] Notation for the novel approximation (e.g., any new parameters introduced for multi-span C+L modeling) should be defined explicitly in the main text rather than assumed from context.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback, which has helped us strengthen the transparency and scope of the validation. We address each major comment below and have revised the manuscript to incorporate the suggested improvements.
read point-by-point responses
-
Referee: [Methods / Prompt Design] The manuscript provides no explicit structured prompts, no sequence of intermediate symbolic expressions produced by the LLM, and no verification that the derived approximation reduces to the known ISRS GN model in appropriate limits (e.g., single-span or narrowband cases). Without these, the numerical MAE < 0.109 dB on central-channel GSNRs alone cannot establish that a derivation from physical principles occurred rather than pattern-matching or interpolation within the tested regime.
Authors: We agree that explicit documentation of the prompts and derivation steps is necessary to substantiate the symbolic reasoning claim. In the revised manuscript, we have added an appendix containing the complete structured prompts used for both the ISRS GN reconstruction and the multi-span approximation derivation. We also include the full sequence of intermediate symbolic expressions generated by the LLM, along with a new subsection verifying that the novel approximation reduces exactly to the standard ISRS GN model under single-span and narrowband limits. These additions directly demonstrate the first-principles path taken rather than statistical interpolation. revision: yes
-
Referee: [Results] §4 (Numerical Validation): the reported error metric is restricted to central-channel GSNRs; no results are shown for edge channels or full-band GSNR distributions, which are critical for assessing whether the novel multi-span C/C+L approximation maintains physical consistency outside the central-channel validation set.
Authors: We acknowledge the limitation in the original validation scope. The revised §4 now includes comprehensive results for edge channels and full-band GSNR distributions across both C-band and C+L-band scenarios. Updated figures and tables report the mean absolute error for all channels and spans, which remains below 0.109 dB, confirming that the approximation preserves physical consistency beyond the central-channel cases. revision: yes
Circularity Check
No significant circularity in LLM-guided formula derivation
full rationale
The paper describes guiding an LLM via structured prompts to reconstruct known closed-form ISRS GN expressions and derive a novel multi-span approximation, with validation consisting of numerical comparison to baseline models yielding MAE below 0.109 dB on central-channel GSNRs. This match constitutes an independent external check against established models rather than any reported step reducing by construction to fitted parameters, self-citations, or input definitions. No equations, prompt details, or derivation steps are exhibited that equate the output to the inputs; the chain relies on the LLM's prompted reasoning plus post-hoc numerical falsifiability, making the result self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- prompt design parameters
axioms (1)
- domain assumption LLMs can execute reliable symbolic mathematical reasoning for domain-specific physical problems when guided by structured prompts
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
By guiding an LLM with structured prompts, we successfully reconstructed the known closed-form ISRS GN expressions and further derived a novel approximation... mean absolute error ... below 0.109 dB
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
step-by-step derivation process... Taylor expansion of the ISRS term... dimensionality reduction, approximation, and simplification
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
The GN Model of Non -Linear Propagation in Uncompensated Coherent Optical Systems,
P. Poggiolini, "The GN Model of Non -Linear Propagation in Uncompensated Coherent Optical Systems," in Journal of Lightwave Technology, vol. 30, no. 24, pp. 3857-3879, Dec.15, 2012
work page 2012
-
[2]
Scalable and Disaggregated GGN Approximation Applied to a C+L+S Optical Network,
A. D’Amico et al, "Scalable and Disaggregated GGN Approximation Applied to a C+L+S Optical Network," in Journal of Lightwave Technology, vol. 40, no. 11, pp. 3499-3511, 1 June1, 2022
work page 2022
-
[3]
EGN model of non -linear fiber propagation,
Andrea Carena et al, "EGN model of non -linear fiber propagation," Opt. Express 22, 16335-16362 (2014)
work page 2014
-
[4]
The Gaussian Noise Model in the Presence of Inter -Channel Stimulated Raman Scattering,
D. Semrau, R. I. Killey and P. Bayvel, "The Gaussian Noise Model in the Presence of Inter -Channel Stimulated Raman Scattering," in Journal of Lightwave Technology, vol. 36, no. 14, pp. 3046-3055, 15 July15, 2018
work page 2018
-
[5]
D. Semrau, R. I. Killey and P. Bayvel, "A Closed-Form Approximation of the Gaussian Noise Model in the Presence of Inter -Channel Stimulated Raman Scattering," in Journal of Lightwave Technology , vol. 37, no. 9, pp. 1924-1936, 1 May1, 2019
work page 1924
-
[6]
Experimental Test of a Closed -Form EGN Model Over C+L Bands,
Y. Jiang et al, "Experimental Test of a Closed -Form EGN Model Over C+L Bands," in JLT, vol. 43, no. 2, pp. 439-449, 15 Jan.15, 2025
work page 2025
-
[7]
H. Buglia et al, "A Closed -Form Expression for the Gaussian Noise Model in the Presence of Inter -Channel Stimulated Raman Scattering Extended for Arbitrary Loss and Fibre Length," in JLT, vol. 41, no. 11, pp. 3577-3586, 1 June1, 2023
work page 2023
-
[8]
Y. Wang et al, "AlarmGPT: an intelligent alarm analyser for optical networks using a generative pre -trained transformer," in JOCN, vol. 16, no. 6, pp. 681-694, June 2024
work page 2024
-
[9]
Large language model-based optical network log analysis using LLaMA2 with instruction tuning,
Y. Pang et al., "Large language model-based optical network log analysis using LLaMA2 with instruction tuning," in JOCN, vol. 16, no. 11, pp. 1116-1132, November 2024
work page 2024
-
[10]
Y. Zhang et al, "GPT-Enabled Digital Twin Assistant for Multi -task Cooperative Management in Autonomous Optical Network," OFC 2024
work page 2024
-
[11]
OptiComm-GPT: a GPT-based versatile research assistant for optical fiber communication systems
X. Jiang et al, "OptiComm-GPT: a GPT-based versatile research assistant for optical fiber communication systems. " Opt Express. 2024
work page 2024
-
[12]
Y. Zhang et al., "Design and Evaluation of an LLM-Based Agent for QoT Estimation and Performance Optimization in Optical Networks," in IEEE Open Journal of the Communications Society, vol. 6, pp. 7470-7484, 2025
work page 2025
-
[13]
End-to-end transport network digital twins with cloud - native SDN controllers and generative AI,
A. Abishek et al, "End-to-end transport network digital twins with cloud - native SDN controllers and generative AI," in Journal of Optical Communications and Networking, vol. 17, no. 7, pp. C70-C81, July 2025
work page 2025
-
[14]
LLM-enabled Full -stack Configuration Automation of SDM Transport Network,
C. Wang et al, "LLM-enabled Full -stack Configuration Automation of SDM Transport Network," ECOC 2024, pp. 1599-1602
work page 2024
-
[15]
A. Zhou et al, "Large Language Model -Driven AI Agent in SDN Controller Towards Intent -Based Management of Optical Networks ," ECOC 2024, W3E. 2
work page 2024
-
[16]
Y. Song et al, "Synergistic Interplay of Large Language Model and Digital Twin for Autonomous Optical Networks: Field Demonstrations," in IEEE Communications Magazine, 2024
work page 2024
-
[17]
C. Sun et al, "Experimental demonstration of local AI-Agents for lifecycle management and control automation of optical networks," in JOCN, vol. 17, no. 8, pp. C82-C92, August 2025
work page 2025
-
[18]
Generative AI -Driven Hierarchical Multi -Agent Framework for Zero -Touch Optical Networks ,
Y. Zhang et al, "Generative AI -Driven Hierarchical Multi -Agent Framework for Zero -Touch Optical Networks ," IEEE Communications Magazine, 2026
work page 2026
-
[19]
Autonomous chemical research with large language models,
Daniil A. Boiko et al, "Autonomous chemical research with large language models," in Nature, 624, 570–578 (2023)
work page 2023
-
[20]
Artificial intelligence in drug development,
K. Zhang et al, "Artificial intelligence in drug development," Artificial intelligence in drug development. Nat Med 31, 45–59 (2025)
work page 2025
-
[21]
GitHub - Telecominfraproject/oopt-gnpy: Optical Route Planning Library, Based on a Gaussian Noise Model
-
[22]
Implementation of the ISRS GN model,
GitHub - dsemrau/ISRSGNmodel. D. Semrau et al, "Implementation of the ISRS GN model, " v. 1.0 (2019)
work page 2019
-
[23]
Y. Song et al, "Efficient Three-Step Amplifier Configuration Algorithm for Dynamic C+L -Band Links in Presence of Stimulated Raman Scattering," in JLT, vol. 41, no. 5, pp. 1445-1453, 1 March1, 2023. Fig. 4. Comparisons of the LLM-GN model in a C+L-band 10-span link. (a) NLI coefficient after a single span. (b) C+L-band simulation setup. Fig. 5. Comparison...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.