When Good Equations Get Bad Scores: Improving Symbolic Regression Through Better Parameter Optimization

Boxiao Wang; Jian Cheng; Kai Li; Runxiang Wang; Yang Huang; Yifan Zhang; Zhiwei Chen; Ziwen Zhang

arxiv: 2605.23272 · v1 · pith:A2DILOMEnew · submitted 2026-05-22 · 💻 cs.LG · cs.AI

When Good Equations Get Bad Scores: Improving Symbolic Regression Through Better Parameter Optimization

Boxiao Wang , Kai Li , Zhiwei Chen , Yang Huang , Runxiang Wang , Ziwen Zhang , Yifan Zhang , Jian Cheng This is my paper

Pith reviewed 2026-05-25 04:43 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords symbolic regressionparameter optimizationbi-level optimizationnon-convex optimizationstructure-aware fittingsemantics-guided evaluationequation discovery

0 comments

The pith

Exploiting structural and semantic priors in symbolic expressions improves parameter optimization and resolves the good-structure-bad-score problem.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Symbolic regression searches for equations by trying different structures in an outer loop while fitting their parameters in an inner loop. Nonlinear operators make the inner optimization non-convex, so fast local solvers often produce poor fits and low scores even for correct structures, which misleads the search. The paper introduces SAGE-Fit to exploit the dual native priors of symbolic expressions through tailored modules that improve fitting quality. This plug-and-play component raises evaluation fidelity and boosts the performance of existing symbolic regression systems. A sympathetic reader would care because it targets a core bottleneck that wastes search effort on otherwise valid equations.

Core claim

The central claim is that the 'Good Structure, Bad Score' phenomenon arises from non-convex parameter optimization in the inner loop of bi-level SR frameworks, and that a Structure-Aware and Semantics-Guided Evaluator (SAGE-Fit) can mitigate it by designing tailored modules that capitalize on the dual native priors of symbolic expressions, thereby enhancing evaluation fidelity and improving various SR systems.

What carries the argument

SAGE-Fit, a fitting framework that exploits structural and semantic priors unique to symbolic expressions through tailored modules for each property.

If this is right

Significantly enhances evaluation fidelity of candidate structures.
Universally improves the performance of various symbolic regression systems as a plug-and-play module.
Reduces the impact of poor local minima in parameter fitting.
Mitigates misguidance of the outer-loop search away from true equations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach may enable more accurate scientific knowledge discovery by ensuring correct equations are not discarded due to fitting issues.
Future work could explore integrating these priors directly into the search process rather than just evaluation.
Testing on problems with known ground truth would show whether score improvements correlate with better equation recovery rates.

Load-bearing premise

The dual native priors of symbolic expressions can be capitalized on via tailored modules to reliably improve fitting quality without introducing new optimization issues or biases.

What would settle it

A controlled experiment on synthetic data with known correct structures where standard BFGS yields poor scores but SAGE-Fit yields high scores and leads to correct selection in the outer loop.

Figures

Figures reproduced from arXiv: 2605.23272 by Boxiao Wang, Jian Cheng, Kai Li, Runxiang Wang, Yang Huang, Yifan Zhang, Zhiwei Chen, Ziwen Zhang.

**Figure 1.** Figure 1: “Needle-in-a-basin” phenomenon in SR. We select three representative cases from llmsrbench. For each task, we [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗

**Figure 2.** Figure 2: Non-convexity and multimodality in nonlinear SR: disconnected basins and local-optimum traps. [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

read the original abstract

Symbolic Regression (SR) plays a central role in scientific knowledge discovery by distilling mathematical equations from observational data. Most existing SR methods function within a bi-level optimization framework: an outer loop that searches for the discrete equation structure, and an inner loop that optimizes the continuous parameters of that structure. Crucially, parameter-fitting quality directly determines a structure's score and thus the outer-loop search. However, nonlinear operators make the inner loop highly non-convex, and budget-driven reliance on fast local solvers (e.g., BFGS) often yields poor local minima and underestimated scores for correct structures. This ``Good Structure, Bad Score'' phenomenon becomes a key bottleneck, degrading efficiency and misguiding the search away from the true equation. To resolve this, we propose SAGE-Fit (Structure-Aware and Semantics-Guided Evaluator for Symbolic Regression), an SR-native fitting framework that exploits the dual native priors of symbolic expressions. By capitalizing on the structural and semantic priors unique to SR, we design tailored modules for each property, thereby effectively mitigating this optimization bottleneck. Extensive experiments demonstrate that our approach, as a plug-and-play module, significantly enhances evaluation fidelity and universally improves the performance of various SR systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SAGE-Fit targets the inner-loop fitting bottleneck in symbolic regression with structure and semantics modules, but the abstract gives little concrete evidence on whether the gains are real or substantial.

read the letter

The paper flags a practical problem in symbolic regression: the inner parameter optimization step often lands in poor local minima for nonlinear operators, so even correct structures get low scores and get thrown out by the outer search. SAGE-Fit is their attempt to fix that by building modules that use the structural and semantic properties of expressions as priors. The framing of the bi-level setup and the “Good Structure, Bad Score” issue is straightforward and matches what people in the area already complain about. Presenting it as a plug-and-play addition is a reasonable way to make the idea testable against existing SR pipelines. That part is useful. The soft spot is that the description stays at the level of “we design tailored modules for each property” without showing what those modules actually compute or how they avoid creating new biases or extra hyperparameters. The claim of “universal improvement” across SR systems is stated but not backed by any numbers or ablation details in what’s visible. If the full paper has reproducible code, clear pseudocode for the modules, and results on standard benchmarks that beat strong baselines on fitting quality, the contribution becomes more solid. Without that, it reads as an idea that still needs verification. This is mainly for people already working on symbolic regression or bi-level optimization in equation discovery. A reader who cares about inner-loop tricks would find the motivation familiar and the proposed direction sensible. It deserves peer review because the underlying bottleneck is real and the approach is internally consistent, even if the current write-up leaves the magnitude of the fix open to question. Send it out, but flag that reviewers will want implementation specifics and controlled experiments.

Referee Report

2 major / 1 minor

Summary. The paper identifies the 'Good Structure, Bad Score' issue in symbolic regression (SR) arising from bi-level optimization: the outer loop searches discrete equation structures while the inner loop fits continuous parameters, but non-convexity and reliance on fast local solvers (e.g., BFGS) frequently yield poor local minima and underestimated scores for correct structures. It proposes SAGE-Fit, an SR-native plug-and-play framework that exploits structural and semantic priors via tailored modules to improve inner-loop fitting quality and thereby guide the outer search more effectively. Extensive experiments are claimed to show universal performance gains across SR systems.

Significance. If the modules reliably improve fitting fidelity without new biases or optimization pathologies, the work would address a recognized bottleneck in SR pipelines and could be adopted broadly as a modular enhancement. The emphasis on SR-native priors (rather than generic optimizers) is a targeted contribution; reproducible code or parameter-free derivations would strengthen this, but none are noted in the provided abstract.

major comments (2)

[Abstract] The central claim that tailored modules 'effectively mitigating this optimization bottleneck' rests on the assumption that structural and semantic priors can be capitalized without introducing new issues; however, the abstract supplies no derivation details, pseudocode, or verification that the modules achieve the claimed mitigation (e.g., no mention of how structure-aware components interact with BFGS or handle nonlinear operators).
[Abstract] Experimental evidence is asserted ('extensive experiments demonstrate... universally improves') but no quantitative results, baselines, or ablation on the modules are visible in the abstract; this leaves the load-bearing claim that evaluation fidelity is enhanced without supporting numbers or controls.

minor comments (1)

[Abstract] Clarify the precise definition of 'dual native priors' and how each module maps to structural vs. semantic properties.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on our manuscript. We address each major comment below, focusing on the abstract's level of detail while noting that the full paper provides the requested technical content.

read point-by-point responses

Referee: [Abstract] The central claim that tailored modules 'effectively mitigating this optimization bottleneck' rests on the assumption that structural and semantic priors can be capitalized without introducing new issues; however, the abstract supplies no derivation details, pseudocode, or verification that the modules achieve the claimed mitigation (e.g., no mention of how structure-aware components interact with BFGS or handle nonlinear operators).

Authors: The abstract serves as a high-level summary and is constrained by length. Full derivation details, pseudocode for the structure-aware and semantics-guided modules, and their specific interactions with local solvers such as BFGS (including handling of nonlinear operators) are provided in Section 3 of the manuscript. We can revise the abstract to include a one-sentence high-level description of the modules if the editor requests. revision: partial
Referee: [Abstract] Experimental evidence is asserted ('extensive experiments demonstrate... universally improves') but no quantitative results, baselines, or ablation on the modules are visible in the abstract; this leaves the load-bearing claim that evaluation fidelity is enhanced without supporting numbers or controls.

Authors: The abstract summarizes the outcome of the experiments at a high level, as is conventional. Quantitative results, baseline comparisons, and module ablations are reported in detail in Sections 4 and 5, supported by tables and figures. We are willing to incorporate one or two key quantitative highlights into the abstract during revision to strengthen the summary. revision: partial

Circularity Check

0 steps flagged

No significant circularity; proposal is a design and empirical claim

full rationale

The paper describes a bi-level optimization setup in SR and proposes SAGE-Fit as a plug-and-play module that exploits structural and semantic priors via tailored components. No derivation chain, equations, or fitted parameters are shown that reduce by construction to the inputs; the central claim is an engineering improvement evaluated experimentally rather than a tautological prediction or self-referential definition. No self-citations or uniqueness theorems are invoked in the provided text to bear load on the result.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.0 · 5763 in / 773 out tokens · 19269 ms · 2026-05-25T04:43:10.583526+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

51 extracted references · 51 canonical work pages · 1 internal anchor

[1]

Genetic Programming Theory and Practice IX , pages=

FFX: Fast, scalable, deterministic symbolic regression technology , author=. Genetic Programming Theory and Practice IX , pages=. 2011 , publisher=

work page 2011
[2]

Proceedings of the national academy of sciences , volume=

Discovering governing equations from data by sparse identification of nonlinear dynamical systems , author=. Proceedings of the national academy of sciences , volume=. 2016 , publisher=

work page 2016
[3]

IFAC-PapersOnLine , volume=

Sparse identification of nonlinear dynamics with control (SINDYc) , author=. IFAC-PapersOnLine , volume=. 2016 , publisher=

work page 2016
[4]

Fluids , volume=

Equation discovery using fast function extraction: a deterministic symbolic regression approach , author=. Fluids , volume=. 2019 , publisher=

work page 2019
[5]

International Conference on Computer Aided Systems Theory , pages=

Symbolic regression with fast function extraction and nonlinear least squares optimization , author=. International Conference on Computer Aided Systems Theory , pages=. 2022 , organization=

work page 2022
[6]

Statistics and computing , volume=

Genetic programming as a means for programming computers by natural selection , author=. Statistics and computing , volume=. 1994 , publisher=

work page 1994
[7]

science , volume=

Distilling free-form natural laws from experimental data , author=. science , volume=. 2009 , publisher=

work page 2009
[8]

Proceedings of the genetic and evolutionary computation conference , pages=

Linear scaling with and within semantic backpropagation-based genetic programming for symbolic regression , author=. Proceedings of the genetic and evolutionary computation conference , pages=

work page
[9]

2023 , eprint=

Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl , author=. 2023 , eprint=

work page 2023
[10]

Extrapolation and learning equations

Extrapolation and learning equations , author=. arXiv preprint arXiv:1610.02995 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[11]

International Conference on Machine Learning , pages=

Learning equations for extrapolation and control , author=. International Conference on Machine Learning , pages=. 2018 , organization=

work page 2018
[12]

IEEE Transactions on Evolutionary Computation , year=

Evolving equation learner for symbolic regression , author=. IEEE Transactions on Evolutionary Computation , year=

work page
[13]

Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients , author=. Proc. of the International Conference on Learning Representations , year=

work page
[14]

arXiv preprint arXiv:2505.10762 , year=

Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics , author=. arXiv preprint arXiv:2505.10762 , year=

work page arXiv
[15]

Advances in Neural Information Processing Systems , year=

A Unified Framework for Deep Symbolic Regression , author=. Advances in Neural Information Processing Systems , year=

work page
[16]

Proceedings of the 41st International Conference on Machine Learning , pages =

A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data , author =. Proceedings of the 41st International Conference on Machine Learning , pages =. 2024 , editor =

work page 2024
[17]

International Conference on Machine Learning , pages=

Deep generative symbolic regression with monte-carlo-tree-search , author=. International Conference on Machine Learning , pages=. 2023 , organization=

work page 2023
[18]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Improving Monte Carlo Tree Search for Symbolic Regression , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

work page
[19]

In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery

Merler, Matteo and Haitsiukevich, Katsiaryna and Dainese, Nicola and Marttinen, Pekka. In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop). 2024. doi:10.18653/v1/2024.acl-srw.49

work page doi:10.18653/v1/2024.acl-srw.49 2024
[20]

2025 , eprint=

Finetuning Large Language Model as an Effective Symbolic Regressor , author=. 2025 , eprint=

work page 2025
[21]

Structural and Multidisciplinary Optimization , volume=

Metamodeling by symbolic regression and Pareto simulated annealing , author=. Structural and Multidisciplinary Optimization , volume=. 2008 , publisher=

work page 2008
[22]

proceedings of the genetic and evolutionary computation conference , pages=

Simulated annealing for symbolic regression , author=. proceedings of the genetic and evolutionary computation conference , pages=

work page
[23]

2009 21st IEEE International Conference on Tools with Artificial Intelligence , pages=

Evolution strategies for constants optimization in genetic programming , author=. 2009 21st IEEE International Conference on Tools with Artificial Intelligence , pages=. 2009 , organization=

work page 2009
[24]

International Conference on Intelligent Systems Design and Applications , pages=

Continuous Cartesian Genetic Programming with Particle Swarm Optimization , author=. International Conference on Intelligent Systems Design and Applications , pages=. 2018 , organization=

work page 2018
[25]

Evolutionary Design of a PSO-Tuned Multigene Symbolic Regression Genetic Programming Model for River Flow Forecasting , author=. Int. J. Adv. Comput. Sci. Appl , volume=

work page
[26]

Proceedings of the 10th annual conference on Genetic and evolutionary computation , pages=

Using differential evolution for symbolic regression and numerical constant creation , author=. Proceedings of the 10th annual conference on Genetic and evolutionary computation , pages=

work page
[27]

and Paige, Brooks and Hern\'

Kusner, Matt J. and Paige, Brooks and Hern\'. Grammar variational autoencoder , year =. Proceedings of the 34th International Conference on Machine Learning - Volume 70 , pages =

work page
[28]

arXiv preprint arXiv:2104.05417 , year=

An approach to symbolic regression using feyn , author=. arXiv preprint arXiv:2104.05417 , year=

work page arXiv
[29]

Proceedings of the 2020 genetic and evolutionary computation conference companion , pages=

Operon C++ an efficient genetic programming framework for symbolic regression , author=. Proceedings of the 2020 genetic and evolutionary computation conference companion , pages=

work page 2020
[30]

Classification, clustering, and data analysis: recent advances and applications , pages=

Symbolic regression analysis , author=. Classification, clustering, and data analysis: recent advances and applications , pages=. 2002 , publisher=

work page 2002
[31]

Science advances , volume=

AI Feynman: A physics-inspired method for symbolic regression , author=. Science advances , volume=. 2020 , publisher=

work page 2020
[32]

Artificial Intelligence Review , volume=

Interpretable scientific discovery with symbolic regression: a review , author=. Artificial Intelligence Review , volume=. 2024 , publisher=

work page 2024
[33]

arXiv e-prints , pages=

Benchmarking symbolic regression constant optimization schemes , author=. arXiv e-prints , pages=

work page
[34]

2026 , eprint=

An Empirical Investigation of Neural ODEs and Symbolic Regression for Dynamical Systems , author=. 2026 , eprint=

work page 2026
[35]

SIAM Journal on Numerical Analysis , volume=

Algorithms for the solution of the nonlinear least-squares problem , author=. SIAM Journal on Numerical Analysis , volume=. 1978 , publisher=

work page 1978
[36]

Genetic Programming and Evolvable Machines , volume=

Parameter identification for symbolic regression using nonlinear least squares , author=. Genetic Programming and Evolvable Machines , volume=. 2020 , publisher=

work page 2020
[37]

Advances in stochastic and deterministic global optimization , pages=

On the least-squares fitting of data by sinusoids , author=. Advances in stochastic and deterministic global optimization , pages=. 2016 , publisher=

work page 2016
[38]

SIAM Journal on Scientific Computing , volume=

Deflation Techniques for Finding Multiple Local Minima of a Nonlinear Least Squares Problem , author=. SIAM Journal on Scientific Computing , volume=. 2025 , publisher=

work page 2025
[39]

Journal of Symbolic Computation , volume=

Effects of reducing redundant parameters in parameter optimization for symbolic regression using genetic programming , author=. Journal of Symbolic Computation , volume=. 2025 , publisher=

work page 2025
[40]

Advances in neural information processing systems , volume=

Stein variational gradient descent: A general purpose bayesian inference algorithm , author=. Advances in neural information processing systems , volume=

work page
[41]

Journal of Machine Learning Research , volume=

On the geometry of Stein variational gradient descent , author=. Journal of Machine Learning Research , volume=

work page
[42]

Department of Civil and Environmental Engineering Duke University August , volume=

The Levenberg-Marquardt algorithm for nonlinear least squares curve-fitting problems , author=. Department of Civil and Environmental Engineering Duke University August , volume=

work page
[43]

Advances in neural information processing systems , volume=

Stein variational gradient descent with matrix-valued kernels , author=. Advances in neural information processing systems , volume=

work page
[44]

Journal of global optimization , volume=

Differential evolution--a simple and efficient heuristic for global optimization over continuous spaces , author=. Journal of global optimization , volume=. 1997 , publisher=

work page 1997
[45]

SIAM Journal on optimization , volume=

An interior trust region approach for nonlinear minimization subject to bounds , author=. SIAM Journal on optimization , volume=. 1996 , publisher=

work page 1996
[46]

Numerische Mathematik , volume=

A Levenberg--Marquardt method for large nonlinear least-squares problems with dynamic accuracy in functions and gradients , author=. Numerische Mathematik , volume=. 2018 , publisher=

work page 2018
[47]

Inverse problems , volume=

Separable nonlinear least squares: the variable projection method and its applications , author=. Inverse problems , volume=. 2003 , publisher=

work page 2003
[48]

Reddy , booktitle=

Parshin Shojaee and Kazem Meidani and Shashank Gupta and Amir Barati Farimani and Chandan K. Reddy , booktitle=. 2025 , url=

work page 2025
[49]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Symbolic Regression with a Learned Concept Library , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

work page
[50]

2025 , url=

Hengzhe Zhang and Qi Chen and Bing XUE and Wolfgang Banzhaf and Mengjie Zhang , booktitle=. 2025 , url=

work page 2025
[51]

Ioannou and Fei-Yue Wang , title =

Zelin Guo and Siqi Wang and Yonglin Tian and Jing Yang and Hui Yu and Xiaoxiang Na and Levente Kovács and Li Li and Petros A. Ioannou and Fei-Yue Wang , title =. Proceedings of the National Academy of Sciences , volume =. 2025 , doi =

work page 2025

[1] [1]

Genetic Programming Theory and Practice IX , pages=

FFX: Fast, scalable, deterministic symbolic regression technology , author=. Genetic Programming Theory and Practice IX , pages=. 2011 , publisher=

work page 2011

[2] [2]

Proceedings of the national academy of sciences , volume=

Discovering governing equations from data by sparse identification of nonlinear dynamical systems , author=. Proceedings of the national academy of sciences , volume=. 2016 , publisher=

work page 2016

[3] [3]

IFAC-PapersOnLine , volume=

Sparse identification of nonlinear dynamics with control (SINDYc) , author=. IFAC-PapersOnLine , volume=. 2016 , publisher=

work page 2016

[4] [4]

Fluids , volume=

Equation discovery using fast function extraction: a deterministic symbolic regression approach , author=. Fluids , volume=. 2019 , publisher=

work page 2019

[5] [5]

International Conference on Computer Aided Systems Theory , pages=

Symbolic regression with fast function extraction and nonlinear least squares optimization , author=. International Conference on Computer Aided Systems Theory , pages=. 2022 , organization=

work page 2022

[6] [6]

Statistics and computing , volume=

Genetic programming as a means for programming computers by natural selection , author=. Statistics and computing , volume=. 1994 , publisher=

work page 1994

[7] [7]

science , volume=

Distilling free-form natural laws from experimental data , author=. science , volume=. 2009 , publisher=

work page 2009

[8] [8]

Proceedings of the genetic and evolutionary computation conference , pages=

Linear scaling with and within semantic backpropagation-based genetic programming for symbolic regression , author=. Proceedings of the genetic and evolutionary computation conference , pages=

work page

[9] [9]

2023 , eprint=

Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl , author=. 2023 , eprint=

work page 2023

[10] [10]

Extrapolation and learning equations

Extrapolation and learning equations , author=. arXiv preprint arXiv:1610.02995 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[11] [11]

International Conference on Machine Learning , pages=

Learning equations for extrapolation and control , author=. International Conference on Machine Learning , pages=. 2018 , organization=

work page 2018

[12] [12]

IEEE Transactions on Evolutionary Computation , year=

Evolving equation learner for symbolic regression , author=. IEEE Transactions on Evolutionary Computation , year=

work page

[13] [13]

Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients , author=. Proc. of the International Conference on Learning Representations , year=

work page

[14] [14]

arXiv preprint arXiv:2505.10762 , year=

Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics , author=. arXiv preprint arXiv:2505.10762 , year=

work page arXiv

[15] [15]

Advances in Neural Information Processing Systems , year=

A Unified Framework for Deep Symbolic Regression , author=. Advances in Neural Information Processing Systems , year=

work page

[16] [16]

Proceedings of the 41st International Conference on Machine Learning , pages =

A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data , author =. Proceedings of the 41st International Conference on Machine Learning , pages =. 2024 , editor =

work page 2024

[17] [17]

International Conference on Machine Learning , pages=

Deep generative symbolic regression with monte-carlo-tree-search , author=. International Conference on Machine Learning , pages=. 2023 , organization=

work page 2023

[18] [18]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Improving Monte Carlo Tree Search for Symbolic Regression , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

work page

[19] [19]

In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery

Merler, Matteo and Haitsiukevich, Katsiaryna and Dainese, Nicola and Marttinen, Pekka. In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop). 2024. doi:10.18653/v1/2024.acl-srw.49

work page doi:10.18653/v1/2024.acl-srw.49 2024

[20] [20]

2025 , eprint=

Finetuning Large Language Model as an Effective Symbolic Regressor , author=. 2025 , eprint=

work page 2025

[21] [21]

Structural and Multidisciplinary Optimization , volume=

Metamodeling by symbolic regression and Pareto simulated annealing , author=. Structural and Multidisciplinary Optimization , volume=. 2008 , publisher=

work page 2008

[22] [22]

proceedings of the genetic and evolutionary computation conference , pages=

Simulated annealing for symbolic regression , author=. proceedings of the genetic and evolutionary computation conference , pages=

work page

[23] [23]

2009 21st IEEE International Conference on Tools with Artificial Intelligence , pages=

Evolution strategies for constants optimization in genetic programming , author=. 2009 21st IEEE International Conference on Tools with Artificial Intelligence , pages=. 2009 , organization=

work page 2009

[24] [24]

International Conference on Intelligent Systems Design and Applications , pages=

Continuous Cartesian Genetic Programming with Particle Swarm Optimization , author=. International Conference on Intelligent Systems Design and Applications , pages=. 2018 , organization=

work page 2018

[25] [25]

Evolutionary Design of a PSO-Tuned Multigene Symbolic Regression Genetic Programming Model for River Flow Forecasting , author=. Int. J. Adv. Comput. Sci. Appl , volume=

work page

[26] [26]

Proceedings of the 10th annual conference on Genetic and evolutionary computation , pages=

Using differential evolution for symbolic regression and numerical constant creation , author=. Proceedings of the 10th annual conference on Genetic and evolutionary computation , pages=

work page

[27] [27]

and Paige, Brooks and Hern\'

Kusner, Matt J. and Paige, Brooks and Hern\'. Grammar variational autoencoder , year =. Proceedings of the 34th International Conference on Machine Learning - Volume 70 , pages =

work page

[28] [28]

arXiv preprint arXiv:2104.05417 , year=

An approach to symbolic regression using feyn , author=. arXiv preprint arXiv:2104.05417 , year=

work page arXiv

[29] [29]

Proceedings of the 2020 genetic and evolutionary computation conference companion , pages=

Operon C++ an efficient genetic programming framework for symbolic regression , author=. Proceedings of the 2020 genetic and evolutionary computation conference companion , pages=

work page 2020

[30] [30]

Classification, clustering, and data analysis: recent advances and applications , pages=

Symbolic regression analysis , author=. Classification, clustering, and data analysis: recent advances and applications , pages=. 2002 , publisher=

work page 2002

[31] [31]

Science advances , volume=

AI Feynman: A physics-inspired method for symbolic regression , author=. Science advances , volume=. 2020 , publisher=

work page 2020

[32] [32]

Artificial Intelligence Review , volume=

Interpretable scientific discovery with symbolic regression: a review , author=. Artificial Intelligence Review , volume=. 2024 , publisher=

work page 2024

[33] [33]

arXiv e-prints , pages=

Benchmarking symbolic regression constant optimization schemes , author=. arXiv e-prints , pages=

work page

[34] [34]

2026 , eprint=

An Empirical Investigation of Neural ODEs and Symbolic Regression for Dynamical Systems , author=. 2026 , eprint=

work page 2026

[35] [35]

SIAM Journal on Numerical Analysis , volume=

Algorithms for the solution of the nonlinear least-squares problem , author=. SIAM Journal on Numerical Analysis , volume=. 1978 , publisher=

work page 1978

[36] [36]

Genetic Programming and Evolvable Machines , volume=

Parameter identification for symbolic regression using nonlinear least squares , author=. Genetic Programming and Evolvable Machines , volume=. 2020 , publisher=

work page 2020

[37] [37]

Advances in stochastic and deterministic global optimization , pages=

On the least-squares fitting of data by sinusoids , author=. Advances in stochastic and deterministic global optimization , pages=. 2016 , publisher=

work page 2016

[38] [38]

SIAM Journal on Scientific Computing , volume=

Deflation Techniques for Finding Multiple Local Minima of a Nonlinear Least Squares Problem , author=. SIAM Journal on Scientific Computing , volume=. 2025 , publisher=

work page 2025

[39] [39]

Journal of Symbolic Computation , volume=

Effects of reducing redundant parameters in parameter optimization for symbolic regression using genetic programming , author=. Journal of Symbolic Computation , volume=. 2025 , publisher=

work page 2025

[40] [40]

Advances in neural information processing systems , volume=

Stein variational gradient descent: A general purpose bayesian inference algorithm , author=. Advances in neural information processing systems , volume=

work page

[41] [41]

Journal of Machine Learning Research , volume=

On the geometry of Stein variational gradient descent , author=. Journal of Machine Learning Research , volume=

work page

[42] [42]

Department of Civil and Environmental Engineering Duke University August , volume=

The Levenberg-Marquardt algorithm for nonlinear least squares curve-fitting problems , author=. Department of Civil and Environmental Engineering Duke University August , volume=

work page

[43] [43]

Advances in neural information processing systems , volume=

Stein variational gradient descent with matrix-valued kernels , author=. Advances in neural information processing systems , volume=

work page

[44] [44]

Journal of global optimization , volume=

Differential evolution--a simple and efficient heuristic for global optimization over continuous spaces , author=. Journal of global optimization , volume=. 1997 , publisher=

work page 1997

[45] [45]

SIAM Journal on optimization , volume=

An interior trust region approach for nonlinear minimization subject to bounds , author=. SIAM Journal on optimization , volume=. 1996 , publisher=

work page 1996

[46] [46]

Numerische Mathematik , volume=

A Levenberg--Marquardt method for large nonlinear least-squares problems with dynamic accuracy in functions and gradients , author=. Numerische Mathematik , volume=. 2018 , publisher=

work page 2018

[47] [47]

Inverse problems , volume=

Separable nonlinear least squares: the variable projection method and its applications , author=. Inverse problems , volume=. 2003 , publisher=

work page 2003

[48] [48]

Reddy , booktitle=

Parshin Shojaee and Kazem Meidani and Shashank Gupta and Amir Barati Farimani and Chandan K. Reddy , booktitle=. 2025 , url=

work page 2025

[49] [49]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Symbolic Regression with a Learned Concept Library , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

work page

[50] [50]

2025 , url=

Hengzhe Zhang and Qi Chen and Bing XUE and Wolfgang Banzhaf and Mengjie Zhang , booktitle=. 2025 , url=

work page 2025

[51] [51]

Ioannou and Fei-Yue Wang , title =

Zelin Guo and Siqi Wang and Yonglin Tian and Jing Yang and Hui Yu and Xiaoxiang Na and Levente Kovács and Li Li and Petros A. Ioannou and Fei-Yue Wang , title =. Proceedings of the National Academy of Sciences , volume =. 2025 , doi =

work page 2025