pith. sign in

hub Mixed citations

Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl

Mixed citation behavior. Most common role is background (43%).

56 Pith papers citing it
Background 43% of classified citations
abstract

PySR is an open-source library for practical symbolic regression, a type of machine learning which aims to discover human-interpretable symbolic models. PySR was developed to democratize and popularize symbolic regression for the sciences, and is built on a high-performance distributed back-end, a flexible search algorithm, and interfaces with several deep learning packages. PySR's internal search algorithm is a multi-population evolutionary algorithm, which consists of a unique evolve-simplify-optimize loop, designed for optimization of unknown scalar constants in newly-discovered empirical expressions. PySR's backend is the extremely optimized Julia library SymbolicRegression.jl, which can be used directly from Julia. It is capable of fusing user-defined operators into SIMD kernels at runtime, performing automatic differentiation, and distributing populations of expressions to thousands of cores across a cluster. In describing this software, we also introduce a new benchmark, "EmpiricalBench," to quantify the applicability of symbolic regression algorithms in science. This benchmark measures recovery of historical empirical equations from original and synthetic datasets.

hub tools

citation-role summary

background 6 method 6 baseline 2

citation-polarity summary

clear filters

representative citing papers

SEVerA: Verified Synthesis of Self-Evolving Agents

cs.LG · 2026-03-26 · unverdicted · novelty 8.0

SEVerA uses Formally Guarded Generative Models and a three-stage Search-Verification-Learning process to synthesize self-evolving agents that satisfy hard formal constraints while improving task performance.

KAN: Kolmogorov-Arnold Networks

cs.LG · 2024-04-30 · conditional · novelty 8.0

KANs with learnable univariate spline activations on edges achieve better accuracy than MLPs with fewer parameters, faster scaling, and direct visualization for scientific discovery.

Diversified Residual Symbolic Regression

cs.NE · 2026-05-15 · unverdicted · novelty 7.0

DRSR uses Quality-Diversity to produce diverse symbolic regression expressions differing in residual distributions, enabling post-search selection on synthetic and astronomical data.

Machine Collective Intelligence for Explainable Scientific Discovery

cs.AI · 2026-04-30 · unverdicted · novelty 7.0

Machine collective intelligence uses coordinated AI agents to evolve symbolic hypotheses and recover governing equations from observations in deterministic, stochastic, and uncharacterized systems, achieving up to six orders of magnitude better extrapolation than neural networks with 5-40 parameters

Neuro-Symbolic ODE Discovery with Latent Grammar Flow

cs.LG · 2026-04-17 · unverdicted · novelty 7.0

Latent Grammar Flow discovers ODEs by placing grammar-based equation representations in a discrete latent space, using a behavioral loss to cluster similar equations, and sampling via a discrete flow model guided by data fit and constraints.

Symbolic recovery of PDEs from measurement data

cs.LG · 2026-02-17 · unverdicted · novelty 7.0

Symbolic rational-function networks recover an admissible PDE from noiseless complete measurements and select the regularization-minimizing parameterization within the architecture.

AlphaEvolve: A coding agent for scientific and algorithmic discovery

cs.AI · 2025-06-16 · unverdicted · novelty 7.0

AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

Symbolic Density Estimation for Discrete Distributions

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

SDE recovers closed-form PMFs for discrete distributions via evolutionary search guided by domain priors, recovering all benchmark families with accurate parameters and improving mixture fits on real data.

citing papers explorer

Showing 9 of 9 citing papers after filters.

  • Evaluating Large Language Models in Scientific Discovery cs.AI · 2025-12-17 · unverdicted · none · ref 54 · internal anchor

    The SDE benchmark shows LLMs lag on scientific discovery tasks relative to general science tests, with diminishing scaling returns and shared weaknesses across models.

  • Back to the Beginning of Heuristic Design: Bridging Code and Knowledge with LLMs cs.AI · 2026-05-07 · unverdicted · none · ref 65 · internal anchor

    A knowledge-first approach to LLM-driven automatic heuristic design in combinatorial optimization yields better discovery efficiency, transfer, and generalization than code-centric baselines by formalizing a distortion-compression trade-off.

  • Machine Collective Intelligence for Explainable Scientific Discovery cs.AI · 2026-04-30 · unverdicted · none · ref 20 · internal anchor

    Machine collective intelligence uses coordinated AI agents to evolve symbolic hypotheses and recover governing equations from observations in deterministic, stochastic, and uncharacterized systems, achieving up to six orders of magnitude better extrapolation than neural networks with 5-40 parameters

  • AlphaEvolve: A coding agent for scientific and algorithmic discovery cs.AI · 2025-06-16 · unverdicted · none · ref 21 · internal anchor

    AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

  • STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery cs.AI · 2026-05-18 · unverdicted · none · ref 4 · internal anchor

    STRIDE is a self-reflective agent framework that improves accuracy, OOD robustness, and structural recovery in LLM-based symbolic regression by integrating generation, evaluation, repair, and diversity-preserving memory.

  • GESR: A Genetic Programming-Based Symbolic Regression Method with Gene Editing cs.AI · 2026-05-11 · unverdicted · none · ref 8 · 3 links · internal anchor

    GESR uses two BERT models to intelligently direct mutations and crossovers inside genetic programming, yielding higher efficiency and competitive accuracy on symbolic regression benchmarks.

  • Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation cs.AI · 2026-05-08 · unverdicted · none · ref 2 · internal anchor

    DoLQ employs a sampler agent, parameter optimizer, and LLM-based scientist agent to iteratively propose, refine, and evaluate ODE candidates, yielding higher success rates and better symbolic term recovery than prior symbolic regression methods on multi-dimensional benchmarks.

  • Programmatic Context Augmentation for LLM-based Symbolic Regression cs.AI · 2026-05-04 · unverdicted · none · ref 17 · internal anchor

    Programmatic context augmentation lets LLM-based symbolic regression perform code-driven data analysis during search, yielding superior efficiency and accuracy over baselines on LLM-SRBench.

  • ASP-Assisted Symbolic Regression: Uncovering Hidden Physics in Fluid Mechanics cs.AI · 2025-07-22 · unverdicted · none · ref 8 · internal anchor

    A hybrid symbolic regression and answer set programming framework derives compact, physically plausible equations for velocity and pressure in 3D laminar channel flow from simulation data.