hub

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

· 2024 · arXiv 2404.18400

24 Pith papers cite this work. Polarity classification is still indexing.

24 Pith papers citing it

read on arXiv browse 24 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 2 baseline 1 dataset 1

citation-polarity summary

background 3 baseline 1

representative citing papers

LLM-Guided ODE Discovery and Parameter Inference from Small-Cohort Aggregate Data

cs.LG · 2026-07-01 · unverdicted · novelty 7.0

AgentODE uses LLMs to discover ODE structures and infer parameter distributions from aggregate data, recovering consistent structures on benchmarks and RDEB clinical data with 231 observations from 46 patients.

A Novel Method for Differential-Algebraic Dynamic Model Discovery in Power Systems: An LLM-Based Multi-Agent Collaborative Framework

eess.SY · 2026-06-30 · unverdicted · novelty 7.0

An LLM multi-agent framework decomposes differential-algebraic model discovery into parallel structure search and algebraic closure, recovering state dynamics and constraints from data and outperforming single-agent LLM and symbolic regression baselines on generator and inverter cases.

Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks

cs.LG · 2026-06-18 · unverdicted · novelty 7.0

ASYS recovers known analytical PDE forms and generates new interpretable symbolic approximations, such as a geometric interface formula for 2D Allen-Cahn and a nine-parameter contraction law for Keller-Segel blow-up, via agent-guided evolutionary search on differentiable programs.

FunctionEvolve: Structure-Guided Symbolic Regression with LLMs

cs.LG · 2026-06-05 · unverdicted · novelty 7.0

FunctionEvolve recovers 107 exact symbolic forms out of 129 synthetic tasks (82.9% SA@50) by using expression-tree structure for evolutionary search, parent selection, mutation, and coefficient scoring with LLMs.

Learning First Integrals via Backward-Generated Data and Guided Reinforcement Learning

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

FISolver trains a compact LLM on backward-generated (differential equation, first integral) pairs and uses guided reinforcement learning to outperform larger models and Mathematica on first-integral benchmarks at lower cost.

SMCEvolve: Principled Scientific Discovery via Sequential Monte Carlo Evolution

cs.AI · 2026-05-14 · unverdicted · novelty 7.0

SMCEvolve applies Sequential Monte Carlo sampling to LLM program search with adaptive resampling, mutation mixtures, and convergence control, delivering finite-sample complexity bounds and benchmark gains over prior systems.

Neuro-Symbolic ODE Discovery with Latent Grammar Flow

cs.LG · 2026-04-17 · unverdicted · novelty 7.0

Latent Grammar Flow discovers ODEs by placing grammar-based equation representations in a discrete latent space, using a behavioral loss to cluster similar equations, and sampling via a discrete flow model guided by data fit and constraints.

Retrieval-Augmented Large Language Models for Evidence-Informed Guidance on Cannabidiol Use in Older Adults

cs.IR · 2026-01-16 · unverdicted · novelty 7.0

Retrieval-augmented LLMs produce more cautious and guideline-aligned recommendations on cannabidiol for older adults than standalone models, demonstrated via automated evaluation on 64 diverse scenarios.

EditSR: Enhancing Neural Symbolic Regression via Edit-based Rectification

cs.AI · 2026-06-06 · unverdicted · novelty 6.0

EditSR improves neural symbolic regression accuracy on complex expressions by pretraining an edit-based rectifier on state-transition correction chains that enforce syntactic validity and condition edits only on the current expression state.

Programmatic Context Augmentation for LLM-based Symbolic Regression

cs.AI · 2026-05-04 · unverdicted · novelty 6.0

Programmatic context augmentation lets LLM-based symbolic regression perform code-driven data analysis during search, yielding superior efficiency and accuracy over baselines on LLM-SRBench.

From Data to Theory: Autonomous Large Language Model Agents for Materials Science

cs.AI · 2026-04-01 · unverdicted · novelty 6.0

An LLM agent autonomously selects, codes, and validates materials equations from data, recovering known laws reliably but requiring checks for new or specialized cases.

FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data

cs.AI · 2025-10-29 · unverdicted · novelty 6.0

FELA deploys specialized LLM agents in an evolutionary framework to generate, validate, and refine explainable features from heterogeneous industrial event logs, improving downstream model performance.

LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers

cs.LG · 2025-03-18 · unverdicted · novelty 6.0

LLM-FE is a framework that treats feature engineering as LLM-driven program search with data feedback, reporting consistent gains over baselines on classification and regression tabular tasks.

Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization

cs.LG · 2026-06-29 · unverdicted · novelty 5.0

MCO-PDE trains per-dataset neural surrogates, applies soft-competitive weighting for consensus coefficients, and uses a genetic algorithm to identify shared PDE structures from multi-source data.

Physics-Distilled Neural Network enabled by Large Language Models for Manufacturing Process-Property Predictive Modeling

cs.LG · 2026-06-10 · unverdicted · novelty 5.0

A framework extracts physics priors via LLMs, distills them through a Graph-Masked Attention teacher into a fast student model, and shows high accuracy plus fault tolerance across five manufacturing processes.

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

cs.CL · 2026-05-29 · unverdicted · novelty 5.0

EvoGens uses rank-based mutation, semantic-aware crossover, and lightweight evaluation to evolve populations of LLM-generated scientific ideas, boosting novelty and diversity metrics.

GP-GOMEA with GPU-Based Fitness Evaluations: Design and Performance Analysis

cs.NE · 2026-05-29 · unverdicted · novelty 5.0

GPU fitness evaluation for GP-GOMEA boosts throughput, improves benchmark results especially on large datasets, and allows reliable regression of large Feynman equations within hours.

PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

PACEvolve++ uses a phase-adaptive reinforcement learning advisor to decouple hypothesis selection from execution in LLM-driven evolutionary search, delivering faster convergence than prior frameworks on load balancing, recommendation, and protein tasks.

In Context Learning and Reasoning for Symbolic Regression with Large Language Models

cs.CL · 2024-10-22 · unverdicted · novelty 5.0

GPT-4 models rediscover Langmuir isotherms and produce fits on Nikuradse pipe-flow data via iterative chain-of-thought prompting with scientific context and external code feedback.

MOOSE-Copilot: A Web-Based Interactive Assistant for Unified Exploratory and Fine-Grained Scientific Hypothesis Discovery

cs.CL · 2026-05-28 · unverdicted · novelty 4.0

MOOSE-Copilot introduces a unified HAII framework and no-code web interface for LLM-driven scientific hypothesis discovery that integrates exploratory search with fine-grained refinement via user-provided blueprints, routing, and feedback.

Time Series Forecasting Through the Lens of Dynamics

cs.LG · 2025-07-21 · unverdicted · novelty 4.0

Proposes dynamics-based analysis of time series models showing partial dynamics learning and end-positioning as key to performance, plus a plug-and-play improvement method.

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

cs.AI · 2025-01-16 · unverdicted · novelty 3.0

The paper surveys reinforced reasoning techniques for LLMs, covering automated data construction, learning-to-reason methods, and test-time scaling as steps toward Large Reasoning Models.

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

cs.CL · 2025-02-05 · unverdicted · novelty 2.0

Position paper claims multimodal LLMs can significantly advance scientific reasoning and proposes a four-stage roadmap plus challenges and suggestions.

Identifying Topological Invariants of Non-Hermitian Systems via Domain-Adaptive Multimodal Model for Mathematics

cond-mat.other · 2026-04-08 · 2 refs

citing papers explorer

Showing 24 of 24 citing papers.

LLM-Guided ODE Discovery and Parameter Inference from Small-Cohort Aggregate Data cs.LG · 2026-07-01 · unverdicted · none · ref 5
AgentODE uses LLMs to discover ODE structures and infer parameter distributions from aggregate data, recovering consistent structures on benchmarks and RDEB clinical data with 231 observations from 46 patients.
A Novel Method for Differential-Algebraic Dynamic Model Discovery in Power Systems: An LLM-Based Multi-Agent Collaborative Framework eess.SY · 2026-06-30 · unverdicted · none · ref 19
An LLM multi-agent framework decomposes differential-algebraic model discovery into parallel structure search and algebraic closure, recovering state dynamics and constraints from data and outperforming single-agent LLM and symbolic regression baselines on generator and inverter cases.
Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks cs.LG · 2026-06-18 · unverdicted · none · ref 7
ASYS recovers known analytical PDE forms and generates new interpretable symbolic approximations, such as a geometric interface formula for 2D Allen-Cahn and a nine-parameter contraction law for Keller-Segel blow-up, via agent-guided evolutionary search on differentiable programs.
FunctionEvolve: Structure-Guided Symbolic Regression with LLMs cs.LG · 2026-06-05 · unverdicted · none · ref 58
FunctionEvolve recovers 107 exact symbolic forms out of 129 synthetic tasks (82.9% SA@50) by using expression-tree structure for evolutionary search, parent selection, mutation, and coefficient scoring with LLMs.
Learning First Integrals via Backward-Generated Data and Guided Reinforcement Learning cs.LG · 2026-05-20 · unverdicted · none · ref 31
FISolver trains a compact LLM on backward-generated (differential equation, first integral) pairs and uses guided reinforcement learning to outperform larger models and Mathematica on first-integral benchmarks at lower cost.
SMCEvolve: Principled Scientific Discovery via Sequential Monte Carlo Evolution cs.AI · 2026-05-14 · unverdicted · none · ref 7
SMCEvolve applies Sequential Monte Carlo sampling to LLM program search with adaptive resampling, mutation mixtures, and convergence control, delivering finite-sample complexity bounds and benchmark gains over prior systems.
Neuro-Symbolic ODE Discovery with Latent Grammar Flow cs.LG · 2026-04-17 · unverdicted · none · ref 27
Latent Grammar Flow discovers ODEs by placing grammar-based equation representations in a discrete latent space, using a behavioral loss to cluster similar equations, and sampling via a discrete flow model guided by data fit and constraints.
Retrieval-Augmented Large Language Models for Evidence-Informed Guidance on Cannabidiol Use in Older Adults cs.IR · 2026-01-16 · unverdicted · none · ref 71
Retrieval-augmented LLMs produce more cautious and guideline-aligned recommendations on cannabidiol for older adults than standalone models, demonstrated via automated evaluation on 64 diverse scenarios.
EditSR: Enhancing Neural Symbolic Regression via Edit-based Rectification cs.AI · 2026-06-06 · unverdicted · none · ref 52
EditSR improves neural symbolic regression accuracy on complex expressions by pretraining an edit-based rectifier on state-transition correction chains that enforce syntactic validity and condition edits only on the current expression state.
Programmatic Context Augmentation for LLM-based Symbolic Regression cs.AI · 2026-05-04 · unverdicted · none · ref 3
Programmatic context augmentation lets LLM-based symbolic regression perform code-driven data analysis during search, yielding superior efficiency and accuracy over baselines on LLM-SRBench.
From Data to Theory: Autonomous Large Language Model Agents for Materials Science cs.AI · 2026-04-01 · unverdicted · none · ref 33
An LLM agent autonomously selects, codes, and validates materials equations from data, recovering known laws reliably but requiring checks for new or specialized cases.
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data cs.AI · 2025-10-29 · unverdicted · none · ref 45
FELA deploys specialized LLM agents in an evolutionary framework to generate, validate, and refine explainable features from heterogeneous industrial event logs, improving downstream model performance.
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers cs.LG · 2025-03-18 · unverdicted · none · ref 36
LLM-FE is a framework that treats feature engineering as LLM-driven program search with data feedback, reporting consistent gains over baselines on classification and regression tabular tasks.
Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization cs.LG · 2026-06-29 · unverdicted · none · ref 22
MCO-PDE trains per-dataset neural surrogates, applies soft-competitive weighting for consensus coefficients, and uses a genetic algorithm to identify shared PDE structures from multi-source data.
Physics-Distilled Neural Network enabled by Large Language Models for Manufacturing Process-Property Predictive Modeling cs.LG · 2026-06-10 · unverdicted · none · ref 29
A framework extracts physics priors via LLMs, distills them through a Graph-Masked Attention teacher into a fast student model, and shows high accuracy plus fault tolerance across five manufacturing processes.
EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation cs.CL · 2026-05-29 · unverdicted · none · ref 37
EvoGens uses rank-based mutation, semantic-aware crossover, and lightweight evaluation to evolve populations of LLM-generated scientific ideas, boosting novelty and diversity metrics.
GP-GOMEA with GPU-Based Fitness Evaluations: Design and Performance Analysis cs.NE · 2026-05-29 · unverdicted · none · ref 19
GPU fitness evaluation for GP-GOMEA boosts throughput, improves benchmark results especially on large datasets, and allows reliable regression of large Feynman equations within hours.
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents cs.LG · 2026-05-07 · unverdicted · none · ref 35
PACEvolve++ uses a phase-adaptive reinforcement learning advisor to decouple hypothesis selection from execution in LLM-driven evolutionary search, delivering faster convergence than prior frameworks on load balancing, recommendation, and protein tasks.
In Context Learning and Reasoning for Symbolic Regression with Large Language Models cs.CL · 2024-10-22 · unverdicted · none · ref 37
GPT-4 models rediscover Langmuir isotherms and produce fits on Nikuradse pipe-flow data via iterative chain-of-thought prompting with scientific context and external code feedback.
MOOSE-Copilot: A Web-Based Interactive Assistant for Unified Exploratory and Fine-Grained Scientific Hypothesis Discovery cs.CL · 2026-05-28 · unverdicted · none · ref 2
MOOSE-Copilot introduces a unified HAII framework and no-code web interface for LLM-driven scientific hypothesis discovery that integrates exploratory search with fine-grained refinement via user-provided blueprints, routing, and feedback.
Time Series Forecasting Through the Lens of Dynamics cs.LG · 2025-07-21 · unverdicted · none · ref 32
Proposes dynamics-based analysis of time series models showing partial dynamics learning and end-positioning as key to performance, plus a plug-and-play improvement method.
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models cs.AI · 2025-01-16 · unverdicted · none · ref 132
The paper surveys reinforced reasoning techniques for LLMs, covering automated data construction, learning-to-reason methods, and test-time scaling as steps toward Large Reasoning Models.
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning cs.CL · 2025-02-05 · unverdicted · none · ref 169
Position paper claims multimodal LLMs can significantly advance scientific reasoning and proposes a four-stage roadmap plus challenges and suggestions.
Identifying Topological Invariants of Non-Hermitian Systems via Domain-Adaptive Multimodal Model for Mathematics cond-mat.other · 2026-04-08 · unreviewed · ref 70 · 2 links

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer