arxiv: 2602.19967 · v3 · submitted 2026-02-23 · 💻 cs.LG

Unlearning Noise in PINNs: A Selective Pruning Framework for PDE Inverse Problems

Yongsheng Chen , Yong Chen , Wei Guo , Xinghui Zhong This is my paper

Pith reviewed 2026-05-15 20:34 UTC · model grok-4.3

classification 💻 cs.LG

keywords PINNsPDE inverse problemsnoise robustnessselective pruningmachine unlearningphysics-informed learningnetwork pruning

0 comments p. Extension

The pith

P-PINN prunes neurons driven by corrupted observations to recover accurate solutions for noisy PDE inverse problems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes P-PINN as a post-training pruning procedure that starts from a PINN trained on the full noisy dataset and removes units whose internal representations are dominated by corrupted samples. It first partitions observations into reliable and corrupted subsets by scoring each point with a weighted sum of data misfit and PDE residual. A bias-based importance score then measures how much each neuron's activation direction differs between the two subsets, allowing iterative removal of the most noise-sensitive neurons layer by layer. The pruned network is finally fine-tuned only on the reliable subset while still enforcing the original PDE constraints. A reader would care because even modest noise fractions can collapse standard PINN training on inverse problems, and the method offers a lightweight repair that avoids full retraining.

Core claim

Starting from a pretrained PINN on the full dataset, P-PINN uses a joint residual-data fidelity indicator to split samples, applies a bias-based neuron importance measure to detect units driven by the corrupted subset, iteratively prunes those neurons layer by layer, and fine-tunes the resulting network on reliable data under the original PDE constraints, yielding up to a 96.6 percent reduction in relative error on PDE inverse-problem benchmarks compared with baseline PINNs.

What carries the argument

The bias-based neuron importance measure, which computes directional activation discrepancies between reliable and corrupted data subsets to rank and remove noise-sensitive neurons layer by layer before fine-tuning.

If this is right

Training stability increases because the pruned network no longer fits corrupted observations during fine-tuning.
Parameter recovery accuracy rises on inverse problems where baseline PINNs diverge under modest noise.
The procedure functions as a cheap post-processing step rather than requiring retraining from random initialization.
Robustness gains hold across multiple PDE types provided the partition step isolates noise effectively.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same partition-and-prune logic could be tested on forward PDE solvers or other physics-informed architectures if the fidelity indicator generalizes beyond inverse settings.
If the bias measure proves stable across network depths, it might supply a general tool for cleaning label noise in supervised learning without task-specific physics constraints.
Real sensor datasets with unknown corruption patterns would test whether the method needs a trusted clean subset or can run with only the indicator score.

Load-bearing premise

The joint residual-data fidelity indicator can reliably separate reliable observations from corrupted ones, and the bias measure correctly identifies neurons whose representations are predominantly shaped by the corrupted subset.

What would settle it

Apply P-PINN to a standard PDE inverse benchmark with 5-20 percent added noise; if the final relative error on the target parameters or fields stays comparable to or higher than a baseline PINN trained on the same data, the pruning mechanism has not delivered the claimed improvement.

read the original abstract

Physics-informed neural networks (PINNs) provide a promising framework for solving inverse problems governed by partial differential equations (PDEs) by integrating observational data and physical constraints in a unified optimization objective. However, the ill-posed nature of PDE inverse problems makes them highly sensitive to noise. Even a small fraction of corrupted observations can distort internal neural representations, severely impairing accuracy and destabilizing training. Motivated by recent advances in machine unlearning and structured network pruning, we propose P-PINN, a selective pruning framework designed to unlearn the influence of corrupted data in a pretrained PINN. Specifically, starting from a PINN trained on the full dataset, P-PINN evaluates a joint residual--data fidelity indicator, a weighted combination of data misfit and PDE residuals, to partition the training set into reliable and corrupted subsets. Next, we introduce a bias-based neuron importance measure that quantifies directional activation discrepancies between the two subsets, identifying neurons whose representations are predominantly driven by corrupted samples. Building on this, an iterative pruning strategy then removes noise-sensitive neurons layer by layer. The resulting pruned network is fine-tuned on the reliable data subject to the original PDE constraints, acting as a lightweight post-processing stage rather than a complete retraining. Numerical experiments on extensive PDE inverse-problem benchmarks demonstrate that P-PINN substantially improves robustness, accuracy, and training stability under noisy conditions, achieving up to a 96.6% reduction in relative error compared with baseline PINNs. These results indicate that activation-level post hoc pruning is a promising mechanism for enhancing the reliability of physics-informed learning in noise-contaminated settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

P-PINN adds a post-training pruning step to PINNs for noisy PDE inverses using a bias-based neuron score, but the initial partition into clean and corrupted data rests on an unverified assumption that may break when the network has already fit the noise.

read the letter

The paper's core contribution is a lightweight post-processing routine for pretrained PINNs. After training on the full noisy set, it scores points with a weighted residual-plus-data-misfit indicator, splits them into reliable and corrupted subsets, then prunes neurons whose activations differ most between the two subsets before fine-tuning on the reliable part only. That construction is new in this setting and avoids full retraining, which is a practical plus for inverse problems where data are expensive to re-collect or re-simulate.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes P-PINN, a post-processing framework for PINNs solving PDE inverse problems under noisy data. Starting from a PINN pretrained on the full dataset, it computes a joint residual-data fidelity indicator to partition observations into reliable and corrupted subsets, applies a bias-based neuron importance score to identify and iteratively prune neurons whose activations are driven by the corrupted subset, and fine-tunes the resulting sparse network on the reliable data while enforcing the original PDE constraints. Numerical experiments on multiple inverse-problem benchmarks are reported to yield up to a 96.6% reduction in relative error versus standard PINNs.

Significance. If the partitioning and pruning steps are shown to be robust, the method supplies a lightweight, activation-level unlearning technique that avoids full retraining and could improve the practical reliability of PINNs in ill-posed inverse settings where data corruption is common. The work usefully imports structured pruning ideas from machine unlearning into the physics-informed setting and supplies concrete benchmark comparisons.

major comments (3)

[§3.1 (joint residual-data fidelity indicator)] The central claim that the joint residual-data fidelity indicator reliably recovers the corrupted subset is load-bearing for the entire pipeline, yet the manuscript provides no verification (e.g., via synthetic ground-truth partitions or controlled overfitting experiments) that residuals remain systematically larger on corrupted points once the initial PINN has fitted the noise; in ill-posed inverse problems this assumption can fail and directly invalidates the subsequent pruning mask.
[§3.2 (bias-based neuron importance)] The bias-based neuron importance measure is defined via directional activation discrepancies between the two subsets, but the paper does not show that this quantity is independent of the partitioning step or that it avoids circular selection; without an ablation that isolates the measure from the indicator, the reported error reductions cannot be attributed unambiguously to the pruning mechanism.
[§4 (numerical experiments)] Table 2 and the associated figures report up to 96.6% relative-error reduction, yet no error bars, statistical significance tests, or ablation on the weighting factor in the joint indicator are supplied; the absence of these controls leaves open the possibility that the gains arise from post-hoc subset selection rather than the proposed unlearning procedure.

minor comments (2)

[§4] The abstract and §4 refer to 'extensive PDE inverse-problem benchmarks' without listing the exact PDEs, noise levels, or data-split ratios in the main text; these details should be tabulated for reproducibility.
[§3.1] Notation for the joint indicator (e.g., the weighting hyper-parameter) is introduced without an explicit equation number or sensitivity analysis; a short appendix deriving its form would improve clarity.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the insightful comments, which have helped us identify areas to improve the clarity and rigor of our work. We will incorporate the suggested additions and revisions to address the concerns regarding verification of the partitioning, ablations for the pruning mechanism, and statistical analysis in the experiments.

read point-by-point responses

Referee: [§3.1 (joint residual-data fidelity indicator)] The central claim that the joint residual-data fidelity indicator reliably recovers the corrupted subset is load-bearing for the entire pipeline, yet the manuscript provides no verification (e.g., via synthetic ground-truth partitions or controlled overfitting experiments) that residuals remain systematically larger on corrupted points once the initial PINN has fitted the noise; in ill-posed inverse problems this assumption can fail and directly invalidates the subsequent pruning mask.

Authors: We thank the referee for highlighting this critical aspect. While the joint indicator is motivated by the expectation that corrupted points will exhibit larger residuals after fitting, we agree that explicit verification is necessary to confirm its reliability. In the revised version, we will add experiments with synthetic ground-truth partitions on the benchmark problems, showing the distribution of indicator values on reliable vs. corrupted subsets. Additionally, we will include controlled experiments where we overfit the PINN to noisy data and measure the separation in residuals. These additions will substantiate the partitioning step and address potential failures in ill-posed settings. revision: yes
Referee: [§3.2 (bias-based neuron importance)] The bias-based neuron importance measure is defined via directional activation discrepancies between the two subsets, but the paper does not show that this quantity is independent of the partitioning step or that it avoids circular selection; without an ablation that isolates the measure from the indicator, the reported error reductions cannot be attributed unambiguously to the pruning mechanism.

Authors: We appreciate this point regarding potential circularity. The bias-based measure is computed after partitioning, but to demonstrate its contribution independently, we will include an ablation study in the revision. Specifically, we will compare the full P-PINN pipeline against a variant that uses random pruning or pruning based solely on the partitioning without the bias measure. This will help isolate the effect of the neuron importance score and clarify that the error reductions stem from the selective pruning rather than the subset selection alone. revision: yes
Referee: [§4 (numerical experiments)] Table 2 and the associated figures report up to 96.6% relative-error reduction, yet no error bars, statistical significance tests, or ablation on the weighting factor in the joint indicator are supplied; the absence of these controls leaves open the possibility that the gains arise from post-hoc subset selection rather than the proposed unlearning procedure.

Authors: We agree that additional statistical rigor and ablations would strengthen the experimental section. In the revised manuscript, we will report error bars based on multiple random seeds for all experiments in Table 2 and the figures. We will also perform statistical significance tests (e.g., paired t-tests) to confirm the improvements are significant. Furthermore, we will add an ablation study varying the weighting factor in the joint indicator across a range of values and show its impact on performance, demonstrating robustness to this hyperparameter and that the gains are not solely due to subset selection. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper introduces a new selective pruning framework (P-PINN) whose core components—a joint residual-data fidelity indicator for subset partitioning and a bias-based neuron importance measure—are defined directly from the outputs of a pretrained PINN and the PDE residual terms. These quantities are not shown to reduce by construction to any fitted parameters or prior results internal to the paper. The central performance claims rest on external numerical benchmarks rather than any self-referential derivation or self-citation chain, rendering the method self-contained.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the unproven premise that a weighted residual-data indicator cleanly separates corrupted from reliable points and that activation bias between those subsets isolates noise-driven neurons without discarding physics information.

free parameters (1)

weighting factor in joint residual-data fidelity indicator
Controls the partition of training points into reliable versus corrupted subsets; its value is not derived from first principles.

axioms (1)

domain assumption Neurons can be classified as noise-sensitive based on directional activation discrepancies between reliable and corrupted data subsets.
This premise underpins the bias-based importance measure and the subsequent pruning decisions.

pith-pipeline@v0.9.0 · 5596 in / 1407 out tokens · 55325 ms · 2026-05-15T20:34:18.814226+00:00 · methodology