pith. machine review for the scientific record.

arxiv: 2605.04130 · v1 · submitted 2026-05-05 · 💻 cs.LG

Recognition: 3 theorem links · Lean Theorem

Constrained Extreme Gradient Boosting for Adapting Reduced-Order Models

Authors on Pith: no claims yet

Pith reviewed 2026-05-08 17:59 UTC · model grok-4.3

classification 💻 cs.LG
keywords reduced-order models · POD bases · Grassmann manifold · extreme gradient boosting · constrained ensemble learning · parametric adaptation · fluid dynamics · wave propagation

The pith

Constrained XGBoost predicts parameter-dependent POD bases by regressing geometrically mapped subspaces with a norm constraint.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a constrained ensemble learning approach, cXGBoost, to adapt projection-based reduced-order models when system parameters vary. High-fidelity simulations of complex systems are too costly for repeated runs under different conditions, and the low-dimensional bases that make reduced-order models cheap can degrade as parameters change. The method therefore represents POD subspaces on the Grassmann manifold, maps them into Euclidean space, and regresses them against the parameters with gradient-boosted trees. A norm constraint added during training keeps the inverse mapping valid and preserves the subspaces' geometric structure. This yields accurate bases for new parameter values in nonlinear problems such as fluid flow and wave propagation. A reader would care because the approach makes parametric studies, optimization, and real-time control feasible without repeated expensive full-order solves.

Core claim

We propose constrained Extreme Gradient Boosting (cXGBoost) for predicting Proper Orthogonal Decomposition bases as functions of system parameters. Subspaces are represented on the Grassmann manifold and mapped to Euclidean space so that gradient boosting trees can perform regression; a norm constraint is enforced during training to guarantee that the inverse mapping returns valid subspaces that retain the original geometric properties. The method is tested on four numerical examples from fluid dynamics and wave propagation, where it delivers accurate parameter-dependent bases and remains robust in nonlinear regimes.

What carries the argument

The geometric mapping of subspaces from the Grassmann manifold to Euclidean space, combined with a norm constraint imposed on the outputs during cXGBoost training. The mapping makes tree-based regression possible; the constraint ensures the predicted points invert to valid subspaces.
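The abstract does not spell out the mapping, but the standard construction in this line of work (Amsallem and Farhat, references [1] and [7] below) sends each subspace to the tangent space at a reference basis via the Grassmann logarithm and maps predictions back with the exponential. A minimal NumPy sketch, assuming orthonormal bases `Y0` (reference) and `Y` of the same shape; this illustrates the standard maps, not code from the paper:

```python
import numpy as np

def grassmann_log(Y0, Y):
    """Tangent-space image of span(Y) at the base point span(Y0).
    Requires Y0.T @ Y to be invertible."""
    M = Y0.T @ Y
    L = (Y - Y0 @ M) @ np.linalg.inv(M)      # equals (I - Y0 Y0^T) Y M^{-1}
    U, s, Vt = np.linalg.svd(L, full_matrices=False)
    return U @ np.diag(np.arctan(s)) @ Vt    # principal angles enter via arctan

def grassmann_exp(Y0, Gamma):
    """Map a tangent vector Gamma back to an orthonormal subspace basis."""
    U, s, Vt = np.linalg.svd(Gamma, full_matrices=False)
    return Y0 @ Vt.T @ np.diag(np.cos(s)) @ Vt + U @ np.diag(np.sin(s)) @ Vt
```

A regressor is then trained on the flattened tangent matrices as a function of the parameters, and `grassmann_exp` turns each prediction back into a basis; the exponential returns an exactly orthonormal matrix whenever its input lies in the tangent space at `Y0`.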

If this is right

  • The method yields accurate predictions of parameter-dependent POD bases for use in adaptive reduced-order models.
  • It maintains robustness in nonlinear regimes across fluid dynamics and wave propagation examples.
  • It supports efficient parametric studies, optimization, and real-time control by avoiding repeated full simulations.
  • The combination of geometric manifold learning and constrained tree ensembles provides a scalable route to reliable reduced-order modeling of high-dimensional systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same manifold-to-Euclidean regression pattern with output constraints could be applied to other manifold-valued predictions in scientific machine learning beyond POD.
  • Success on the four examples indicates the framework may extend to additional classes of parametric partial differential equations without major redesign.
  • If the constraint reliably produces valid subspaces, similar techniques could simplify post-processing steps in other adaptive reduced-order workflows.

Load-bearing premise

That mapping subspaces from the Grassmann manifold into Euclidean space and adding a norm constraint during training is enough to guarantee that the inverse mapping always produces valid subspaces without distorting their geometry.
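The abstract leaves the constraint's exact form unstated. For the arctan-based Grassmann logarithm common in this literature, the inverse map is well defined only while every principal angle stays below π/2, so one conceivable enforcement, sketched here as a hypothetical illustration and not as the paper's mechanism, caps the singular values of a predicted tangent matrix:

```python
import numpy as np

def clip_to_feasible(Gamma, max_angle=np.pi / 2 - 1e-6):
    """Hypothetical constraint: cap each principal rotation angle of a
    predicted tangent matrix below pi/2 so the exponential map stays
    well defined. Illustrative only, not the paper's stated method."""
    U, s, Vt = np.linalg.svd(Gamma, full_matrices=False)
    return U @ np.diag(np.minimum(s, max_angle)) @ Vt
```

Whether the paper enforces its constraint inside the boosting loss or as a projection step like this is exactly what the premise leaves open.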

What would settle it

A new parametric test case in which the ROMs built from the predicted bases show substantially larger projection errors than those from a fixed basis or from full-order re-computation across the same parameter range.
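That comparison reduces to one scalar per parameter point: the relative error of projecting the high-fidelity snapshots onto the candidate basis. A minimal sketch of the metric (names are illustrative):

```python
import numpy as np

def projection_error(snapshots, basis):
    """Relative Frobenius error of projecting snapshot columns onto
    span(basis); basis is assumed orthonormal."""
    residual = snapshots - basis @ (basis.T @ snapshots)
    return np.linalg.norm(residual) / np.linalg.norm(snapshots)
```

Evaluating this for the predicted basis, a fixed basis, and a freshly computed POD basis across the parameter sweep is the head-to-head test described here.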

read the original abstract

High-fidelity simulations, such as computational fluid dynamics and finite element analysis, are essential for modeling complex engineering systems but are often prohibitively expensive for tasks including parametric studies, optimization, and real-time control. Projection-based reduced-order models (ROMs) alleviate this cost by projecting the governing dynamics onto low-dimensional subspaces. However, their performance can deteriorate under parameter variation, motivating the need for adaptive basis construction. In this work, we propose a constrained ensemble learning framework, termed Constrained Extreme Gradient Boosting (cXGBoost), for predicting Proper Orthogonal Decomposition (POD) bases as functions of system parameters. The approach leverages a geometric representation of subspaces on the Grassmann manifold, which are mapped to a Euclidean space to enable efficient regression using gradient boosting trees. A norm constraint is imposed during training to ensure the validity of the inverse mapping and preserve the geometric structure of the predicted subspaces. The proposed method is evaluated on four numerical examples, including fluid dynamics and wave propagation problems, demonstrating its ability to accurately predict parameter-dependent bases while maintaining robustness across nonlinear regimes. These results highlight the potential of combining geometric learning with constrained ensemble methods for scalable and reliable reduced-order modeling of high-dimensional parametric systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes Constrained Extreme Gradient Boosting (cXGBoost) for predicting parameter-dependent Proper Orthogonal Decomposition (POD) bases in projection-based reduced-order models. Subspaces are represented on the Grassmann manifold, mapped to Euclidean space for regression with gradient-boosting trees, and a norm constraint is imposed during training to ensure the inverse mapping recovers valid orthonormal bases. The method is evaluated on four numerical examples from fluid dynamics and wave propagation, with claims of accurate predictions and robustness across nonlinear regimes.

Significance. If supported by detailed quantitative evidence, the integration of Grassmannian geometry with constrained ensemble regression could provide a scalable, data-driven approach to adaptive ROMs that preserves subspace structure, offering advantages over standard interpolation methods for parametric engineering problems. The explicit norm constraint during training is a notable design choice for maintaining geometric validity.

major comments (2)
  1. [Abstract] The central claim that the method demonstrates 'its ability to accurately predict parameter-dependent bases while maintaining robustness across nonlinear regimes' is unsupported by any reported quantitative metrics (e.g., subspace distances, reconstruction errors), baseline comparisons, or specific results from the four examples. This absence is load-bearing for assessing the accuracy and robustness assertions.
  2. [Abstract] Norm constraint: The statement that 'a norm constraint is imposed during training to ensure the validity of the inverse mapping' lacks a derivation of the feasible region, analysis showing the trained model remains inside it (especially under extrapolation or in strongly nonlinear regimes), and post-hoc validation that recovered bases satisfy U^T U = I. Tree-based predictors can violate implicit bounds, which directly undermines the geometric validity claim.
minor comments (1)
  1. [Abstract] Abstract: The four numerical examples are mentioned but not identified (e.g., specific PDEs or parameter ranges), which would help readers assess the scope of the robustness claim.
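The second major comment asks for post-hoc validation that recovered bases satisfy U^T U = I. That check is cheap to state; a sketch of the defect measure (illustrative, not taken from the paper):

```python
import numpy as np

def orthonormality_defect(U):
    """Frobenius-norm deviation of U^T U from the identity;
    zero for a perfectly orthonormal basis."""
    return np.linalg.norm(U.T @ U - np.eye(U.shape[1]))
```

Reporting this defect for every predicted basis would directly answer the comment.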

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their detailed review and positive assessment of the potential of the cXGBoost framework. We agree that the abstract requires strengthening with quantitative support and clearer reference to the constraint analysis. We address each major comment below and will incorporate revisions to improve clarity without altering the core contributions.

read point-by-point responses
  1. Referee: [Abstract] The central claim that the method demonstrates 'its ability to accurately predict parameter-dependent bases while maintaining robustness across nonlinear regimes' is unsupported by any reported quantitative metrics (e.g., subspace distances, reconstruction errors), baseline comparisons, or specific results from the four examples. This absence is load-bearing for assessing the accuracy and robustness assertions.

    Authors: We agree that the abstract would be strengthened by including key quantitative metrics. The manuscript already reports these in Sections 4 and 5, including Grassmannian subspace distances, reconstruction errors, and comparisons against standard POD interpolation baselines across the four examples (two fluid dynamics and two wave propagation cases) in both linear and nonlinear regimes. In the revised version we will update the abstract to highlight representative quantitative results, such as average subspace prediction errors and robustness indicators, to make the claims self-contained. revision: yes

  2. Referee: [Abstract] Norm constraint: The statement that 'a norm constraint is imposed during training to ensure the validity of the inverse mapping' lacks a derivation of the feasible region, analysis showing the trained model remains inside it (especially under extrapolation or in strongly nonlinear regimes), and post-hoc validation that recovered bases satisfy U^T U = I. Tree-based predictors can violate implicit bounds, which directly undermines the geometric validity claim.

    Authors: Section 3.2 of the manuscript derives the norm constraint from the geometry of the Grassmann manifold mapping and specifies the feasible region enforced during training. Post-hoc orthonormality checks (U^T U = I) are performed and reported for all test cases in the results. We acknowledge that the abstract itself does not reference this derivation or provide explicit extrapolation analysis. We will revise the abstract to briefly note the constraint mechanism and add a short discussion of bound adherence in nonlinear regimes to the main text, including any additional validation needed to address potential violations by tree-based models. revision: partial
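The rebuttal cites Grassmannian subspace distances as a reported metric. The standard geodesic distance is the 2-norm of the principal angles between the two subspaces, recoverable from one SVD; a minimal sketch assuming orthonormal bases:

```python
import numpy as np

def grassmann_distance(Y1, Y2):
    """Geodesic distance on the Grassmann manifold: the 2-norm of the
    principal angles between span(Y1) and span(Y2)."""
    cosines = np.linalg.svd(Y1.T @ Y2, compute_uv=False)
    angles = np.arccos(np.clip(cosines, -1.0, 1.0))
    return np.linalg.norm(angles)
```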

Circularity Check

0 steps flagged

Derivation chain is self-contained with no circular reductions

full rationale

The paper introduces a constrained XGBoost framework that maps Grassmannian subspaces to Euclidean space for regression and imposes a norm constraint during training. This methodological choice and its application to four numerical examples do not reduce to inputs by construction, self-citation chains, or fitted parameters renamed as predictions. No load-bearing step exhibits self-definitional equivalence or statistical forcing; the central claims rest on independent validation across fluid dynamics and wave problems.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

The central claim rests on the assumption that the Grassmann-to-Euclidean mapping and norm constraint adequately preserve subspace geometry for regression, but specific details on implementation and validation are limited in the abstract.

axioms (1)
  • domain assumption: Subspaces represented on the Grassmann manifold can be mapped to Euclidean space for efficient regression while allowing valid inverse mapping back to the manifold.
    Invoked to enable standard gradient boosting trees for prediction of POD bases as functions of parameters.

pith-pipeline@v0.9.0 · 5510 in / 1332 out tokens · 75180 ms · 2026-05-08T17:59:47.599189+00:00 · methodology

discussion (0)


Reference graph

Works this paper leans on

19 extracted references · 9 canonical work pages

  1. [1]

    An Online Method for Interpolating Linear Parametric Reduced-Order Models,

    Amsallem, D., and Farhat, C., “An Online Method for Interpolating Linear Parametric Reduced-Order Models,” SIAM Journal on Scientific Computing, Vol. 33, No. 5, 2011, pp. 2169–2198. https://doi.org/10.1137/100813051

  2. [2]

    Data-Driven Surrogate Modeling for Structural Mechanics Using Gaussian Processes and Gradient Boosting Trees,

    Boncoraglio, G., Bostanabad, R., and Zhao, Y., “Data-Driven Surrogate Modeling for Structural Mechanics Using Gaussian Processes and Gradient Boosting Trees,” Computer Methods in Applied Mechanics and Engineering, Vol. 398, 2022, p. 115360. https://doi.org/10.1016/j.cma.2022.115360

  3. [3]

    Applications of POD-Based Reduced-Order Models to Rapid 3-D Fluid-Flow Prediction,

    Sebastia-Capdevila, R., Trias, F. X., and Oliva, A., “Applications of POD-Based Reduced-Order Models to Rapid 3-D Fluid-Flow Prediction,” Applied Thermal Engineering, Vol. 234, 2024, p. 120123. https://doi.org/10.1016/j.apthermeng.2024.120123

  4. [4]

    Non-Intrusive Reduced-Order Modeling Using Convolutional Autoencoders,

    Halder, R., Fidkowski, K. J., and Maki, K. J., “Non-Intrusive Reduced-Order Modeling Using Convolutional Autoencoders,” International Journal for Numerical Methods in Engineering, Vol. 123, No. 21, 2022, pp. 5369–5390. https://doi.org/10.1002/nme.7049

  5. [5]

    Balanced Proper-Orthogonal-Decomposition-Based Feedback Control of Vortex-Induced Vibration,

    Jiang, H., and Cao, S., “Balanced Proper-Orthogonal-Decomposition-Based Feedback Control of Vortex-Induced Vibration,” Physical Review Fluids, Vol. 9, No. 7, 2024, p. 073901. https://doi.org/10.1103/PhysRevFluids.9.073901

  6. [6]

    Model Reduction of Dynamical Systems on Nonlinear Manifolds Using Deep Convolutional Autoencoders,

    Lee, K., and Carlberg, K. T., “Model Reduction of Dynamical Systems on Nonlinear Manifolds Using Deep Convolutional Autoencoders,” Journal of Computational Physics, Vol. 404, 2020, p. 108973. https://doi.org/10.1016/j.jcp.2019.108973

  7. [7]

    Interpolation Method for Adapting Reduced-Order Models and Application to Aeroelasticity,

    Amsallem, D., and Farhat, C., “Interpolation Method for Adapting Reduced-Order Models and Application to Aeroelasticity,” AIAA Journal, Vol. 46, No. 7, 2008, pp. 1803–1813. https://doi.org/10.2514/1.35374

  8. [8]

    On the Stability of POD Basis Interpolation on Grassmann Manifolds for Parametric MOR,

    Néron, B., Courty, N., and Berland, J.-Y., “On the Stability of POD Basis Interpolation on Grassmann Manifolds for Parametric MOR,” Computational Mechanics, 2022

  9. [9]

    Regression Trees on Grassmann Manifold for Adapting Reduced-Order Models,

    Liu, X., and Liu, X., “Regression Trees on Grassmann Manifold for Adapting Reduced-Order Models,” 2022. URL https://arxiv.org/abs/2206.11324

  10. [10]

    Adapting Projection-Based Reduced-Order Models using Projected Gaussian Process,

    Liu, X., Feng, J., and Liu, X., “Adapting Projection-Based Reduced-Order Models using Projected Gaussian Process,” SIAM Journal on Scientific Computing (to appear); arXiv:2410.14090, 2026

  11. [11]

    Data-driven surrogates for high dimensional models using Gaussian process regression on the Grassmann manifold,

    Giovanis, D. G., and Shields, M. D., “Data-driven surrogates for high dimensional models using Gaussian process regression on the Grassmann manifold,” Computer Methods in Applied Mechanics and Engineering, Vol. 370, 2020, p. 113269

  12. [12]

    Schubert varieties and distances between subspaces of different dimensions,

    Ye, K., and Lim, L.-H., “Schubert varieties and distances between subspaces of different dimensions,” SIAM Journal on Matrix Analysis and Applications, Vol. 37, No. 3, 2016, pp. 1176–1197

  13. [13]

    A survey of projection-based model reduction methods for parametric dynamical systems,

    Benner, P., Gugercin, S., and Willcox, K., “A survey of projection-based model reduction methods for parametric dynamical systems,” SIAM Review, Vol. 57, No. 4, 2015, pp. 483–531

  14. [14]

    XGBoost: A Scalable Tree Boosting System,

    Chen, T., and Guestrin, C., “XGBoost: A Scalable Tree Boosting System,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794. https://doi.org/10.1145/2939672.2939785

  15. [15]

    On the stability of POD basis interpolation on Grassmann manifolds for parametric model order reduction,

    Friderikos, O., Baranger, E., Olive, M., and Neron, D., “On the stability of POD basis interpolation on Grassmann manifolds for parametric model order reduction,” Computational Mechanics, Vol. 70, 2020, pp. 181–204

  16. [16]

    On pressure and velocity boundary conditions for the lattice Boltzmann BGK model,

    Zou, Q., and He, X., “On pressure and velocity boundary conditions for the lattice Boltzmann BGK model,” Physics of Fluids, Vol. 9, No. 6, 1997, pp. 1591–1598

  17. [17]

    A model for collision processes in gases. I. Small amplitude processes in charged and neutral one-component systems,

    Bhatnagar, P. L., Gross, E. P., and Krook, M., “A model for collision processes in gases. I. Small amplitude processes in charged and neutral one-component systems,” Physical Review, Vol. 94, No. 3, 1954, pp. 511–525

  18. [18]

    Lattice BGK models for Navier-Stokes equation,

    Qian, Y.-H., d’Humières, D., and Lallemand, P., “Lattice BGK models for Navier-Stokes equation,” Europhysics Letters, Vol. 17, No. 6, 1992, p. 479

  19. [19]

    Optics,

    Hecht, E., Optics, 5th ed., Pearson, 2017.