pith. sign in

arxiv: 2604.03284 · v1 · submitted 2026-03-24 · 📊 stat.CO · stat.ME

FunctionalCalibration: an R package for estimation in aggregated functional data model

Pith reviewed 2026-05-15 00:58 UTC · model grok-4.3

classification 📊 stat.CO stat.ME
keywords functional dataaggregated curvesR packagesplineswaveletscurve estimationBeer-Lambert lawadditive errors
0
0 comments X

The pith

The FunctionalCalibration R package estimates individual curves from aggregated functional observations using splines or wavelets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the FunctionalCalibration package for recovering constituent curves when only their summed aggregates are observed under additive noise. This setup appears in chemometrics through models like the Beer-Lambert law. The package implements the recovery step via spline or wavelet basis expansions. A reader would care because direct measurement of separate curves is often impossible while the aggregate is easy to record, so the method turns composite data into usable component estimates inside standard R workflows.

Core claim

The package FunctionalCalibration provides functions to estimate individual curves from aggregated curves by using splines or wavelet basis expansion in models with additive errors.

What carries the argument

Spline or wavelet basis expansion to approximate and recover the constituent curves from their observed sums.

Load-bearing premise

The observed aggregated curves equal the exact sum of the individual curves plus an additive error term.

What would settle it

Generate simulated aggregates from known individual curves plus noise that matches the model, apply the package, and check whether the recovered curves match the originals within the expected error level.

Figures

Figures reproduced from arXiv: 2604.03284 by Alex Rodrigo dos Santos Sousa, Vitor Ribas Perrone.

Figure 1
Figure 1. Figure 1: Tecator dataset of the R package fda.usc. 2.1.1 Estimation procedures 2.1.2 Wavelet approach Under the wavelet approach, we suppose M = 2J (J ∈ N). The discretized model can be written in matrix form as A = αy + ϵ, (2) where A = (Amn = An(tm))1≤m≤M,1≤n≤N , α = (αml = αl(tm))1≤m≤M,1≤l≤L, y = (yln)1≤l≤L,1≤n≤N and ϵ = (ϵmn = ϵn(tm))1≤m≤M,1≤n≤N . To perform the estimation procedure via wavelets, the functions … view at source ↗
Figure 2
Figure 2. Figure 2: Haar and Daubechies with 10 null moments wavelet functions. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Cubic B-splines. 3 Package Development The proposed methods were implemented in the R language, and a package named FunctionalCalibration was developed. The package is available both on CRAN and on GitHub. It includes a simulated dataset to illustrate and validate the implemented procedures, as well as four main functions for performing the calibration process. The next subsections describe each function o… view at source ↗
Figure 4
Figure 4. Figure 4: First component function [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: α1(x) estimated for the simulated dataset using wavelet-based calibration by the functional calibration wavelets function. −1.0 −0.5 0.0 0.5 1.0 1.5 2.0 −2 −1 0 1 2 3 Estimated alpha 2 x y [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: α1(x) estimated for the simulated dataset us￾ing spline-based calibration by functional calibration splines function. −1.0 −0.5 0.0 0.5 1.0 1.5 2.0 −2 −1 0 1 2 3 Estimated alpha 2 x y [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Example of an aggregated curve with weight 0.7 assigned to [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
read the original abstract

We consider the statistical problem of estimating constituent curves from observations of their aggregated curves, referred to as aggregated functional data, in models with additive errors. A typical model arises in chemometrics via the Beer-Lambert law. The package FunctionalCalibration provides functions to estimate individual curves from aggregated curves by using splines or wavelet basis expansion.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The manuscript introduces the R package FunctionalCalibration for estimating individual constituent curves from observations of their aggregated sums under an additive error model. The package implements estimation via spline or wavelet basis expansions, motivated by applications such as the Beer-Lambert law in chemometrics.

Significance. The work provides a convenient software implementation of standard basis-expansion methods for a recurring problem in functional data analysis. If the functions are correctly coded and accompanied by clear documentation and examples, the package could serve as a practical tool for applied researchers. However, the absence of any reported simulation studies, error analysis, or real-data validation substantially reduces the demonstrated utility and reliability of the contribution.

major comments (1)
  1. [Abstract] The manuscript contains no simulation studies, cross-validation results, or real-data applications demonstrating the finite-sample performance of the spline and wavelet estimators. This omission is load-bearing because the central claim is the practical utility of the package functions for recovering individual curves from aggregates.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript describing the FunctionalCalibration R package. We address the major comment below and will revise the manuscript to strengthen the demonstration of the package's utility.

read point-by-point responses
  1. Referee: [Abstract] The manuscript contains no simulation studies, cross-validation results, or real-data applications demonstrating the finite-sample performance of the spline and wavelet estimators. This omission is load-bearing because the central claim is the practical utility of the package functions for recovering individual curves from aggregates.

    Authors: We agree that the absence of simulation studies and real-data examples limits the demonstrated reliability of the package. In the revised manuscript we will add a dedicated section containing Monte Carlo simulations that evaluate the finite-sample performance of both the spline and wavelet estimators under varying noise levels, numbers of aggregated curves, and basis dimensions. We will also include a real-data example drawn from a chemometrics application consistent with the Beer-Lambert motivation. These additions will directly address the practical utility claim. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper describes an R package implementing standard spline and wavelet basis expansions to recover individual curves from aggregated sums plus noise under the externally motivated additive model (Beer-Lambert law). No equations, derivations, fitted-parameter predictions, uniqueness theorems, or self-citation chains are present that reduce the central claim to its own inputs by construction. The contribution is the software functionality itself, which is internally consistent with the stated model and does not rely on any load-bearing step that collapses to a fit or renaming.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the standard assumption of additive errors in aggregated functional observations and the use of established spline and wavelet bases; no free parameters, invented entities, or non-standard axioms are mentioned.

axioms (1)
  • domain assumption Aggregated curves equal the sum of constituent curves plus additive errors.
    Explicitly referenced in the abstract as the typical model arising via the Beer-Lambert law.

pith-pipeline@v0.9.0 · 5340 in / 1118 out tokens · 28454 ms · 2026-05-15T00:58:44.044749+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages

  1. [1]

    Chemometrics: data analysis for the laboratory and chemical plant

    Richard G Brereton. Chemometrics: data analysis for the laboratory and chemical plant. John Wiley & Sons, 2003

  2. [2]

    Bayesian wavelength selection in multicomponent analysis

    Philip J Brown, Marina Vannucci, and Tom Fearn. Bayesian wavelength selection in multicomponent analysis. Journal of Chemometrics: A Journal of the Chemometrics Society, 12 0 (3): 0 173--182, 1998 a

  3. [3]

    Multivariate bayesian variable selection and prediction

    Philip J Brown, Marina Vannucci, and Tom Fearn. Multivariate bayesian variable selection and prediction. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 60 0 (3): 0 627--641, 1998 b

  4. [4]

    Bayesian wavelet regression on curves with application to a spectroscopic calibration problem

    Philip J Brown, Tom Fearn, and Marina Vannucci. Bayesian wavelet regression on curves with application to a spectroscopic calibration problem. Journal of the American Statistical Association, 96 0 (454): 0 398--408, 2001

  5. [5]

    The use of principal components in the analysis of near-infrared spectra

    Ian A Cowe and James W McNicol. The use of principal components in the analysis of near-infrared spectra. Applied spectroscopy, 39 0 (2): 0 257--266, 1985

  6. [6]

    R. Dias, N. L. Garcia, and A. Martarelli. Non-parametric estimation for aggregated functional data for electric load monitoring. Environmetrics, 20 0 (2): 0 111--130, 2009

  7. [7]

    R. Dias, N. L. Garcia, and A. Schmidt. A hierarchical model for aggregated functional data. Technometrics, 55 0 (13): 0 321--334, 2013

  8. [8]

    Statistical computing in functional data analysis: The R package fda.usc

    Manuel Febrero-Bande and Manuel Oviedo de la Fuente . Statistical computing in functional data analysis: The R package fda.usc . Journal of Statistical Software, 51 0 (4): 0 1--28, 2012. URL https://www.jstatsoft.org/v51/i04/

  9. [9]

    wavethresh: Wavelets Statistics and Transforms, 2024

    Guy Nason. wavethresh: Wavelets Statistics and Transforms, 2024. URL https://CRAN.R-project.org/package=wavethresh. R package version 4.7.3

  10. [10]

    FunctionalCalibration: Aggregated Functional Data Calibration using Splines and Wavelets, 2025

    Vitor Perrone and Alex Sousa. FunctionalCalibration: Aggregated Functional Data Calibration using Splines and Wavelets, 2025. URL https://CRAN.R-project.org/package=FunctionalCalibration. R package version 1.0.0

  11. [11]

    fda: Functional Data Analysis, 2024

    James Ramsay. fda: Functional Data Analysis, 2024. URL https://CRAN.R-project.org/package=fda. R package version 6.1.8

  12. [12]

    Functional data analysis

    James O Ramsay and Bernard W Silverman. Functional data analysis. Springer, 1997

  13. [13]

    Análise não-paramétrica de dados funcionais: Uma aplicação à quimiometria

    Marley Apolinário Saraiva and Ronaldo Dias. Análise não-paramétrica de dados funcionais: Uma aplicação à quimiometria. Master's thesis, Universidade Estadual de Campinas, 2009

  14. [14]

    A multivariate calibration problem in analytical chemistry solved by partial least-squares models in latent variables

    Michael Sj \"o str \"o m, Svante Wold, Walter Lindberg, Jan- ke Persson, and Harald Martens. A multivariate calibration problem in analytical chemistry solved by partial least-squares models in latent variables. Analytica Chimica Acta, 150: 0 61--70, 1983

  15. [15]

    Bayesian wavelet shrinkage with logistic prior

    Alex Rodrigo dos Santos Sousa. Bayesian wavelet shrinkage with logistic prior. Communications in Statistics-Simulation and Computation, 51 0 (8): 0 4700--4714, 2022

  16. [16]

    A.R.S. Sousa. A wavelet-based method in aggregated functional data analysis. Monte Carlo Methods and Applications, 30 0 (1): 0 19--30, 2024

  17. [17]

    Vidakovic

    B. Vidakovic. Statistical modeling by wavelets. Wiley, New York, 1999