pith. sign in

arxiv: 2604.23469 · v1 · submitted 2026-04-25 · 📊 stat.ME · econ.EM

Estimation of MIDAS Regressions with Errors-in-the-Variables

Pith reviewed 2026-05-08 07:26 UTC · model grok-4.3

classification 📊 stat.ME econ.EM
keywords MIDAS regressionmeasurement errorerrors-in-variablesprofile likelihoodcorrected scoreconsistent estimatorMonte Carlo simulation
0
0 comments X

The pith

Measurement errors render the profile-likelihood estimator inconsistent in MIDAS regressions, but a corrected-score adjustment restores consistency.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper studies Mixed Data Sampling (MIDAS) regressions in which both the high-frequency and low-frequency variables are observed with additive measurement error. It first shows that the standard profile-likelihood estimator loses consistency under this contamination. The authors then construct a consistent estimator by embedding a corrected-score correction inside the profile-likelihood framework. Monte Carlo experiments examine how the estimator's performance changes with sample size, number of lags, and the profiling parameter.

Core claim

In the MIDAS model with measurement error in both variables, the profile-likelihood estimator is inconsistent, whereas the estimator obtained by combining the corrected-score approach with profile likelihood remains consistent; its small- and large-sample properties are verified through simulation.

What carries the argument

The corrected-score function, which adjusts the profile-likelihood score to cancel the bias induced by measurement error in the MIDAS parameters.

If this is right

  • The new estimator delivers consistent parameter estimates for MIDAS models even when measurement error contaminates the observed series.
  • Simulation evidence indicates that finite-sample accuracy improves with larger samples and suitable choices of lag length and profiling parameter.
  • The approach yields reliable inference in settings where high-frequency predictors are known to be noisy.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The corrected-score technique could be adapted to other mixed-frequency or distributed-lag specifications that currently rely on profile likelihood.
  • In macroeconomic nowcasting applications, the estimator may reduce bias arising from data revisions that act as measurement error.
  • Direct comparison of the corrected estimator against instrumental-variable alternatives on the same simulated designs would clarify relative efficiency.

Load-bearing premise

That an explicit correction term for the score function can be derived that fully offsets the inconsistency caused by additive measurement errors in both the high- and low-frequency series.

What would settle it

A Monte Carlo study in which the corrected estimator's bias fails to approach zero as the sample size tends to infinity while measurement-error variances remain fixed and positive.

Figures

Figures reproduced from arXiv: 2604.23469 by Kanchan Jain, Pooja Soni, Sukhbir Kaur, Sukhbir Singh.

Figure 1
Figure 1. Figure 1: Plots of NMedB, medB(θ) and medB(σ 2 ϵ ) with respect to Sample size (T) for j max = 12 and θ = (2, 5) For observing the effect of j max on the proposed estimator ( ˆβ ′ c , ˆθc, σˆ 2 ϵc), Figures 2(a)-2(d) depict graphical presentation of NMedB, trMedSEM, medB(θ) and medB(σ 2 ϵ ) with respect to sample size for various values of j max . Figures 2(a) and 2(b) suggest that as sample size increases, the bias… view at source ↗
Figure 2
Figure 2. Figure 2: Plots of NMedB, trMedSEM, medB(θ) and medB(σ 2 ϵ ) with respect to sample size (T) for θ = 2 and j max = (6, 12, 24) trMedSEM and medB(θ) increase. However, for the higher sample size, the bias and variability are comparatively lower. Measure of σ 2 ϵ , medB(σ 2 ϵ ) shows different pattern view at source ↗
Figure 3
Figure 3. Figure 3: Plots of NMedB, trMedSEM, medB(θ) and medB(σ 2 ϵ ) with respect to j max at fixed Sample size, T = 36 and for θ = (2, 5, 10) The effects of σ 2 u and σ 2 v on NMedB, trMedSEM, medB(θ), and medB(σ 2 ϵ ) are discussed, with more detailed results provided in Tables 1–4. Comparing the values of NMedB, trMedSEM, medB(θ) and medB(σ 2 ϵ ) from view at source ↗
Figure 4
Figure 4. Figure 4: Plots of NMedB, trMedSEM, medB(θ) and medB(σ 2 ϵ ) with respect to j max for θ = 2 and T = (24, 36, 48) j max=9 T θ=2 θ=5 NMedB trMedSEM medB(θ) medB(σ 2 ϵ ) NMedB trMedSEM medB(θ) medB(σ 2 ϵ ) 24 0.1693 0.1527 0.2770 0.0575 0.1135 0.1078 0.3079 0.0623 48 0.0539 0.0668 0.1045 0.0188 0.0596 0.0479 0.2138 0.0141 72 0.0317 0.0459 0.0443 0.0082 0.0374 0.0353 0.0204 0.0059 120 0.0148 0.0162 0.0253 0.0033 0.0201… view at source ↗
read the original abstract

In this paper, a Mixed Data Sampling (MIDAS) model is studied when both low and high frequency variables are contaminated with measurement error. It is shown that the profile likelihood estimator becomes inconsistent in the presence of measurement error. Using the corrected score approach along with profile likelihood approach, a consistent estimator for parameters of MIDAS Measurement Error model is proposed. Small and large sample properties of the estimator are examined by performing a monte carlo simulation study and considering the effect of sample size, number of lags and profiling parameter.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript studies MIDAS regressions when both the low-frequency response and high-frequency regressors are subject to measurement error. It asserts that the profile likelihood estimator is inconsistent in this setting and proposes a new estimator that combines the corrected-score approach with profile likelihood to achieve consistency. Finite- and large-sample properties are investigated through Monte Carlo simulations that vary sample size, number of lags, and the profiling parameter.

Significance. If the corrected-score estimator is shown to be consistent, the paper would supply a useful extension of errors-in-variables methods to the MIDAS framework, which is widely used in econometrics and finance. The Monte Carlo experiments provide concrete evidence on finite-sample behavior under different lag structures and profiling values, a positive feature. However, the absence of explicit derivations for both the inconsistency result and the form of the correction term limits the theoretical contribution at present.

major comments (3)
  1. [Abstract and theoretical results section] The abstract states that the profile likelihood estimator becomes inconsistent, yet no derivation, bias expression, or set of assumptions is supplied. The full paper must contain an explicit calculation (likely in the model or theoretical section) showing how measurement error in the high-frequency series, after nonlinear lag weighting, produces asymptotic bias in the profiled estimator.
  2. [Estimator construction section] The corrected-score construction for the MIDAS model with errors in both variables is presented at a high level. Because the high-frequency regressor enters through a parameterized lag polynomial, the correction term E[score | observed data] depends on the entire weighted sum; the manuscript must specify the exact functional form of this term, the required knowledge of error variances, and any distributional assumptions (e.g., normality or independence across frequencies) needed for the expectation to be computable and for consistency to hold.
  3. [Simulation study section] The Monte Carlo study examines the effects of sample size, number of lags, and the profiling parameter, but the data-generating process, the precise measurement-error variances, and the implementation of the profiling step are not described. These details are necessary to assess whether the reported small- and large-sample properties actually support the consistency claim under the conditions where the analytic correction is valid.
minor comments (2)
  1. The abstract would be strengthened by a single sentence listing the key assumptions (error distribution, variance knowledge, lag structure) under which the corrected estimator is consistent.
  2. [Notation and model section] Notation for the low- and high-frequency variables, the lag polynomial, and the measurement-error terms should be introduced once and used uniformly to avoid ambiguity when the corrected score is defined.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments, which help clarify the presentation of our theoretical results and simulation design. We address each major comment below and will revise the manuscript to incorporate the requested details.

read point-by-point responses
  1. Referee: [Abstract and theoretical results section] The abstract states that the profile likelihood estimator becomes inconsistent, yet no derivation, bias expression, or set of assumptions is supplied. The full paper must contain an explicit calculation (likely in the model or theoretical section) showing how measurement error in the high-frequency series, after nonlinear lag weighting, produces asymptotic bias in the profiled estimator.

    Authors: We agree that an explicit derivation is required to support the inconsistency claim. In the revised manuscript we will add a dedicated subsection deriving the asymptotic bias of the profile likelihood estimator. The derivation will show how the measurement error in the high-frequency series, after passing through the nonlinear lag polynomial, induces a nonzero plim for the profiled estimator, and will state the required assumptions on the error distribution and lag structure. revision: yes

  2. Referee: [Estimator construction section] The corrected-score construction for the MIDAS model with errors in both variables is presented at a high level. Because the high-frequency regressor enters through a parameterized lag polynomial, the correction term E[score | observed data] depends on the entire weighted sum; the manuscript must specify the exact functional form of this term, the required knowledge of error variances, and any distributional assumptions (e.g., normality or independence across frequencies) needed for the expectation to be computable and for consistency to hold.

    Authors: The referee is correct that the current presentation is too high-level. We will expand the estimator section to give the precise functional form of the correction term, which is the conditional expectation of the score given the observed weighted sum. Under the maintained assumptions of independent normal measurement errors with known variances, the correction takes an explicit closed-form adjustment involving the variance of the latent regressor sum. The revised text will list these assumptions and the requirement that error variances be known or separately estimated. revision: yes

  3. Referee: [Simulation study section] The Monte Carlo study examines the effects of sample size, number of lags, and the profiling parameter, but the data-generating process, the precise measurement-error variances, and the implementation of the profiling step are not described. These details are necessary to assess whether the reported small- and large-sample properties actually support the consistency claim under the conditions where the analytic correction is valid.

    Authors: We accept that additional implementation details are needed. The revised simulation section will fully specify the data-generating process (true MIDAS parameters, lag polynomial, and how measurement errors are added to both series), the exact measurement-error variances employed, and the numerical procedure used to implement the profiling step (grid search or optimization routine). These additions will allow readers to verify that the reported finite-sample behavior is consistent with the analytic conditions under which the corrected estimator is valid. revision: yes

Circularity Check

0 steps flagged

No significant circularity; estimator derived independently and validated via simulation

full rationale

The paper derives the inconsistency of the profile likelihood estimator under measurement error in both variables and constructs a new consistent estimator by combining the corrected score approach with profile likelihood. This construction is presented as a direct analytic correction for the MIDAS lag polynomial structure, then evaluated for finite-sample and asymptotic behavior through independent Monte Carlo experiments that vary sample size, number of lags, and the profiling parameter. No step reduces by definition to its own inputs, renames a fitted quantity as a prediction, or relies on a load-bearing self-citation whose validity is presupposed. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review prevents identification of specific free parameters, axioms, or invented entities; the profiling parameter is referenced as a simulation factor but its status is unclear.

axioms (1)
  • domain assumption Profile likelihood estimator is inconsistent when both low- and high-frequency variables contain measurement error.
    Directly stated as shown in the paper.

pith-pipeline@v0.9.0 · 5384 in / 1291 out tokens · 59527 ms · 2026-05-08T07:26:30.346447+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages

  1. [1]

    Andreou, E., Ghysels, E., and Kourtellos, A. (2007). Regression models with mixed sampling frequencies.Journal of Econometrics, 158:246–261

  2. [2]

    J., Eltinge, J

    Carroll, R. J., Eltinge, J. L., and Ruppert, D. (1993). Robust linear regression in replicated measurement error models.Statistics and Probability Letters, 16(3):169–175

  3. [3]

    and Kukush, A

    Cheng, C.-L. and Kukush, A. (2006). Non-existence of the first moment of the adjusted least squares estimator.Metrika, 64:41–46

  4. [4]

    Cheng, C. L. and Van Ness, J. W. (1999).Statistical Regression with Measurement Error. Arnold

  5. [5]

    Dolby, G. R. (1976). The ultrastructural relation: A synthesis of the functional and structural relations.Biometrika, 63(1):39–50

  6. [6]

    Fuller, W. A. (1987).Measurement Error Models. John Wiley

  7. [7]

    and Qian, H

    Ghysels, E. and Qian, H. (2019). Estimating midas regressions via ols with polynomial parameter profiling.Econometrics and Statistics, 9:1–16

  8. [8]

    Ghysels, E., Santa-Clara, P., and Valkanov, R. (2004). The midas touch: Mixed data sampling regression models.CIRANO Working Papers

  9. [9]

    Ghysels, E., Sinko, A., and Valkanov, R. (2007). Midas regressions: Further results and new directions.Econometric Reviews, 26:53–90

  10. [10]

    Gleser, L. J. (1992). The importance of assessing measurement reliability.Journal of the American Statistical Association, 87:696–707

  11. [11]

    Gleser, L. J. (1993). Estimators of slopes in linear errors-in-variables regression models. Statistics and Probability Letters, 17(2):113–121

  12. [12]

    Jain, K., Singh, S., and Sharma, S. (2011). Restricted estimation in multivariate measurement error regression model.Journal of Multivariate Analysis, 102:264–280

  13. [13]

    Nakamura, T. (1990). Corrected score function for errors-in-variables models.Biometrika, 77(1):127–137. 18 Reiersøl, O. (1941). Confluence analysis by means of lag moments.Econometrica, 9(1):1–24. Reiersøl, O. (1945). Confluence analysis by means of instrumental sets of variables.Arkiv for Matematik, 32:1–119

  14. [14]

    and Srivastava, V

    Schneeweiss, H. and Srivastava, V . K. (1994). Bias and mean squared error of the slope estimator. Statistical Papers, 35:329–335. Shalabh (2001). Consistent estimation through weighted harmonic mean.Econometric Reviews, 20(4):507–510. Shalabh (2003). Consistent estimation of coefficients in measurement error models.Journal of Multivariate Analysis, 86(2):227–241

  15. [15]

    Shalabh, Garg, G., and Misra, N. (2007). Restricted regression estimation in measurement error models.Computational Statistics and Data Analysis, 52(2):1149–1166

  16. [16]

    Shalabh, Garg, G., and Misra, N. (2009). Use of prior information in estimation of regression coefficients.Journal of Multivariate Analysis, 100(7):1498–1520

  17. [17]

    Singh, S., Jain, K., and Sharma, S. (2012). Using stochastic prior information in consistent estimation.Journal of Multivariate Analysis, 111:198–212

  18. [18]

    Singh, S., Jain, K., and Sharma, S. (2014). Replicated measurement error model under exact linear restrictions.Statistical Papers, pages 253–274

  19. [19]

    and Buonaccorsi, J

    Staudenmayer, J. and Buonaccorsi, J. (2005). Measurement error in linear autoregressive models. Journal of the American Statistical Association, 100:841–852

  20. [20]

    Ullah, A., Shalabh, and Mukherjee, D. (2001). Consistent estimation of regression coefficients. Annals of Economics and Finance, 2:249–264. 19 Appendix Probability Limits The following notations and orders have been used: •Ψis a known matrix of order(T−p)×(j max +p+ 1), •Eis a random vector of order(T−p)×1, •Vis a random matrix of order(T−p)×(j max +p+ 1)...