pith. sign in

arxiv: 2605.20636 · v1 · pith:MKRDPW7Hnew · submitted 2026-05-20 · 💱 q-fin.PM

Continuous Timing Signals for Growth-Defensive Style Allocation: Factor Attribution, Risk Matching, and Out-of-Sample Evidence

Pith reviewed 2026-05-21 02:31 UTC · model grok-4.3

classification 💱 q-fin.PM
keywords growth defensive allocationstyle timingcontinuous scoringmacro market signalsETF portfolio allocationfactor attributionrisk adjusted performancesmooth score
0
0 comments X

The pith

A continuous smooth score from rate relief, drawdown depth, VIX stress and growth crowding can allocate between growth and defensive ETF baskets to deliver a Sharpe ratio of 1.01 and maximum drawdown of 31.63 percent in 2017-2026 tests.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper tests whether known style exposures can be timed dynamically rather than treated as fixed or discovered anew. It frames the problem as growth-versus-defensive allocation between two ETF baskets and replaces regime switches with a single continuous score. The score blends four macro conditions through softplus interactions, maps the result to weights with a hyperbolic tangent, and smooths the positions with EWMA. Over the main window from June 2017 to May 2026 with 10 basis point costs, the policy with a 50 percent active tilt produces a 19.24 percent CAGR, a Sharpe ratio of 1.01, and a maximum drawdown of negative 31.63 percent. These figures beat a 50/50 mix, single-signal matches, the market index, and volatility-adjusted growth on risk-adjusted grounds, though they trail a pure growth static portfolio on raw return.

Core claim

The relative growth-minus-defensive portfolio exhibits recognizable style betas (market 0.273, HML -0.552, momentum 0.117) and negligible alpha, and a continuous timing rule built from rate relief, SPY drawdown depth, high-VIX relief, and a growth-crowding penalty improves risk-adjusted performance and drawdown control over static and partially matched benchmarks while leaving high static growth exposure as a tough total-return benchmark.

What carries the argument

The continuous smooth score that combines rate relief, SPY drawdown depth, high-VIX stress relief, and a growth-crowding penalty through softplus interaction terms, maps total score to G/D weights via hyperbolic tangent, and smooths realized weights with EWMA.

If this is right

  • The policy improves Sharpe ratio and drawdown control relative to 50/50, TNX-only, core-only, SPY, and volatility-matched 100 percent G benchmarks.
  • Walk-forward and post-2022 checks support drawdown reduction and risk-adjusted value beyond the main sample.
  • Continuous, interpretable timing adds measurable allocation value even though it does not surpass the best high-growth static portfolios in raw CAGR.
  • The underlying G-D return series behaves as a recognizable style portfolio rather than a new anomaly.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same continuous-score construction could be applied to other style pairs such as value-momentum or quality-low-volatility.
  • Replacing the fixed 50 percent tilt limit with a volatility-scaled version might further improve the risk-return profile in high-volatility regimes.
  • Testing the score on individual stocks or futures instead of ETF baskets would show whether the timing logic scales beyond basket-level implementation.

Load-bearing premise

The specific functional form that combines the four macro conditions through softplus interactions and tanh mapping will keep delivering effective timing outside the 2017-2026 window without material overfitting to the chosen signals or period.

What would settle it

A new multi-year window after May 2026 in which the smooth-score policy shows both lower Sharpe ratio and larger maximum drawdown than the 50/50 G/D benchmark would falsify the claim of reliable timing value.

Figures

Figures reproduced from arXiv: 2605.20636 by Zheli Xiong.

Figure 1
Figure 1. Figure 1: Active-Tilt Sensitivity Equity Curves, 2017-06-28 to 2026-05-15 [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Selected Smooth-Score Policy Equity Curves, 2017-06-28 to 2026-05-15 [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Aligned Main Equity Curves, 2017-06-28 to 2026-05-15 [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Buy-and-Hold G/D Baseline Equity Curves, 2017-06-28 to 2026-05-15 8 Volatility-Matched and Static G/D Comparisons The selected smooth policy has much lower average G exposure than 100% G. A direct comparison of CAGR therefore mixes signal quality with risk level. The vol-matched comparison scales 100% G to the smooth strategy’s annualized volatility. The required scaling weight is 81.95% [PITH_FULL_IMAGE:… view at source ↗
Figure 5
Figure 5. Figure 5: Vol-Matched and Static G/D Equity Curves, 2017-06-28 to 2026-05-15 9 Out-of-Sample Validation The validation section combines walk-forward expanding, walk-forward rolling, and fixed￾parameter validation into one aligned OOS window. The parameter pool is the candidate local grid described above. The initial training window is 252 trading days, and the test block length is 63 trading days. Walk-forward expan… view at source ↗
Figure 6
Figure 6. Figure 6: OOS Validation Equity Curves, 2018-06-28 to 2026-05-15 [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Post-2022 OOS Validation Equity Curves, 2022-01-03 to 2026-05-15 [PITH_FULL_IMAGE:figures/full_fig_p013_7.png] view at source ↗
read the original abstract

This paper studies conditional allocation between a growth/technology ETF basket, denoted by $G$, and a defensive income/value-oriented ETF basket, denoted by $D$. The objective is not to discover a new standalone alpha factor, but to examine whether known style exposures can be dynamically allocated using macro-market timing signals. Fama-French five-factor plus momentum attribution shows that the relative portfolio $G-D$ is a recognizable style portfolio: its market beta is 0.273, its HML beta is -0.552, its momentum beta is 0.117, and its annualized alpha is 1.95\% with a Newey-West t-statistic of only 0.81. The empirical object is therefore interpreted as a growth-versus-defensive style allocation problem rather than a new return anomaly. The allocation framework replaces discrete regime labels and if-then trading rules with a continuous smooth score. The score combines rate relief, SPY drawdown depth, high-VIX stress relief, and a growth-crowding penalty. Interaction terms are smoothed with softplus functions, the total score is mapped to G/D weights through a hyperbolic tangent function, and realized weights are smoothed with EWMA. In the main aligned comparison window from June 28, 2017 to May 15, 2026, with 10bp transaction costs, the selected smooth-score policy uses a 50\% maximum active tilt and obtains a 19.24\% CAGR, a Sharpe ratio of 1.01, and a maximum drawdown of -31.63\%. It improves over 50/50 G/D, matched TNX-only, matched core-only, SPY, and volatility-matched 100\% G benchmarks. It does not, however, exceed 100\% G or the best high-G static portfolios in raw CAGR. Walk-forward and post-2022 validations provide additional evidence of drawdown reduction and risk-adjusted allocation value. Overall, the evidence supports continuous, interpretable style timing, while also showing that high static growth exposure remains a strong benchmark.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 3 minor

Summary. The paper examines dynamic allocation between a growth/technology ETF basket (G) and a defensive income/value ETF basket (D) using a continuous smooth timing score rather than discrete regimes. The score integrates rate relief, SPY drawdown depth, high-VIX stress relief, and a growth-crowding penalty via softplus interactions, maps the result through a hyperbolic tangent, applies EWMA smoothing, and caps the active tilt at 50%. Fama-French five-factor plus momentum attribution shows the G-D spread has market beta 0.273, HML beta -0.552, momentum beta 0.117, and insignificant alpha (1.95%, NW t-stat 0.81). In the June 28, 2017–May 15, 2026 window with 10 bp costs, the policy delivers 19.24% CAGR, Sharpe ratio 1.01, and max drawdown -31.63%, outperforming 50/50 G/D, TNX-matched, core-matched, SPY, and volatility-matched 100% G benchmarks but not 100% G or top static high-G portfolios. Walk-forward and post-2022 checks are cited as supporting evidence.

Significance. If the continuous timing rule proves robust, the work supplies a practical, interpretable alternative to regime-switching for style allocation that demonstrably reduces drawdowns while preserving growth exposure. The explicit factor attribution and risk-matched benchmarks help situate the results within existing style literature rather than claiming a new anomaly. The emphasis on out-of-sample validation and transaction-cost realism is a positive feature for applied portfolio-management research.

major comments (3)
  1. [Section 3] Timing-score construction (Section 3, around the softplus-tanh-EWMA definition): the specific functional form that combines the four signals through softplus interactions, applies tanh mapping, and uses EWMA smoothing with the 50% tilt cap contains multiple tunable elements whose selection process is not fully documented. Even with walk-forward validation, it is unclear whether these choices were fixed ex ante or refined on the 2017-2026 sample; this directly affects the credibility of the headline 19.24% CAGR and Sharpe 1.01 as generalizable rather than sample-specific.
  2. [Section 5] Out-of-sample and post-2022 validation (Section 5 or 6): the abstract states that walk-forward and post-2022 checks support drawdown reduction, yet it is not explicit whether the identical fixed functional form and parameter values were applied without any re-calibration on the later sub-period. If any component weights or smoothing constants were adjusted, the reported risk-adjusted improvement over volatility-matched benchmarks would require re-evaluation.
  3. [Table 2] Benchmark comparisons (Table 2 or equivalent results table): while the strategy improves on 50/50 G/D and matched benchmarks, it does not exceed 100% G or the best static high-G portfolios in raw CAGR. This limitation should be stated more prominently when interpreting the economic value of the timing overlay, as the central claim of “allocation value” rests partly on risk-adjusted metrics that may be sensitive to the chosen volatility-matching procedure.
minor comments (3)
  1. [Section 4] Clarify the exact definition of the “aligned comparison window” and how the 10 bp transaction-cost assumption is implemented (round-trip or one-way) in the performance calculations.
  2. [Section 2] The Newey-West t-statistic of 0.81 for the alpha in the factor attribution should be accompanied by the lag length chosen and a brief robustness check to alternative lag specifications.
  3. [Section 1] Notation for the G and D baskets and the precise ETF constituents could be listed in a table or appendix for reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We have prepared point-by-point responses to the major comments and indicate where revisions will be made to improve clarity and address the concerns raised.

read point-by-point responses
  1. Referee: [Section 3] Timing-score construction (Section 3, around the softplus-tanh-EWMA definition): the specific functional form that combines the four signals through softplus interactions, applies tanh mapping, and uses EWMA smoothing with the 50% tilt cap contains multiple tunable elements whose selection process is not fully documented. Even with walk-forward validation, it is unclear whether these choices were fixed ex ante or refined on the 2017-2026 sample; this directly affects the credibility of the headline 19.24% CAGR and Sharpe 1.01 as generalizable rather than sample-specific.

    Authors: The functional form was developed based on economic considerations and insights from existing macro timing literature, with parameters selected to reflect reasonable sensitivities prior to examining the full sample results. To enhance transparency, we will revise Section 3 to provide a more detailed account of the rationale for each component, including the choice of softplus for interactions, the tanh mapping, EWMA smoothing parameter, and the 50% tilt cap. We will also explicitly state that these choices were fixed before conducting the main empirical analysis and that the walk-forward procedure applied the pre-specified form without further optimization on the 2017-2026 data. revision: yes

  2. Referee: [Section 5] Out-of-sample and post-2022 validation (Section 5 or 6): the abstract states that walk-forward and post-2022 checks support drawdown reduction, yet it is not explicit whether the identical fixed functional form and parameter values were applied without any re-calibration on the later sub-period. If any component weights or smoothing constants were adjusted, the reported risk-adjusted improvement over volatility-matched benchmarks would require re-evaluation.

    Authors: We confirm that the exact same functional form, including all weights, interaction terms, mapping functions, smoothing constants, and the 50% tilt cap, was applied without any re-calibration or adjustment in the walk-forward tests and the post-2022 sub-period. No parameters were tuned based on the later data. We will add a clear statement in the revised Section 5 to this effect to remove any ambiguity. revision: yes

  3. Referee: [Table 2] Benchmark comparisons (Table 2 or equivalent results table): while the strategy improves on 50/50 G/D and matched benchmarks, it does not exceed 100% G or the best static high-G portfolios in raw CAGR. This limitation should be stated more prominently when interpreting the economic value of the timing overlay, as the central claim of “allocation value” rests partly on risk-adjusted metrics that may be sensitive to the chosen volatility-matching procedure.

    Authors: We agree that this limitation merits more prominent discussion. Although the manuscript already notes in the abstract and introduction that the strategy does not exceed 100% G in raw CAGR, we will expand the interpretation section to more explicitly highlight this point. We will clarify that the primary benefits of the continuous timing approach are in risk-adjusted performance and drawdown mitigation, and discuss the sensitivity of these metrics to the volatility-matching procedure used for benchmarks. This will provide a more balanced view of the economic value. revision: yes

Circularity Check

0 steps flagged

No significant circularity: score constructed from observables, performance measured externally

full rationale

The paper defines a continuous allocation score by combining observable market signals (rate relief, SPY drawdown depth, high-VIX relief, growth-crowding penalty) via explicit functional forms: softplus for interactions, hyperbolic tangent for weight mapping, and EWMA for smoothing. These forms are stated as design choices, not derived from the target G-D returns. Reported metrics (CAGR, Sharpe, drawdown) are computed by applying the pre-specified score to a historical window and compared to external benchmarks (50/50 G/D, SPY, volatility-matched portfolios). Walk-forward and post-2022 checks are mentioned as validation steps. No equation reduces the performance numbers to a fitted parameter or self-citation by construction; the central empirical claim remains an out-of-sample-style measurement against independent benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The framework rests on standard factor-model interpretation and observable market signals. No new particles or forces are postulated. A small number of implementation choices function as free parameters.

free parameters (2)
  • maximum active tilt = 50%
    Caps deviation from neutral 50/50 weights at 50% to control risk exposure
  • EWMA decay parameter
    Controls smoothing of realized portfolio weights over time
axioms (1)
  • domain assumption The relative G-D portfolio can be interpreted as a recognizable style portfolio on the basis of its Fama-French five-factor plus momentum betas and small alpha
    Used to frame the problem as style timing rather than discovery of a new anomaly

pith-pipeline@v0.9.0 · 5916 in / 1713 out tokens · 52609 ms · 2026-05-21T02:31:29.478491+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

12 extracted references · 12 canonical work pages

  1. [1]

    Journal of Financial Economics , volume=

    Common risk factors in the returns on stocks and bonds , author=. Journal of Financial Economics , volume=

  2. [2]

    Journal of Financial Economics , volume=

    A five-factor asset pricing model , author=. Journal of Financial Economics , volume=

  3. [3]

    Journal of Finance , volume=

    On persistence in mutual fund performance , author=. Journal of Finance , volume=

  4. [4]

    Review of Financial Studies , volume=

    A comprehensive look at the empirical performance of equity premium prediction , author=. Review of Financial Studies , volume=

  5. [5]

    Review of Financial Studies , volume=

    Predicting excess stock returns out of sample: Can anything beat the historical average? , author=. Review of Financial Studies , volume=

  6. [6]

    Econometrica , volume=

    A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix , author=. Econometrica , volume=

  7. [7]

    Econometrica , volume=

    A new approach to the economic analysis of nonstationary time series and the business cycle , author=. Econometrica , volume=

  8. [8]

    Journal of Economic Dynamics and Control , volume=

    Asset allocation under multivariate regime switching , author=. Journal of Economic Dynamics and Control , volume=

  9. [9]

    Review of Financial Studies , volume=

    Parametric portfolio policies: Exploiting characteristics in the cross-section of equity returns , author=. Review of Financial Studies , volume=

  10. [10]

    Econometrica , volume=

    A reality check for data snooping , author=. Econometrica , volume=

  11. [11]

    Journal of Business & Economic Statistics , volume=

    A test for superior predictive ability , author=. Journal of Business & Economic Statistics , volume=

  12. [12]

    Journal of Computational Finance , volume=

    The probability of backtest overfitting , author=. Journal of Computational Finance , volume=