pith. sign in

arxiv: 2605.16324 · v1 · pith:SQWIB7D6new · submitted 2026-05-05 · 💻 cs.LG · cs.CE· q-fin.ST

Bi-Level Chaotic Fusion Based Graph Convolutional Network for Stock Market Prediction Interval

Pith reviewed 2026-05-21 00:30 UTC · model grok-4.3

classification 💻 cs.LG cs.CEq-fin.ST
keywords prediction intervalsgraph convolutional networkstock market forecastingchaotic fusionvolatility-aware gatinguncertainty quantificationNSE data
0
0 comments X

The pith

A bi-level chaotic fusion graph model produces tighter and better-calibrated prediction intervals for stock prices than standard LSTM, GRU, and GCN baselines.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to replace single-value stock forecasts with prediction intervals that bound possible future prices and express their uncertainty. It builds a graph convolutional network in which bi-level chaotic fusion applies separate nonlinear maps to compute interval center and width, while a volatility-aware gate adjusts the flow of information according to current market conditions. Relationships among stocks are encoded as graph edges and processed sequentially so that temporal patterns reflect both asset interdependencies and regime shifts. The network is trained by minimizing a lower-upper bound estimation loss. When evaluated on daily data for 43 major companies across eight sectors of the National Stock Exchange from 2016 to 2026, the resulting intervals record the lowest Winkler score, the smallest average width, and the highest coverage probability, all differences reaching statistical significance against the listed baselines.

Core claim

The central claim is that embedding stock relationships in a graph, fusing chaotic nonlinear transformations at two levels to set interval bounds, and gating the temporal updates by volatility produces prediction intervals that are simultaneously sharp and well-calibrated for financial time series.

What carries the argument

Bi-level chaotic fusion inside graph convolutional layers, which uses two distinct nonlinear transformation functions to estimate the center and width of each prediction interval together with a volatility-aware gating mechanism that conditions the recurrence on the prevailing market regime.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same fusion-plus-gating pattern could be tested on other networked financial series such as commodity prices or currency pairs where assets also influence one another.
  • Adding macroeconomic nodes to the graph might let the intervals reflect external shocks without changing the core training objective.
  • Real-time deployment on live order-book data would reveal whether the interval widths remain useful for setting dynamic risk limits during intraday trading.

Load-bearing premise

That the graph of 43 selected stocks and the volatility patterns observed between 2016 and 2026 are representative enough for the fusion and gating steps to generalize without overfitting to that particular market slice.

What would settle it

Retraining the model on an earlier decade (for example 2005-2015) or on a different exchange and finding that the Winkler score and coverage advantages disappear or lose statistical significance would falsify the claim that the architecture reliably captures regime-dependent dependencies.

Figures

Figures reproduced from arXiv: 2605.16324 by Eshwar Sai Kandimalla, Hem Sundhar Korukunda, Sravan Chowdary Kankanala, Sumana Bhimineni, Vivek Yelleti.

Figure 1
Figure 1. Figure 1: Architecture of LSTM Gated Recurrent Unit Gated Recurrent Units (GRUs) are advanced recurrent neural networks that are used to capture the temporal dynamics in a more efficient manner using a gating mechanism [17]. In contrast to long short-term memory (LSTM) neural networks, the GRU model is a simpler structure because it combines both input and forget gates to one update gate. In every time instant t, th… view at source ↗
Figure 2
Figure 2. Figure 2: Architecture of GRU Graph Convolution network Graph Convolutional Networks (GCNs) generalize convolution to graph-based data structures that facilitate learning on non-Euclidean spaces [14]. The graph structure G=(V,E) consists of nodes representing entities and edges denoting relations among these entities. Within finance, stock assets can be considered as nodes while the edges can be considered as correl… view at source ↗
Figure 3
Figure 3. Figure 3: Architecture of BCF-GCN [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗
Figure 2
Figure 2. Figure 2: Graph representaƟon learning using the correlaƟon graph. The stocks are nodes, while edges are formed based on a condiƟon whereby the returns of the stock correlaƟon is above a threshold value. Note : BANKi indicates the ith bank, while BANK, IT, PHARMA, and ENERGY are stocks of respecƟve sectors. 4.4 Bi-Level Chaotic Transformation Layer In this layer, we utilize both the logistic and tent chaotic maps to… view at source ↗
Figure 3
Figure 3. Figure 3: Logistic and tent maps used in the bi-level chaotic fusion design. [PITH_FULL_IMAGE:figures/full_fig_p016_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Gating and interval generation logic [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗
Figure 4
Figure 4. Figure 4: Data Preprocessing Pipeline [PITH_FULL_IMAGE:figures/full_fig_p020_4.png] view at source ↗
Figure 8
Figure 8. Figure 8: Coverage-width trade-off showing BCF-GCN as the sharpest covered model [PITH_FULL_IMAGE:figures/full_fig_p025_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Ablation study: impact of component removal on PIAW and Winkler score [PITH_FULL_IMAGE:figures/full_fig_p028_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Ablation study heatmap showing relative performance across metrics [PITH_FULL_IMAGE:figures/full_fig_p029_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Mean Winkler score across multi-seed testing. [PITH_FULL_IMAGE:figures/full_fig_p030_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Backtesting SMAPE results [PITH_FULL_IMAGE:figures/full_fig_p032_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Diebold-Mariano significance results [PITH_FULL_IMAGE:figures/full_fig_p032_13.png] view at source ↗
Figure 12
Figure 12. Figure 12: Example Prediction Interval for high-priced stocks [PITH_FULL_IMAGE:figures/full_fig_p034_12.png] view at source ↗
read the original abstract

Financial market forecasting is inherently uncertain, yet most deep learning approaches rely on point predictions that provide only single-value estimates without quantifying uncertainty. Such predictions are insufficient for risk-aware decision-making, as they fail to capture the range of possible outcomes and the associated confidence of forecasts.The problem can be solved using prediction intervals, which allow obtaining an upper and lower bound for the prediction, thus enabling uncertainty representation in the model. Yet, the current methods tend to disregard relationships between assets or cannot simultaneously ensure good calibration and sharpness of the resulting intervals in dynamically changing market regimes. In our work, we propose a spatio-temporal graph-based approach with a bi-level chaotic fusion technique to solve this problem. Our model uses separate nonlinear transformation functions to estimate the interval center and width. Additionally, a volatility-aware gating mechanism is used to make predictions dependent on the regime in which the market operates. Temporal dependencies are considered by embedding graph structures and sequentially modeling them. Training is conducted according to a Lower-Upper Bound Estimation (LUBE) objective. Our experimental results show significant improvements compared to existing baselines (LSTM, GRU, GCN, HGNN) when applied to data from 2016 to 2026 with 43 leading companies in eight sectors of the NSE. It provides the lowest Winkler score (0.0778), tightest prediction intervals (PIAW = 0.1407), and highest coverage (PICP = 96.6%), with all differences statistically significant (p < 0.001) according to the Diebold-Mariano test.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript presents a Bi-Level Chaotic Fusion Based Graph Convolutional Network for generating prediction intervals in stock market forecasting. The model incorporates graph structures to capture inter-asset relationships, uses nonlinear transformations for interval center and width, a volatility-aware gating mechanism for regime dependence, and is trained using the Lower-Upper Bound Estimation (LUBE) objective. Experiments on data from 43 leading companies in the NSE from 2016 to 2026 demonstrate improved performance over baselines like LSTM, GRU, GCN, and HGNN, with the lowest Winkler score of 0.0778, PIAW of 0.1407, PICP of 96.6%, and statistically significant differences (p < 0.001) per the Diebold-Mariano test.

Significance. If the empirical results are robust to proper temporal validation and graph construction without lookahead bias, this work could contribute to better uncertainty quantification in financial time series by integrating spatio-temporal modeling with chaotic fusion and regime-aware gating. The approach addresses a relevant problem in risk-aware decision making.

major comments (3)
  1. [Experimental Setup] The paper does not specify how the graph adjacency matrix is constructed. If it is based on correlations computed over the full 2016-2026 period, this would introduce lookahead bias, as future data informs the graph used for earlier time steps, potentially inflating the reported PICP and sharpness metrics.
  2. [Results] No error bars, standard deviations, or confidence intervals are provided for the reported metrics (Winkler score, PIAW, PICP). This makes it difficult to evaluate the practical significance of the improvements and the reliability of the Diebold-Mariano test results.
  3. [Methods] There is no discussion of data leakage prevention, such as using rolling window validation or ensuring the graph is constructed only on training data. This is critical for time series forecasting claims.
minor comments (2)
  1. [Abstract] The abstract mentions 'bi-level chaotic fusion technique' but does not provide a brief explanation of what this entails.
  2. [Notation] The definitions of interval center and width transformations could be clarified with equations for better reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback. The comments highlight important aspects of temporal integrity and reporting that we will address through revisions to strengthen the manuscript. We respond to each major comment below.

read point-by-point responses
  1. Referee: [Experimental Setup] The paper does not specify how the graph adjacency matrix is constructed. If it is based on correlations computed over the full 2016-2026 period, this would introduce lookahead bias, as future data informs the graph used for earlier time steps, potentially inflating the reported PICP and sharpness metrics.

    Authors: We agree that the manuscript should have explicitly described the adjacency matrix construction to preclude any concern of lookahead bias. In our experiments the graph was formed using Pearson correlations computed only on the training window data for each prediction step, with edges retained above a fixed threshold. We will revise the Methods section to provide the precise construction formula, the threshold value, and confirmation that the process is repeated independently for every training fold using exclusively in-sample observations. revision: yes

  2. Referee: [Results] No error bars, standard deviations, or confidence intervals are provided for the reported metrics (Winkler score, PIAW, PICP). This makes it difficult to evaluate the practical significance of the improvements and the reliability of the Diebold-Mariano test results.

    Authors: This observation is correct and we will remedy it. The original experiments were repeated across five independent random seeds; we will now report mean values accompanied by standard deviations for all metrics and will add these statistics to the result tables. We will also include a brief discussion of the observed variability to support interpretation of the Diebold-Mariano test outcomes. revision: yes

  3. Referee: [Methods] There is no discussion of data leakage prevention, such as using rolling window validation or ensuring the graph is constructed only on training data. This is critical for time series forecasting claims.

    Authors: We acknowledge that an explicit description of leakage-prevention measures was omitted. Our protocol uses a rolling-window scheme in which both the training set and the adjacency matrix are updated at each step using only data available up to the end of the current training window. We will add a dedicated subsection in Experimental Setup that details the window length, stride, and the per-window graph reconstruction procedure, thereby documenting that temporal ordering is strictly preserved. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation chain self-contained against external benchmarks

full rationale

The abstract and described methods introduce a GCN-based model with bi-level chaotic fusion, volatility-aware gating, and LUBE training objective to produce prediction intervals. Standard evaluation metrics (Winkler score, PIAW, PICP) are applied post-training on held-out data from 2016-2026 NSE stocks, with comparisons to baselines. No equations or steps are visible that define a quantity in terms of itself, rename a fitted parameter as a prediction, or reduce the central claim to a self-citation chain. The graph construction and temporal modeling are presented as independent architectural choices whose performance is externally falsifiable via the reported statistical tests. This is the common honest finding for papers whose core contribution is an empirical architecture rather than a closed-form derivation.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 1 invented entities

Review performed on abstract only; concrete free parameters, axioms, and invented entities cannot be extracted. The abstract implies several modeling choices whose details are unavailable.

free parameters (2)
  • nonlinear transformation functions for interval center and width
    Separate functions are used to estimate center and width; their parameters are necessarily fitted to data.
  • volatility-aware gating parameters
    Gating mechanism depends on market regime and must be parameterized and trained.
axioms (1)
  • domain assumption Graph structures derived from asset relationships capture relevant spatio-temporal dependencies in stock returns
    Invoked when the model embeds graph structures to model temporal dependencies.
invented entities (1)
  • bi-level chaotic fusion technique no independent evidence
    purpose: To fuse information at two levels for improved interval estimation
    Introduced as a core component of the proposed architecture

pith-pipeline@v0.9.0 · 5842 in / 1519 out tokens · 52044 ms · 2026-05-21T00:30:53.817701+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages

  1. [1]

    D., & Waegeman, W

    Dewolf, N., Baets, B. D., & Waegeman, W. (2023). Valid prediction intervals for regression problems. Artificial Intelligence Review, 56(1), 577-613

  2. [2]

    J., & Meeker, W

    Tian, Q., Nordman, D. J., & Meeker, W. Q. (2022). Methods to compute prediction intervals: A review and new results. Statistical Science, 37(4), 580-597

  3. [3]

    Yadav, N., Chakraborty, N., & Tewari, A. (2022). Interval prediction machine learning models for predicting experimental thermal conductivity of high entropy alloys. Computational Materials Science, 214, 111754

  4. [4]

    S., Chen, Y ., Wei, Z., & Pham, V

    Borah, M., Gayan, A., Sharma, J. S., Chen, Y ., Wei, Z., & Pham, V . T. (2022). Is fractional-order chaos theory the new tool to model chaotic pandemics as Covid- 19?. Nonlinear dynamics, 109(2), 1187-1215

  5. [5]

    Liang, F., Qian, C., Yu, W., Griffith, D., & Golmie, N. (2022). Survey of graph neural networks and applications. Wireless Communications and Mobile Computing, 2022(1), 9261537

  6. [6]

    Borenstein, M. (2023). How to understand and report heterogeneity in a meta- analysis: the difference between I-squared and prediction intervals. Integrative Medicine Research, 12(4), 101014

  7. [7]

    H., & Larocque, D

    Roy, M. H., & Larocque, D. (2020). Prediction intervals with random forests. Statistical Methods in Medical Research, 29(1), 205-229

  8. [8]

    D., & Leeb, H

    Kivaranovic, D., Johnson, K. D., & Leeb, H. (2020, June). Adaptive, distribution-free prediction intervals for deep networks. In International Conference on Artificial Intelligence and Statistics (pp. 4346-4356). PMLR

  9. [9]

    Zhang, H., Zimmerman, J., Nettleton, D., & Nordman, D. J. (2020). Random forest prediction intervals. The American Statistician

  10. [10]

    F., & Candès, E

    Xie, R., Barber, R. F., & Candès, E. J. (2024). Boosted conformal prediction intervals. Advances in Neural Information Processing Systems, 37, 71868-71899

  11. [11]

    Grushka-Cockayne, Y ., & Jose, V . R. R. (2020). Combining prediction intervals in the M4 competition. International Journal of Forecasting, 36(1), 178-185

  12. [12]

    (2021, May)

    Elder, B., Arnold, M., Murthi, A., & Navrátil, J. (2021, May). Learning prediction intervals for model performance. In Proceedings of the AAAI Conference on Artificial Intelligence (V ol. 35, No. 8, pp. 7305-7313)

  13. [13]

    M., & Roth, A

    Gupta, V ., Jung, C., Noarov, G., Pai, M. M., & Roth, A. (2021). Online multivalid learning: Means, moments, and prediction intervals. arXiv preprint arXiv:2101.01739

  14. [14]

    Zhao, C., Wan, C., & Song, Y . (2021). Cost-oriented prediction intervals: On bridging the gap between forecasting and decision. IEEE Transactions on Power Systems, 37(4), 3048-3062

  15. [15]

    Zhou, M., Wang, B., Guo, S., & Watada, J. (2021). Multi-objective prediction intervals for wind power forecast based on deep neural networks. Information sciences, 550, 207-220

  16. [16]

    (2021, March)

    Chen, H., Huang, Z., Lam, H., Qian, H., & Zhang, H. (2021, March). Learning prediction intervals for regression: Generalization and calibration. In International Conference on Artificial Intelligence and Statistics (pp. 820-828). PMLR

  17. [17]

    S., Langseth, H., & Ramampiaro, H

    Salem, T. S., Langseth, H., & Ramampiaro, H. (2020, August). Prediction intervals: Split normal mixture from quality-driven deep ensembles. In Conference on Uncertainty in Artificial Intelligence (pp. 1179-1187). PMLR

  18. [18]

    Lai, Y ., Shi, Y ., Han, Y ., Shao, Y ., Qi, M., & Li, B. (2022). Exploring uncertainty in regression neural networks for construction of prediction intervals. Neurocomputing, 481, 249-257

  19. [19]

    D., Akbari, M., Khosravi, A., Nahavandi, S., & Carmichael, D

    Nasirzadeh, F., Kabir, H. D., Akbari, M., Khosravi, A., Nahavandi, S., & Carmichael, D. G. (2020). ANN-based prediction intervals to forecast labour productivity. Engineering, Construction and Architectural Management, 27(9), 2335- 2351

  20. [20]

    Simhayev, E., Katz, G., & Rokach, L. (2020). Piven: A deep neural network for prediction intervals with specific value prediction. arXiv preprint arXiv:2006.05139

  21. [21]

    Zhao, C., Wan, C., Song, Y ., & Cao, Z. (2020). Optimal nonparametric prediction intervals of electricity load. IEEE Transactions on Power Systems, 35(3), 2467-2470

  22. [22]

    Mathonsi, T., & Zyl, T. L. V . (2025). Multivariate anomaly detection based on prediction intervals constructed using deep learning. Neural Computing and Applications, 37(2), 707-721

  23. [23]

    M., & Aler, R

    Alcántara, A., Galván, I. M., & Aler, R. (2022). Direct estimation of prediction intervals for solar and wind regional energy forecasting with deep neural networks. Engineering Applications of Artificial Intelligence, 114, 105128

  24. [24]

    (2022, May)

    Thirumuruganathan, S., Shetiya, S., Koudas, N., & Das, G. (2022, May). Prediction intervals for learned cardinality estimation: an experimental evaluation. In 2022 IEEE 38th International Conference on Data Engineering (ICDE) (pp. 3051-3064). IEEE

  25. [25]

    Liu, F., Liu, Q., Tao, Q., Huang, Y ., Li, D., & Sidorov, D. (2023). Deep reinforcement learning based energy storage management strategy considering prediction intervals of wind power. International Journal of Electrical Power & Energy Systems, 145, 108608

  26. [26]

    Sarveswararao, V ., & Ravi, V . (2021). Prediction intervals for macroeconomic variables using LSTM based LUBE method. In Modern Approaches in Machine Learning and Cognitive Science: A Walkthrough: Latest Trends in AI, Volume 2 (pp. 267-275). Cham: Springer International Publishing

  27. [27]

    Sarveswararao, V ., Ravi, V ., & Huq, S. T. U. (2022). Optimal prediction intervals for macroeconomic time series using chaos and evolutionary multi-objective optimization algorithms. Swarm and Evolutionary Computation, 71, 101070

  28. [28]

    Michael, O., & Goldshtein, B. L. (2026, May). Semi-Supervised GNN for Sound Source Localization with Prediction Intervals. In ICASSP 2026-2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 20881-20885). IEEE

  29. [29]

    R., Yang, Z., & Liu, K

    Liao, W., Wang, S., Bak-Jensen, B., Pillai, J. R., Yang, Z., & Liu, K. (2023). Ultra- short-term interval prediction of wind power based on graph neural network and improved bootstrap technique. Journal of Modern Power Systems and Clean Energy, 11(4), 1100-1114

  30. [30]

    Huang, Z., Wang, D., Yin, Y ., & Cheng, T. C. E. (2025). A prediction interval framework-based spatial–temporal convolution block network for traffic demand prediction. Transportation Research Part E: Logistics and Transportation Review, 204, 104426

  31. [31]

    S., & Dini, M

    Nourani, V ., Zonouz, R. S., & Dini, M. (2023). Estimation of prediction intervals for uncertainty assessment of artificial neural network based wastewater treatment plant effluent modeling. Journal of Water Process Engineering, 55, 104145

  32. [32]

    Wang, L., Mao, M., Xie, J., Liao, Z., Zhang, H., & Li, H. (2023). Accurate solar PV power prediction interval method based on frequency-domain decomposition and LSTM model. Energy, 262, 125592

  33. [33]

    Dehghani, A., Moazam, H. M. Z. H., Mortazavizadeh, F., Ranjbar, V ., Mirzaei, M., Mortezavi, S., & Dehghani, A. (2023). Comparative evaluation of LSTM, CNN, and ConvLSTM for hourly short-term streamflow forecasting using deep learning approaches. Ecological Informatics, 75, 102119

  34. [34]

    K., Ravi, V ., & Krishna, P

    Vivek, Y ., Vadlamani, S. K., Ravi, V ., & Krishna, P. R. (2026). Improved differential evolution based feature selection through chaos, quantum, and lasso logistic regression. Quantum Machine Intelligence, 8(1), 13

  35. [35]

    Vivek, Y ., Ravi, V ., & Krishna, P. R. (2025). Feature subset selection for big data via parallel chaotic binary differential evolution and feature-level elitism. Computers and Electrical Engineering, 123, 110232

  36. [36]

    Gangadhar, K. S. N. V . K., Kumar, B. A., Vivek, Y ., & Ravi, V . (2025, August). Chaotic variational auto encoder based one class classifier for insurance fraud detection. In 2025 International Conference on Emerging Techniques in Computational Intelligence (ICETCI) (pp. 1-8). IEEE

  37. [37]

    Reddy, D. P. V . S., Vivek, Y ., Pranay, G., & Ravi, V . (2025). Chaotic variational auto encoder-based adversarial machine learning. Computers and Electrical Engineering, 128, 110646

  38. [39]

    Vivek, Y ., Ravi, V ., & Krishna, P. R. (2025). Parallel Chaotic bi-objective evolutionary algorithms for scalable feature subset selection via migration strategy. Applied Soft Computing, 114009

  39. [40]

    S., Korukonda, H

    Kandimalla, E. S., Korukonda, H. S., Bhimineni, S., Kankanala, S. C., & Vivek, Y. (2025). Spatio-temporal chaotic graph convolutional network for stock market forecasting. Accepted at the 10th International Conference on Data Management, Analytics and Innovation (ICDMAI 2026)