Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model
Pith reviewed 2026-05-16 21:23 UTC · model grok-4.3
The pith
Embedding aggregate analyst consensus as a bottleneck in a neural network reveals priced stock-return variation missed by standard factor models.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The CB-APM embeds aggregate analyst consensus as a structural bottleneck inside a deep network, treating professional beliefs as a sufficient statistic for the market's high-dimensional information set. The bottleneck simultaneously regularizes the model for better predictive performance and anchors its outputs to interpretable belief-driven drivers. Sorted portfolios on CB-APM forecasts exhibit a strong monotonic return gradient that remains robust across regimes, and the learned consensus captures priced variation that canonical factor models systematically miss.
What carries the argument
The consensus bottleneck, a layer that compresses the network's hidden state onto aggregate analyst forecasts and thereby serves as both regularizer and interpretable channel.
If this is right
- Portfolios sorted on CB-APM forecasts exhibit a strong monotonic return gradient.
- The learned consensus encodes priced variation not spanned by canonical factor models.
- The bottleneck improves out-of-sample predictive accuracy while preserving economic interpretability.
- The identified risk heterogeneity remains robust across macroeconomic regimes.
Where Pith is reading between the lines
- If the bottleneck truly isolates belief-driven risk, replacing analyst consensus with alternative belief proxies should produce comparable monotonic portfolios.
- The approach suggests that belief heterogeneity may explain cross-sectional return patterns that linear models attribute to noise or omitted variables.
- Testing the model on non-equity assets or during periods of analyst forecast dispersion spikes would reveal whether the mechanism generalizes beyond stocks.
Load-bearing premise
Professional beliefs serve as a sufficient statistic for the market's high-dimensional information set.
What would settle it
Out-of-sample portfolios sorted on CB-APM forecasts fail to display a monotonic return gradient, or the extracted consensus component is fully spanned by existing factor models such as Fama-French.
Figures
read the original abstract
We introduce the Consensus-Bottleneck Asset Pricing Model (CB-APM), which embeds aggregate analyst consensus as a structural bottleneck, treating professional beliefs as a sufficient statistic for the market's high-dimensional information set. Unlike post-hoc explainability approaches, CB-APM achieves interpretability-by-design: the bottleneck constraint functions as an endogenous regularizer that simultaneously improves out-of-sample predictive accuracy and anchors inference to economically interpretable drivers. Portfolios sorted on CB-APM forecasts exhibit a strong monotonic return gradient, robust across macroeconomic regimes. Pricing diagnostics further reveal that the learned consensus encodes priced variation not spanned by canonical factor models, identifying belief-driven risk heterogeneity that standard linear frameworks systematically miss.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the Consensus-Bottleneck Asset Pricing Model (CB-APM), a deep neural network architecture that imposes aggregate analyst consensus as a structural bottleneck, treating professional beliefs as a sufficient statistic for the market's high-dimensional information set. The model is claimed to achieve interpretability-by-design via the bottleneck regularizer, deliver superior out-of-sample predictive accuracy, generate portfolios with strong monotonic return gradients across macroeconomic regimes, and identify priced variation in the learned consensus that is not spanned by canonical linear factor models such as Fama-French-Carhart.
Significance. If the central claims hold after rigorous verification, the work offers a novel bridge between deep learning and asset pricing by embedding economic structure directly into the network rather than relying on post-hoc explanations. It could provide evidence for belief-driven risk heterogeneity that linear frameworks miss and supply a practical, interpretable tool for return forecasting and portfolio construction.
major comments (2)
- [Abstract and §3] Abstract and §3 (model construction): the bottleneck is defined directly in terms of the same consensus variable used for both training and evaluation; without explicit residual orthogonality diagnostics after training (e.g., regression of CB-APM residuals on Fama-French-Carhart factors), it remains unclear whether the reported alphas reflect independently identified priced variation or a nonlinear transformation of the consensus already partially captured by linear models.
- [§4] §4 (pricing diagnostics): the claim that the learned consensus encodes priced variation not spanned by canonical factors requires a formal test that the consensus residual is orthogonal to the factor space post-training; the abstract provides no such statistic or cross-validation procedure, leaving the central pricing result vulnerable to the circularity concern that the bottleneck compresses away precisely the variation it claims to isolate.
minor comments (2)
- [Abstract] The abstract refers to 'robust across macroeconomic regimes' without specifying the regime classification method or the exact number of regimes tested.
- [Model section] Notation for the bottleneck layer and the consensus variable should be introduced with explicit equations in the model section to aid reproducibility.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive comments. We address each major point below and will revise the manuscript to incorporate additional diagnostics that directly respond to the concerns.
read point-by-point responses
-
Referee: [Abstract and §3] Abstract and §3 (model construction): the bottleneck is defined directly in terms of the same consensus variable used for both training and evaluation; without explicit residual orthogonality diagnostics after training (e.g., regression of CB-APM residuals on Fama-French-Carhart factors), it remains unclear whether the reported alphas reflect independently identified priced variation or a nonlinear transformation of the consensus already partially captured by linear models.
Authors: We agree that explicit post-training orthogonality checks would strengthen the interpretation. In the CB-APM, the consensus acts as a structural bottleneck on the high-dimensional inputs, and the network is trained end-to-end to map this compressed representation to returns. The reported alphas arise from out-of-sample portfolio sorts on the resulting forecasts. To address the circularity concern directly, we will add regressions of the CB-APM residuals on the Fama-French-Carhart factors (and report R-squared and significance) in a revised §4, along with a brief discussion in the abstract. revision: yes
-
Referee: [§4] §4 (pricing diagnostics): the claim that the learned consensus encodes priced variation not spanned by canonical factors requires a formal test that the consensus residual is orthogonal to the factor space post-training; the abstract provides no such statistic or cross-validation procedure, leaving the central pricing result vulnerable to the circularity concern that the bottleneck compresses away precisely the variation it claims to isolate.
Authors: The referee is correct that a formal orthogonality test is needed to substantiate the claim of priced variation outside the linear factor space. While the architecture and out-of-sample results provide supporting evidence, we will add the requested formal test (regression of post-training residuals on the canonical factors) together with cross-validation statistics in the revised §4. We will also reference these diagnostics in the abstract to clarify that the bottleneck isolates incremental priced risk. revision: yes
Circularity Check
Consensus bottleneck renders 'unspanned priced variation' claim circular by construction
specific steps
-
self definitional
[Abstract]
"We introduce the Consensus-Bottleneck Asset Pricing Model (CB-APM), which embeds aggregate analyst consensus as a structural bottleneck, treating professional beliefs as a sufficient statistic for the market's high-dimensional information set. ... Pricing diagnostics further reveal that the learned consensus encodes priced variation not spanned by canonical factor models, identifying belief-driven risk heterogeneity that standard linear frameworks systematically miss."
The model explicitly defines the bottleneck as the analyst consensus variable and then claims the output 'learned consensus' encodes additional priced variation. By construction the network is forced to compress all predictive signal through this input, so the unspanned-variation claim is equivalent to the bottleneck assumption rather than derived from it.
full rationale
The paper's core architecture defines the bottleneck directly as aggregate analyst consensus and treats it as a sufficient statistic for the full information set. All subsequent claims about the learned consensus encoding priced variation not spanned by canonical factors therefore reduce to transformations of this same input variable. The pricing diagnostics and portfolio sorts cannot isolate independent belief-driven heterogeneity because the network is structurally constrained to route signal through the consensus; any apparent alpha after factor controls is an artifact of the bottleneck definition rather than an emergent result. This is a self-definitional reduction with no independent identification step.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Aggregate analyst consensus constitutes a sufficient statistic for the market's high-dimensional information set
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce the Consensus-Bottleneck Asset Pricing Model (CB-APM), which embeds aggregate analyst consensus as a structural bottleneck, treating professional beliefs as a sufficient statistic for the market's high-dimensional information set.
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the structural constraint acts as an endogenous regularizer that simultaneously improves out-of-sample predictive accuracy
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
variables with missing-value rates exceeding 20% across the firm panel are removed
-
[2]
variables with insufficient historical coverage (sample starting year after January 1994) are excluded. The resulting set of firm-level characteristics provides a balanced trade-off between data complete- ness and information diversity, ensuring that each firm contributes a meaningful set of observations to both the consensus and return-prediction modules...
work page 1994
-
[3]
4,683 firms with nonmissing analyst consensus data, 66
-
[4]
114 firm-level predictors and 123 macroeconomic indicators (including 115 from FRED-MD and 8 from Welch and Goyal, 2008),
work page 2008
-
[5]
a total of 605,722 firm-month observations spanning January 1994 to December 2023. This refined panel forms the empirical foundation for all model estimation and evaluation procedures described in Section 2. B.3 Data imputation Although the majority of studies neglect the importance of data imputation methods and simply handle missing values by substituti...
work page 1994
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.