pith. sign in

arxiv: 2605.23703 · v1 · pith:V742IAIVnew · submitted 2026-05-22 · 💰 econ.EM

Dynamic Consumer Demand at Large Scale

Pith reviewed 2026-05-25 02:24 UTC · model grok-4.3

classification 💰 econ.EM
keywords consumer demanddynamic factor modeldiscrete choiceinertiaheterogeneityvariational inferenceretail datapredictive performance
0
0 comments X

The pith

A dynamic factor model pools heterogeneity across products and categories to improve demand prediction in large retail settings with sparse data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a dynamic product-level factor model for estimating consumer demand with many products, categories, and repeated purchases. It factorizes individual coefficients for baseline preferences, price sensitivity, and inertia into shared latent factors that pool information across consumers and categories. Bayesian variational inference makes estimation feasible at scale with tens of thousands of parameters. Simulations calibrated to retail data show better out-of-sample predictions than static factor models or mixed logit, especially with limited purchase histories per person. Including inertia produces more elastic demand estimates.

Core claim

The dynamic product-level factor model factorizes individual-product coefficients through a shared latent factor structure to capture correlated heterogeneity in baseline preferences, price sensitivity, and inertia; this yields substantially better predictive performance than static factor models and mixed logit benchmarks when individual histories are sparse and produces more elastic demand estimates by accounting for inertia.

What carries the argument

The dynamic product-level factor model that factorizes individual-product coefficients via shared latent factors to represent joint heterogeneity in preferences, price sensitivity, and inertia.

If this is right

  • Demand estimates become more elastic once inertia is incorporated.
  • Predictive gains are largest when individual purchase histories are short.
  • The factor structure enables scaling to high-dimensional product and category spaces.
  • Information pools across individuals and categories without separate per-person estimation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach could support multi-category pricing policies by linking elasticities through the shared factors.
  • Extensions might test whether the same structure improves forecasts in non-retail dynamic choice settings such as subscription services.
  • If the latent factors prove stable over time, the model could reduce data requirements for new product introductions.

Load-bearing premise

The shared latent factor structure is assumed to adequately represent and pool the joint heterogeneity in baseline preferences, price sensitivity, and inertia across individuals and categories without substantial misspecification.

What would settle it

A hold-out test on real retail transaction data with sparse individual histories where the dynamic factor model shows no improvement in predictive accuracy over static models or where demand elasticities do not rise when inertia is included would falsify the central claim.

Figures

Figures reproduced from arXiv: 2605.23703 by Anna B. Schmidt, Daniel Brunner, Florian Heiss.

Figure 1
Figure 1. Figure 1: Probability of choosing the same product in [PITH_FULL_IMAGE:figures/full_fig_p010_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Simulation results for up to C = 200 categories with R = 25 datasets per setting and I = 40, J = 10, T = 20 and K = 5. The right panel of figure 2 reports runtime on a log scale. The dynamic and static factor models exhibit roughly proportional growth in runtime as the number of categories increases, consistent with the need to evaluate multinomial logit choice probabilities over a large dataset. By contra… view at source ↗
Figure 3
Figure 3. Figure 3: Mean own-price elasticities with 95% credibility intervals by category for [PITH_FULL_IMAGE:figures/full_fig_p014_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Simulation results for standard normal priors and less informative [PITH_FULL_IMAGE:figures/full_fig_p018_4.png] view at source ↗
read the original abstract

We study consumer demand in large-scale retail settings with many products, multiple categories and repeated purchase behavior. While inertia and brand loyalty are well documented, existing discrete choice models typically focus on single categories or become computationally infeasible in high-dimensional environments. We propose a dynamic product-level factor model that captures heterogeneity in baseline preferences, price sensitivity and inertia through a shared latent factor structure. By factorizing individual-product coefficients, the model pools information across individuals and categories and allows for correlated heterogeneity. We estimate the model using Bayesian variational inference, enabling scalable estimation with tens of thousands of parameters. In a simulation study calibrated to realistic retail data, we show that the dynamic factor model substantially improves predictive performance relative to static factor models and mixed logit benchmarks, particularly when individual purchase histories are sparse. Accounting for inertia also leads to more elastic demand estimates, underscoring the importance of dynamics for measuring consumer responsiveness. Our results highlight dynamic factor models as a scalable and flexible approach for demand estimation in modern, high-dimensional retail markets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The paper proposes a dynamic product-level factor model for estimating consumer demand in large-scale retail settings with many products and categories. The model factorizes individual-product coefficients to capture correlated heterogeneity in baseline preferences, price sensitivity, and inertia, estimated via scalable Bayesian variational inference. A simulation study calibrated to realistic retail data shows improved out-of-sample predictive performance relative to static factor models and mixed logit benchmarks (especially with sparse purchase histories) and more elastic demand estimates when inertia is included.

Significance. If the simulation results prove robust, the work would offer a computationally feasible method for incorporating dynamics and high-dimensional heterogeneity into discrete choice models, addressing a key limitation in modern retail demand estimation where individual histories are often sparse. The variational inference approach for handling tens of thousands of parameters is a notable technical strength for scalability.

major comments (1)
  1. [Simulation study] The central performance and elasticity claims rest entirely on a single simulation study (described in the abstract). Without details on robustness to alternative hyperparameter choices, holdout designs, or data-generating processes, it is difficult to assess whether the reported gains over static factors and mixed logit generalize beyond the calibrated setting.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback. We address the single major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Simulation study] The central performance and elasticity claims rest entirely on a single simulation study (described in the abstract). Without details on robustness to alternative hyperparameter choices, holdout designs, or data-generating processes, it is difficult to assess whether the reported gains over static factors and mixed logit generalize beyond the calibrated setting.

    Authors: We agree that the simulation results would be strengthened by additional robustness checks. In the revised manuscript we will expand Section 4 to include (i) sensitivity to alternative hyperparameter values in the variational inference, (ii) alternative holdout designs that vary the share and length of sparse purchase histories, and (iii) two further data-generating processes that change the degree of inertia and the correlation structure of heterogeneity. These additions will clarify the extent to which the reported gains generalize beyond the baseline calibration, which itself is chosen to match key moments from scanner data. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper proposes a dynamic product-level factor model with shared latent factors for heterogeneity in preferences, price sensitivity, and inertia, estimated via Bayesian variational inference. Central performance claims rest on an external simulation study calibrated to realistic retail data, comparing out-of-sample predictions against static factor models and mixed logit benchmarks. No equations, fitted parameters, or self-citations are shown to reduce any reported prediction or result to the model's own inputs by construction. The factor-pooling structure is presented as a standard scalable approach without self-definitional loops or imported uniqueness theorems. This is a self-contained proposal with independent simulation evidence.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review is based solely on the abstract; the model introduces latent factors as a modeling device but no explicit free parameters, axioms, or invented entities are enumerated in the provided text.

pith-pipeline@v0.9.0 · 5695 in / 1085 out tokens · 25112 ms · 2026-05-25T02:24:49.633997+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages

  1. [1]

    and Blei, D

    Athey, S. and Blei, D. and Donnelly, R. and Ruiz, F. and Schmidt, T. , title =. AEA Papers and Proceedings , volume =. 2018 , pages =

  2. [2]

    Quantitative Marketing and Economics , pages=

    Counterfactual inference for consumer choice across many product categories , author=. Quantitative Marketing and Economics , pages=

  3. [3]

    Journal of Machine Learning Research , volume=

    Automatic differentiation variational inference , author=. Journal of Machine Learning Research , volume=

  4. [4]

    The Annals of Applied Statistics , volume=

    SHOPPER: a probabilistic model of consumer choice with substitutes and complements , author=. The Annals of Applied Statistics , volume=. 2020 , pages=

  5. [5]

    and McFadden, D

    Heiss, F. and McFadden, D. and Winter, J. and Wuppermann, A. and B. Zhou , title =. American Economic Review , volume=

  6. [6]

    American Economic Journal: Economic Policy , volume=

    Power to choose? An analysis of consumer inertia in the residential electricity market , author=. American Economic Journal: Economic Policy , volume=

  7. [7]

    American Economic Review , volume=

    The evolution of brand preferences: Evidence from consumer migration , author=. American Economic Review , volume=

  8. [8]

    Available at SSRN 3380390 , year=

    Consumer inertia and market power , author=. Available at SSRN 3380390 , year=

  9. [9]

    Marketing Science , volume=

    A logit model of brand choice calibrated on scanner data , author=. Marketing Science , volume=

  10. [10]

    Marketing Science , volume=

    Decision-making under uncertainty: Capturing dynamic brand choice processes in turbulent consumer goods markets , author=. Marketing Science , volume=

  11. [11]

    Journal of Business & Economic Statistics , volume=

    Modeling heterogeneity and state dependence in consumer choice behavior , author=. Journal of Business & Economic Statistics , volume=

  12. [12]

    Discrete Choice Methods with Simulation , author=

  13. [13]

    Frontiers in Econometrics , editor =

    McFadden, Daniel , title =. Frontiers in Econometrics , editor =. 1974 , pages =

  14. [14]

    The Econometrics Journal , volume =

    Aguirregabiria, Victor , title =. The Econometrics Journal , volume =

  15. [15]

    and Kashaev, Nail , title =

    Aguiar, Victor H. and Kashaev, Nail , title =. 2019 , howpublished =. 1907.04853 , archivePrefix=

  16. [16]

    and Kashaev, Nail , title =

    Aguiar, Victor H. and Kashaev, Nail , title =. Journal of Business & Economic Statistics , volume =

  17. [17]

    , title =

    Manrai, Ajay K. , title =. European Journal of Operational Research , volume =

  18. [18]

    Econometrica , volume =

    Berry, Steven and Levinsohn, James and Pakes, Ariel , title =. Econometrica , volume =

  19. [19]

    and Blei, David M

    Hoffman, Matthew D. and Blei, David M. and Wang, Chong and Paisley, John , title =. Journal of Machine Learning Research , volume =

  20. [20]

    , title =

    Ranganath, Rajesh and Gerrish, Sean and Blei, David M. , title =. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics (AISTATS) , year =

  21. [21]

    and Welling, Max , title =

    Kingma, Diederik P. and Welling, Max , title =. Proceedings of the International Conference on Learning Representations (ICLR) , year =

  22. [22]

    Journal of Statistical Software , volume =

    Stan: A probabilistic programming language , author =. Journal of Statistical Software , volume =

  23. [23]

    The RAND Journal of Economics , volume =

    State dependence and alternative explanations for consumer inertia , author =. The RAND Journal of Economics , volume =

  24. [24]

    , title =

    Heckman, James J. , title =. Structural Analysis of Discrete Data with Econometric Applications , editor =. 1981 , pages =

  25. [25]

    The RAND Journal of Economics , volume =

    Mergers with differentiated products: The case of the ready-to-eat cereal industry , author =. The RAND Journal of Economics , volume =

  26. [26]

    Computer , volume =

    Matrix Factorization Techniques for Recommender Systems , author =. Computer , volume =

  27. [27]

    Advances in Neural Information Processing Systems , year =

    Probabilistic Matrix Factorization , author =. Advances in Neural Information Processing Systems , year =

  28. [28]

    Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability , volume =

    Statistical inference in factor analysis , author =. Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability , volume =

  29. [29]

    Biometrika , volume =

    Bayesian factor analysis and density estimation using a class of factor models , author =. Biometrika , volume =

  30. [30]

    Biometrika , volume =

    Sparse Bayesian infinite factor models , author =. Biometrika , volume =

  31. [31]

    and Kucukelbir, Alp and McAuliffe, Jon D

    Blei, David M. and Kucukelbir, Alp and McAuliffe, Jon D. , title =. Journal of the American Statistical Association , volume =

  32. [32]

    Bayesian Statistics and Marketing , author =

  33. [33]

    Annual Review of Economics , volume=

    The use of scanner data for economics research , author=. Annual Review of Economics , volume=. 2022 , publisher=

  34. [34]

    RAND Journal of Economics , pages=

    Empirically distinguishing informative and prestige effects of advertising , author=. RAND Journal of Economics , pages=

  35. [35]

    The Review of Economic Studies , volume=

    Vertical relationships between manufacturers and retailers: Inference with limited data , author=. The Review of Economic Studies , volume=

  36. [36]

    Journal of Political Economy , volume=

    Vertical contracts with endogenous product selection: An empirical analysis of vendor allowance contracts , author=. Journal of Political Economy , volume=

  37. [37]

    2021 , institution=

    Common ownership and competition in the ready-to-eat cereal industry , author=. 2021 , institution=

  38. [38]

    2023 , institution=

    Scalable demand and markups , author=. 2023 , institution=

  39. [39]

    Journal of Political Economy , volume=

    Rising markups and the role of consumer preferences , author=. Journal of Political Economy , volume=

  40. [40]

    Journal of Applied Econometrics , volume=

    Mixed MNL Models For Discrete Response , author=. Journal of Applied Econometrics , volume=

  41. [41]

    2018 , eprint=

    A Conceptual Introduction to Hamiltonian Monte Carlo , author=. 2018 , eprint=

  42. [42]

    Estimation of Random Utility Models in

    Yves Croissant , journal =. Estimation of Random Utility Models in. 2020 , volume =

  43. [43]

    Journal of Econometrics , volume=

    Bayesian and classical approaches to instrumental variable regression , author=. Journal of Econometrics , volume=