Variational decision diagrams for quantum-inspired machine learning applications
Pith reviewed 2026-05-23 03:45 UTC · model grok-4.3
The pith
Variational decision diagrams represent quantum states and show no barren plateaus when trained on ground states of Ising and Heisenberg Hamiltonians.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Variational decision diagrams (VDDs) are a new graph structure for representing quantum states that combines decision diagrams with variational methods. When applied to ground state estimation for transverse-field Ising and Heisenberg Hamiltonians, gradient variance analysis shows no signs of vanishing gradients or barren plateaus, indicating that VDDs are trainable.
What carries the argument
Variational decision diagrams (VDDs), graph structures that exploit data redundancies in quantum states while incorporating variational parameters for adaptability.
If this is right
- VDDs provide an efficient alternative to variational quantum circuits for representing states in QML.
- Training VDDs is feasible for ground state problems without gradient vanishing issues in the tested cases.
- The structural benefits of decision diagrams can be adapted for variational optimization in quantum-inspired learning.
- Insights from VDDs may guide the design of new ansatze that avoid barren plateaus.
Where Pith is reading between the lines
- Similar gradient analysis could be applied to other quantum-inspired models to check trainability.
- VDDs might extend to problems beyond ground state estimation, such as classification tasks.
- If scalable, VDDs could reduce the resource requirements compared to full quantum circuit simulations.
Load-bearing premise
The absence of barren plateaus observed for the transverse-field Ising and Heisenberg Hamiltonians implies the same for VDDs in general.
What would settle it
Finding vanishing gradients or barren plateaus when applying VDDs to ground state estimation of a different Hamiltonian or at larger system sizes.
Figures
read the original abstract
Decision diagrams (DDs) have emerged as an efficient tool for simulating quantum circuits due to their capacity to exploit data redundancies in quantum states and quantum operations, enabling the efficient computation of probability amplitudes. However, their application in quantum machine learning (QML) has remained unexplored. This paper introduces variational decision diagrams (VDDs), a novel graph structure that combines the structural benefits of DDs with the adaptability of variational methods for efficiently representing quantum states. We investigate the trainability of VDDs by applying them to the ground state estimation problem for transverse-field Ising and Heisenberg Hamiltonians. Analysis of gradient variance suggests that training VDDs is possible, as no signs of vanishing gradients--also known as barren plateaus--are observed. This work provides new insights into the use of decision diagrams in QML as an alternative to design and train variational ans\"atze.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces variational decision diagrams (VDDs), a graph-based structure that merges decision diagrams' redundancy exploitation with variational parameters for representing quantum states. It applies VDDs to the ground-state estimation task for the transverse-field Ising and Heisenberg Hamiltonians and reports that gradient-variance analysis shows no vanishing gradients (barren plateaus), concluding that VDDs are trainable and offering a new alternative to conventional variational ansätze in quantum machine learning.
Significance. If the reported absence of barren plateaus generalizes, VDDs could provide an efficient, redundancy-aware alternative to standard parameterized quantum circuits for QML tasks, potentially improving both simulation efficiency and trainability on structured problems. The work is credited for exploring decision diagrams in a variational QML setting, an underexplored direction.
major comments (2)
- [Abstract] Abstract (and results section on gradient variance): the central claim that 'training VDDs is possible' because 'no signs of vanishing gradients are observed' rests exclusively on measurements for the transverse-field Ising and Heisenberg models. No general bound on gradient variance, scaling analysis with system size or parameter count, or tests on other Hamiltonians (e.g., 2D lattices, long-range interactions) are supplied, so the extrapolation from these two 1D cases to general trainability is unsupported.
- [Abstract] The weakest assumption identified in the stress-test note is load-bearing: the manuscript treats non-vanishing variance on two specific structured 1D spin chains as evidence that VDDs avoid barren plateaus in general, yet nothing in the supplied analysis rules out variance collapse for different connectivities or larger variational parameter spaces.
minor comments (2)
- Notation for the VDD graph structure and the precise definition of the variational parameters should be introduced with an explicit diagram or pseudocode early in the manuscript to aid reproducibility.
- The abstract states that DDs enable 'efficient computation of probability amplitudes' but does not quantify the complexity gain relative to standard tensor-network or circuit simulators for the VDD case; a brief complexity statement would strengthen the motivation.
Simulated Author's Rebuttal
We thank the referee for the careful review and constructive comments on the scope of our claims. We address each major point below and agree that revisions are needed to clarify the limited scope of the reported results.
read point-by-point responses
-
Referee: [Abstract] Abstract (and results section on gradient variance): the central claim that 'training VDDs is possible' because 'no signs of vanishing gradients are observed' rests exclusively on measurements for the transverse-field Ising and Heisenberg models. No general bound on gradient variance, scaling analysis with system size or parameter count, or tests on other Hamiltonians (e.g., 2D lattices, long-range interactions) are supplied, so the extrapolation from these two 1D cases to general trainability is unsupported.
Authors: We agree that the gradient-variance results are confined to the transverse-field Ising and Heisenberg models and that no general bound, scaling analysis, or tests on other Hamiltonians are provided. The manuscript therefore does not support a claim of general trainability. We will revise the abstract and the relevant results section to state explicitly that no signs of vanishing gradients were observed for these two specific 1D models, removing any implication of broader applicability. revision: yes
-
Referee: [Abstract] The weakest assumption identified in the stress-test note is load-bearing: the manuscript treats non-vanishing variance on two specific structured 1D spin chains as evidence that VDDs avoid barren plateaus in general, yet nothing in the supplied analysis rules out variance collapse for different connectivities or larger variational parameter spaces.
Authors: The analysis is restricted to the two cited 1D spin chains; nothing in the work rules out variance collapse under different connectivities or larger parameter spaces. We will revise the abstract and discussion to make this limitation explicit and to avoid any suggestion that the results establish avoidance of barren plateaus in general. revision: yes
Circularity Check
No circularity: empirical gradient-variance measurements on two Hamiltonians presented directly as evidence
full rationale
The paper's central claim rests on explicit computation of gradient variance for VDDs applied to ground-state estimation of the transverse-field Ising and Heisenberg models, with the observation that variance does not vanish. This is a direct empirical measurement rather than a derivation that reduces by construction to a fitted parameter, self-defined quantity, or self-citation chain. No equations, ansatzes, or uniqueness theorems are invoked that loop back to the paper's own inputs. The limitation to two specific 1D Hamiltonians affects the strength of the generalization but does not constitute circularity under the defined criteria.
Axiom & Free-Parameter Ledger
invented entities (1)
-
Variational decision diagrams (VDDs)
no independent evidence
Reference graph
Works this paper leans on
-
[1]
G. Mazzola, Quantum computing for chemistry and physics applications from a monte carlo perspective, The Journal of Chemical Physics160(2024)
work page 2024
-
[2]
L. Bassman Oftelie, M. Urbanek, M. Metcalf, J. Carter, A. F. Kemper, and W. A. de Jong, Simulating quantum materials with digital quantum computers, Quantum Sci- ence and Technology6, 043002 (2021)
work page 2021
-
[3]
A. Di Meglio, K. Jansen, I. Tavernelli, C. Alexandrou, S. Arunachalam, C. W. Bauer, K. Borras, S. Carrazza, A. Crippa, V. Croft, R. de Putter, A. Delgado, V. Dun- jko, D. J. Egger, E. Fern´ andez-Combarro, E. Fuchs, L. Funcke, D. Gonz´ alez-Cuadra, M. Grossi, J. C. Hal- imeh, Z. Holmes, S. K¨ uhn, D. Lacroix, R. Lewis, D. Luc- chesi, M. L. Martinez, F. Me...
work page 2024
- [4]
- [5]
- [6]
-
[7]
Quantum Convolutional Neural Networks are Effectively Classically Simulable
P. Bermejo, P. Braccia, M. S. Rudolph, Z. Holmes, L. Cincio, and M. Cerezo, Quantum convolutional neu- ral networks are (effectively) classically simulable (2024), arXiv:2408.12739 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2024
- [8]
-
[9]
H. Fargier, P. Marquis, A. Niveau, and N. Schmidt, A knowledge compilation map for ordered real-valued deci- sion diagrams, Proceedings of the AAAI Conference on Artificial Intelligence28, 10.1609/aaai.v28i1.8853 (2014)
-
[10]
M. A. Sistla, S. Chaudhuri, and T. Reps, Cflobdds: Context-free-language ordered binary decision diagrams, ACM Trans. Program. Lang. Syst.46, 10.1145/3651157 (2024)
-
[11]
D. Kisa, G. V. den Broeck, A. Choi, and A. Darwiche, Probabilistic sentential decision diagrams, inProceedings of the Fourteenth International Conference on Princi- ples of Knowledge Representation and Reasoning (KR) (AAAI Press, Vienna, Austria, 2014) pp. 558–567
work page 2014
-
[12]
N. Wilson, Decision diagrams for the computation of semiring valuations, inProceedings of the 19th Inter- national Joint Conference on Artificial Intelligence (IJ- CAI)(Morgan Kaufmann Publishers, Edinburgh, Scot- land, 2005) pp. 331–336
work page 2005
-
[13]
Y. Thierry-Mieg, D. Poitrenaud, A. Hamez, and F. Ko- rdon, Hierarchical set decision diagrams and regular models, inTools and Algorithms for the Construction and Analysis of Systems, edited by S. Kowalewski and A. Philippou (Springer Berlin Heidelberg, Berlin, Hei- delberg, 2009) pp. 1–15
work page 2009
-
[14]
S. Sanner and D. McAllester, Affine algebraic decision diagrams (aadds) and their application to structured probabilistic inference, inProceedings of the 19th Inter- national Joint Conference on Artificial Intelligence (IJ- CAI)(Morgan Kaufmann Publishers, Edinburgh, Scot- land, 2005) pp. 1384–1390
work page 2005
- [15]
-
[16]
Y. Lai, M. Pedram, and S. B. K. Vrudhula, Formal verifi- cation using edge-valued binary decision diagrams, IEEE Transactions on Computers45, 247 (1996)
work page 1996
-
[17]
P. Tafertshofer and M. Pedram, Factored edge-valued bi- nary decision diagrams, Formal Methods in System De- sign10, 243 (1997)
work page 1997
-
[18]
L. Vinkhuijzen, T. Coopmans, and A. Laarman, A knowl- edge compilation map for quantum information (2024), arXiv:2401.01322 [quant-ph]
-
[19]
L. Burgholzer, Y. Stade, T. Peham, and R. Wille, Mqt core: The backbone of the munich quantum toolkit (mqt), Journal of Open Source Software10, 7478 (2025)
work page 2025
-
[20]
D. Miller and M. Thornton, Qmdd: A decision diagram structure for reversible and quantum circuits, in36th International Symposium on Multiple-Valued Logic (IS- MVL’06)(2006) pp. 30–30
work page 2006
-
[21]
M. Sistla, S. Chaudhuri, and T. Reps, Weighted context- free-language ordered binary decision diagrams, Proc. ACM Program. Lang.8, 10.1145/3689760 (2024)
- [22]
-
[23]
S. Brand and A. Laarman, Q-sylvan: A parallel deci- sion diagram package for quantum computing (2025), arXiv:2508.00514 [quant-ph]
-
[24]
Advanced Simulation of Quantum Computations
A. Zulehner and R. Wille, Advanced simulation of quan- tum computations (2018), arXiv:1707.00865 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[25]
S. Hillmich, I. L. Markov, and R. Wille, Just like the real thing: Fast weak simulation of quantum computation, in 2020 57th ACM/IEEE Design Automation Conference (DAC)(IEEE, 2020)
work page 2020
-
[26]
S. Hillmich, R. Kueng, I. L. Markov, and R. Wille, As ac- curate as needed, as efficient as possible: Approximations 9 in dd-based quantum circuit simulation, in2021 Design, Automation & Test in Europe Conference & Exhibition (2021) pp. 188–193
work page 2021
-
[27]
L. Burgholzer, H. Bauer, and R. Wille, Hybrid schr¨ odinger-feynman simulation of quantum circuits with decision diagrams, in2021 IEEE International Confer- ence on Quantum Computing and Engineering (QCE) (IEEE, 2021)
work page 2021
-
[28]
L. Burgholzer, A. Ploier, and R. Wille, Exploiting arbi- trary paths for the simulation of quantum circuits with decision diagrams, in2022 Design, Automation & Test in Europe Conference & Exhibition(2022) pp. 64–67
work page 2022
- [29]
- [30]
-
[31]
H. Qi, S. Xiao, Z. Liu, C. Gong, and A. Gani, Variational quantum algorithms: fundamental concepts, applications and challenges, Quantum Information Processing23, 224 (2024)
work page 2024
-
[32]
J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Bab- bush, and H. Neven, Barren plateaus in quantum neural network training landscapes, Nature Communications9, 4812 (2018)
work page 2018
- [33]
-
[34]
E. Gil-Fuster, C. Gyurik, A. P´ erez-Salinas, and V. Dun- jko, On the relation between trainability and dequan- tization of variational quantum learning models (2024), arXiv:2406.07072 [quant-ph]
-
[35]
G. Carleo and M. Troyer, Solving the quantum many- body problem with artificial neural networks, Science 355, 602 (2017)
work page 2017
-
[36]
F. Verstraete, V. Murg, and J. I. Cirac, Matrix prod- uct states, projected entangled pair states, and varia- tional renormalization group methods for quantum spin systems, Advances in physics57, 143 (2008)
work page 2008
-
[37]
R. Or´ us, A practical introduction to tensor networks: Matrix product states and projected entangled pair states, Annals of Physics349, 117–158 (2014)
work page 2014
-
[38]
U. Schollw¨ ock, The density-matrix renormalization group in the age of matrix product states, Annals of Physics 326, 96–192 (2011)
work page 2011
- [39]
- [40]
-
[41]
PyTorch: An Imperative Style, High-Performance Deep Learning Library
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Brad- bury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. K¨ opf, E. Yang, Z. De- Vito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, Pytorch: An impera- tive style, high-performance deep learning library (2019), arXiv:1912.01703 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[42]
J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. Van- derPlas, S. Wanderman-Milne, and Q. Zhang, JAX: com- posable transformations of Python+NumPy programs (2018)
work page 2018
-
[43]
Wang and Malakooti, On training of artificial neural net- works, inInternational 1989 Joint Conference on Neural Networks(IEEE, 1989) pp. 387–393
work page 1989
-
[44]
E. R. Anschuetz and B. T. Kiani, Quantum variational algorithms are swamped with traps, Nature Communica- tions13, 7760 (2022)
work page 2022
-
[45]
D. P. Kingma and J. Ba, Adam: A method for stochastic optimization (2017), arXiv:1412.6980 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[46]
J. Tindall, M. Fishman, E. M. Stoudenmire, and D. Sels, Efficient tensor network simulation of ibm’s eagle kicked ising experiment, PRX Quantum5, 010308 (2024)
work page 2024
- [47]
- [48]
- [49]
-
[50]
J. Alcazar, M. Ghazi Vakili, C. B. Kalayci, and A. Perdomo-Ortiz, Enhancing combinatorial optimiza- tion with classical and quantum generative models, Na- ture Communications15, 2761 (2024)
work page 2024
-
[51]
F. Vicentini, D. Hofmann, A. Szab´ o, D. Wu, C. Roth, C. Giuliani, G. Pescia, J. Nys, V. Vargas-Calder´ on, N. Astrakhantsev, and G. Carleo, NetKet 3: Machine Learning Toolbox for Many-Body Quantum Systems, Sci- Post Phys. Codebases , 7 (2022)
work page 2022
-
[52]
Ran, Encoding of matrix product states into quan- tum circuits of one- and two-qubit gates, Phys
S.-J. Ran, Encoding of matrix product states into quan- tum circuits of one- and two-qubit gates, Phys. Rev. A 101, 032310 (2020)
work page 2020
-
[53]
M. S. Rudolph, J. Chen, J. Miller, A. Acharya, and A. Perdomo-Ortiz, Decomposition of matrix product states into shallow quantum circuits, Quantum Science and Technology9, 015012 (2023)
work page 2023
-
[54]
R.-Y. Sun, Optimization for deep learning: An overview, Journal of the Operations Research Society of China8, 249 (2020)
work page 2020
-
[55]
V. Vargas-Calder´ on, N. Parra-A., H. Vinck-Posada, and F. A. Gonz´ alez, Many-qudit representation for the travelling salesman problem optimisation, Journal of the Physical Society of Japan90, 114002 (2021), https://doi.org/10.7566/JPSJ.90.114002. 10 Appendix A: V ariational Monte Carlo Variational Monte Carlo (VMC) is a probabilistic al- gorithm for est...
-
[56]
The process of exact sampling is straightforward
Building bit string samples Unlike many neural quantum state architectures that require the Metropolis-Hastings algorithm for sampling [51], exact sampling can be performed directly in VDD, resulting in an unbiased sample [39]. The process of exact sampling is straightforward. Starting at the root node, the value of each qubit is sampled sequentially. The...
-
[57]
Note that the sum in (A3) contains at mostkterms for ak-local observableA
Estimating expected values of observables and their derivatives The expected value of any observableAwith respect to the VDD can be computed as [51] ⟨A⟩=E[ ˜A] = X b p(b) ˜A(b),(A2) where ˜Ais the local estimator ofA, defined as ˜A(b) = X b′ ψθ(b′) ψθ(b) ⟨b|A|b ′⟩,(A3) where the sum runs over all elements of the Hilbert space basis. Note that the sum in (...
-
[58]
Updating variational parameters To close the circle, we are now able to update the pa- rameters in a convenient way so as to minimise the ex- pected value of the HamiltonianH. This can be achieved by several gradient descent algorithms [54], the simpler of which can be given by the stochastic gradient descent update rule: θj ←θ j −η ∂⟨H⟩ ∂θj ,(A7) where t...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.