pith. machine review for the scientific record. sign in

arxiv: 2601.19979 · v2 · submitted 2026-01-27 · ✦ hep-th · cs.LG· quant-ph

Recognition: 2 theorem links

· Lean Theorem

Exploring the holographic entropy cone via reinforcement learning

Authors on Pith no claims yet

Pith reviewed 2026-05-16 10:15 UTC · model grok-4.3

classification ✦ hep-th cs.LGquant-ph
keywords holographic entropy conereinforcement learninggraph realizationsmin-cut entropyentropy inequalitiesN=6AdS/CFT
0
0 comments X

The pith

Reinforcement learning finds graph realizations for three of six mystery extreme rays in the N=6 holographic entropy cone.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The authors train a reinforcement learning algorithm that, given any target entropy vector, searches for a graph whose min-cut entropies reproduce that vector. When no exact match exists the algorithm returns the closest achievable vector, thereby locating the nearest facet of the cone. For N=3 the method recovers the known monogamy inequality from a starting vector outside the cone. For N=6 it locates exact graph realizations for three of the six previously unrealized extreme rays of the subadditivity cone, proving those rays lie inside the holographic entropy cone. The same search fails for the remaining three rays, furnishing evidence that they lie outside the cone and therefore that additional, still-unknown holographic inequalities must exist for six parties.

Core claim

Three of the six mystery extreme rays of the subadditivity cone for N=6 admit graph realizations and are therefore genuine extreme rays of the holographic entropy cone, while the other three do not; this implies that unknown holographic entropy inequalities exist for N=6.

What carries the argument

A reinforcement learning agent that proposes graphs and receives reward based on how closely the min-cut entropies of the graph match a supplied target vector.

If this is right

  • Three previously unidentified rays are now confirmed as extreme rays of the N=6 holographic entropy cone.
  • The N=6 holographic entropy cone is strictly smaller than the subadditivity cone along at least three directions.
  • At least three new holographic inequalities are required to describe the full cone for six parties.
  • The same search procedure can be used to test any candidate vector for membership in the cone.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The algorithm can be run at higher N to generate further candidate inequalities.
  • The three non-realizable rays point to specific new inequalities that could now be derived by hand.
  • Each realized graph supplies a concrete bulk geometry whose boundary entropies saturate the corresponding inequality.

Load-bearing premise

The reinforcement-learning search is exhaustive enough that repeated failure to find a graph means no such graph exists.

What would settle it

An explicit graph whose min-cut entropies exactly equal one of the three rays the algorithm could not realize would show that ray lies inside the cone.

read the original abstract

We develop a reinforcement learning algorithm to study the holographic entropy cone. Given a target entropy vector, our algorithm searches for a graph realization whose min-cut entropies match the target vector. If the target vector does not admit such a graph realization, it must lie outside the cone, in which case the algorithm finds a graph whose corresponding entropy vector most nearly approximates the target and allows us to probe the location of the facets. For the $\sf N=3$ cone, we confirm that our algorithm successfully rediscovers monogamy of mutual information beginning with a target vector outside the holographic entropy cone. We then apply the algorithm to the $\sf N=6$ cone, analyzing the 6 "mystery" extreme rays of the subadditivity cone from arXiv:2412.15364 that satisfy all known holographic entropy inequalities yet lacked graph realizations. We found realizations for 3 of them, proving they are genuine extreme rays of the holographic entropy cone, while providing evidence that the remaining 3 are not realizable, implying unknown holographic inequalities exist for $\sf N=6$.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. This manuscript introduces a reinforcement learning algorithm to search for graph realizations whose min-cut entropies match a target vector in the holographic entropy cone. For N=3 the method recovers the monogamy inequality when initialized outside the cone. For N=6 it is applied to the six mystery extreme rays of the subadditivity cone identified in arXiv:2412.15364; explicit realizations are reported for three rays (proving membership) while repeated failure to realize the remaining three is presented as evidence that they lie outside the cone and that unknown holographic inequalities exist.

Significance. The explicit graph constructions for three rays constitute a concrete advance, furnishing new extreme rays of the N=6 holographic entropy cone and illustrating the practical value of RL for discovery where analytic methods are unavailable. If the negative results can be placed on a firmer footing, they would establish the existence of additional facets and motivate further work on higher-N inequalities. The computational approach itself is a useful addition to the toolkit for exploring the cone.

major comments (2)
  1. [N=6 results] N=6 results section: the assertion that the three unrealized rays lie outside the holographic entropy cone (and therefore imply unknown inequalities) rests entirely on the RL search failing to discover a graph. No completeness guarantee, convergence bound, exhaustive-enumeration baseline for small graphs, or proof that the policy covers all candidate realizations within the chosen search space is supplied; consequently the negative claim is inconclusive and does not yet support the stated implication.
  2. [Methods] Methods section: the state representation, action space, and reward function used when the target vector is unrealizable are described at a level that prevents independent verification of whether the search could systematically miss entire families of graphs; this directly affects the reliability of both the positive and negative outcomes.
minor comments (2)
  1. [Abstract] The abstract and conclusion should more explicitly distinguish the self-certifying positive realizations from the provisional nature of the negative evidence.
  2. Figure captions and table legends would benefit from explicit statements of the graph size and edge-weight bounds employed in the search.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for their careful reading and constructive comments on our manuscript. We have revised the paper to expand the methods description and to qualify our claims on the negative N=6 results. Point-by-point responses follow.

read point-by-point responses
  1. Referee: [N=6 results] N=6 results section: the assertion that the three unrealized rays lie outside the holographic entropy cone (and therefore imply unknown inequalities) rests entirely on the RL search failing to discover a graph. No completeness guarantee, convergence bound, exhaustive-enumeration baseline for small graphs, or proof that the policy covers all candidate realizations within the chosen search space is supplied; consequently the negative claim is inconclusive and does not yet support the stated implication.

    Authors: We agree that the RL search provides no formal completeness guarantee and that the negative results are therefore not conclusive. In the revised manuscript we have added a new paragraph detailing the computational effort (multiple random seeds, varied hyperparameters, and >5000 episodes per ray) and have changed the language from 'providing evidence that the remaining 3 are not realizable' to 'suggesting that the remaining 3 may lie outside the cone, motivating further study of possible new inequalities'. We retain the positive realizations as proofs of membership while treating the negative outcomes as heuristic indications only. revision: partial

  2. Referee: [Methods] Methods section: the state representation, action space, and reward function used when the target vector is unrealizable are described at a level that prevents independent verification of whether the search could systematically miss entire families of graphs; this directly affects the reliability of both the positive and negative outcomes.

    Authors: We have substantially expanded the Methods section with explicit definitions: the state is the pair (current graph adjacency matrix, target entropy vector); the action space consists of adding or removing a single edge (with a hard cap on total edges to bound the search); and the reward is the negative L1 distance to the target, augmented by a fixed penalty when no exact match is found. We also supply pseudocode, the full list of hyperparameters, and the training schedule. These additions allow independent reproduction and assessment of coverage. revision: yes

standing simulated objections not resolved
  • The absence of a theoretical completeness or convergence result for the RL search, which prevents a rigorous proof that the three rays are unrealizable.

Circularity Check

0 steps flagged

No circularity: direct RL search for graph realizations

full rationale

The paper develops a reinforcement learning algorithm that searches for graph realizations matching a target entropy vector via min-cuts. Positive results consist of explicit graph constructions that directly certify membership in the holographic entropy cone. Negative results are reported as repeated failure to locate realizations within the search space. Neither step reduces to a self-definition, a fitted parameter renamed as a prediction, or a load-bearing self-citation; the algorithm is an independent computational procedure whose outputs (found graphs or their absence) stand on their own without circular reduction to the inputs. The N=3 validation against monogamy of mutual information further confirms the method operates externally to the claims being tested.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the standard domain assumption that min-cut entropies on graphs faithfully represent holographic entropy vectors, together with the unproven effectiveness of the reinforcement-learning search.

axioms (1)
  • domain assumption Min-cut entropies on graphs correspond to entropy vectors realizable by holographic geometries
    This is the foundational correspondence used throughout the holographic entropy cone literature.

pith-pipeline@v0.9.0 · 5489 in / 1307 out tokens · 35928 ms · 2026-05-16T10:15:53.331967+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 8 internal anchors

  1. [1]

    Algorithmic construction of SSA-compatible extreme rays of the subadditivity cone and the N = 6 solution,

    T. He, V. E. Hubeny, and M. Rota, “Algorithmic construction of SSA-compatible extreme rays of the subadditivity cone and the N = 6 solution,”JHEP06(2025) 055, arXiv:2412.15364 [quant-ph]

  2. [2]

    The inequalities of quantum information theory,

    N. Pippenger, “The inequalities of quantum information theory,”IEEE Transactions on Information Theory49no. 4, (2003) 773–789

  3. [3]

    The Holographic Entropy Cone

    N. Bao, S. Nezami, H. Ooguri, B. Stoica, J. Sully, and M. Walter, “The Holographic Entropy Cone,”JHEP09(2015) 130,arXiv:1505.07839 [hep-th]

  4. [4]

    Holographic Derivation of Entanglement Entropy from AdS/CFT

    S. Ryu and T. Takayanagi, “Holographic derivation of entanglement entropy from AdS/CFT,”Phys. Rev. Lett.96(2006) 181602,arXiv:hep-th/0603001 [hep-th]

  5. [5]

    Holographic Entropy Relations Repackaged,

    T. He, M. Headrick, and V. E. Hubeny, “Holographic Entropy Relations Repackaged,” JHEP10(2019) 118,arXiv:1905.06985 [hep-th]

  6. [6]

    Superbalance of Holographic Entropy Inequalities,

    T. He, V. E. Hubeny, and M. Rangamani, “Superbalance of Holographic Entropy Inequalities,”JHEP07(2020) 245,arXiv:2002.04558 [hep-th]

  7. [7]

    On the foundations and extremal structure of the holographic entropy cone,

    D. Avis and S. Hern´ andez-Cuenca, “On the foundations and extremal structure of the holographic entropy cone,”Applications Math.328(2023) 16,arXiv:2102.07535 [math.CO]

  8. [8]

    Holographic Cone of Average Entropies,

    B. Czech and S. Shuai, “Holographic Cone of Average Entropies,”Commun. Phys.5 (2022) 244,arXiv:2112.00763 [hep-th]

  9. [9]

    Symmetrized holographic entropy cone,

    M. Fadel and S. Hern´ andez-Cuenca, “Symmetrized holographic entropy cone,”Phys. Rev. D105no. 8, (2022) 086008,arXiv:2112.03862 [quant-ph]

  10. [10]

    The holographic entropy cone from marginal independence,

    S. Hern´ andez-Cuenca, V. E. Hubeny, and M. Rota, “The holographic entropy cone from marginal independence,”JHEP09(2022) 190,arXiv:2204.00075 [hep-th]

  11. [11]

    The quantum marginal independence problem,

    S. Hern´ andez-Cuenca, V. E. Hubeny, M. Rangamani, and M. Rota, “The quantum marginal independence problem,”arXiv:1912.01041 [quant-ph]

  12. [12]

    A holographic inequality for N = 7 regions,

    B. Czech and Y. Wang, “A holographic inequality for N = 7 regions,”JHEP01(2023) 101,arXiv:2209.10547 [hep-th]

  13. [13]

    Holographic Entropy Inequalities and Multipartite Entanglement,

    S. Hern´ andez-Cuenca, V. E. Hubeny, and F. Jia, “Holographic Entropy Inequalities and Multipartite Entanglement,”arXiv:2309.06296 [hep-th]

  14. [14]

    Two infinite families of facets of the holographic entropy cone,

    B. Czech, Y. Liu, and B. Yu, “Two infinite families of facets of the holographic entropy cone,”SciPost Phys.17(2024) 084,arXiv:2401.13029 [hep-th]

  15. [15]

    Properties of the contraction map for holographic entanglement entropy inequalities,

    N. Bao and J. Naskar, “Properties of the contraction map for holographic entanglement entropy inequalities,”JHEP06(2024) 039,arXiv:2403.13283 [hep-th]

  16. [16]

    Towards a complete classification of holographic entropy inequalities,

    N. Bao, K. Furuya, and J. Naskar, “Towards a complete classification of holographic entropy inequalities,”JHEP03(2025) 117,arXiv:2409.17317 [hep-th]. – 35 –

  17. [17]

    On the completeness of contraction map proof method for holographic entropy inequalities,

    N. Bao, K. Furuya, and J. Naskar, “On the completeness of contraction map proof method for holographic entropy inequalities,”arXiv:2506.18086 [hep-th]

  18. [18]

    A new characterization of the holographic entropy cone

    G. Grimaldi, M. Headrick, and V. E. Hubeny, “A new characterization of the holographic entropy cone,”arXiv:2508.21823 [hep-th]

  19. [19]

    Renormalization Group is the principle behind the Holographic Entropy Cone,

    B. Czech and S. Shuai, “Renormalization Group is the principle behind the Holographic Entropy Cone,”arXiv:2601.02472 [hep-th]

  20. [20]

    Combinatorial properties of holographic entropy inequalities,

    G. Grimaldi, M. Headrick, V. E. Hubeny, and P. Shteyner, “Combinatorial properties of holographic entropy inequalities,”arXiv:2601.09987 [hep-th]

  21. [21]

    Holographic entropy inequalities pass the majorization test,

    B. Czech, Y. Feng, X. Wu, and M. Xie, “Holographic entropy inequalities pass the majorization test,”arXiv:2601.09989 [hep-th]

  22. [22]

    Bit threads and holographic entanglement

    M. Freedman and M. Headrick, “Bit threads and holographic entanglement,” Commun. Math. Phys.352no. 1, (2017) 407–438,arXiv:1604.00354 [hep-th]

  23. [23]

    Riemannian and Lorentzian flow-cut theorems

    M. Headrick and V. E. Hubeny, “Riemannian and Lorentzian flow-cut theorems,” Class. Quant. Grav.35no. 10, (2018) 10,arXiv:1710.09516 [hep-th]

  24. [24]

    Bit Threads and Holographic Monogamy,

    S. X. Cui, P. Hayden, T. He, M. Headrick, B. Stoica, and M. Walter, “Bit Threads and Holographic Monogamy,”Commun. Math. Phys.376no. 1, (2019) 609–648, arXiv:1808.05234 [hep-th]

  25. [25]

    Bit threads and holographic entanglement of purification,

    J. Harper and M. Headrick, “Bit threads and holographic entanglement of purification,”JHEP08(2019) 101,arXiv:1906.05970 [hep-th]

  26. [26]

    Crossing Versus Locking: Bit Threads and Continuum Multiflows,

    M. Headrick, J. Held, and J. Herman, “Crossing Versus Locking: Bit Threads and Continuum Multiflows,”Commun. Math. Phys.396no. 1, (2022) 265–313, arXiv:2008.03197 [hep-th]

  27. [27]

    Covariant bit threads,

    M. Headrick and V. E. Hubeny, “Covariant bit threads,”JHEP07(2023) 180, arXiv:2208.10507 [hep-th]

  28. [28]

    The Quantum Entropy Cone of Hypergraphs,

    N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “The Quantum Entropy Cone of Hypergraphs,”SciPost Phys.9no. 5, (2020) 5,arXiv:2002.05317 [quant-ph]

  29. [29]

    Topological Link Models of Multipartite Entanglement,

    N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “Topological Link Models of Multipartite Entanglement,”Quantum6(2022) 741,arXiv:2109.01150 [quant-ph]

  30. [30]

    Hypergraph min-cuts from quantum entropies,

    M. Walter and F. Witteveen, “Hypergraph min-cuts from quantum entropies,”J. Math. Phys.62no. 9, (2021) 092203,arXiv:2002.12397 [quant-ph]

  31. [31]

    A Gap Between the Hypergraph and Stabilizer Entropy Cones,

    N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “A Gap Between the Hypergraph and Stabilizer Entropy Cones,”arXiv:2006.16292 [quant-ph]

  32. [32]

    A gap between holographic and quantum mechanical extreme rays of the subadditivity cone,

    T. He, V. E. Hubeny, and M. Rota, “A gap between holographic and quantum mechanical extreme rays of the subadditivity cone,”arXiv:2307.10137 [hep-th]. – 36 –

  33. [33]

    Inner bounding the quantum entropy cone with subadditivity and subsystem coarse-grainings,

    T. He, V. E. Hubeny, and M. Rota, “Inner bounding the quantum entropy cone with subadditivity and subsystem coarse-grainings,”arXiv:2312.04074 [quant-ph]

  34. [34]

    Beyond the Holographic Entropy Cone via Cycle Flows,

    T. He, S. Hern´ andez-Cuenca, and C. Keeler, “Beyond the Holographic Entropy Cone via Cycle Flows,”Commun. Math. Phys.405no. 11, (2024) 252,arXiv:2312.10137 [hep-th]

  35. [35]

    Holographic Mutual Information is Monogamous

    P. Hayden, M. Headrick, and A. Maloney, “Holographic Mutual Information is Monogamous,”Phys. Rev.D87no. 4, (2013) 046003,arXiv:1107.2940 [hep-th]

  36. [36]

    On the relation between the subadditivity cone and the quantum entropy cone,

    T. He, V. E. Hubeny, and M. Rota, “On the relation between the subadditivity cone and the quantum entropy cone,”JHEP08(2023) 018,arXiv:2211.11858 [hep-th]

  37. [37]

    Correlation hypergraph: a new representation of a quantum marginal independence pattern,

    V. E. Hubeny and M. Rota, “Correlation hypergraph: a new representation of a quantum marginal independence pattern,”JHEP09(2025) 080,arXiv:2412.18018 [hep-th]

  38. [38]

    Mastering the game of go with deep neural networks and tree search,

    D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, “Mastering the game of go with deep neural networks and tree search,”Nature529(2016...

  39. [39]

    Highly accurate protein structure prediction with alphafold,

    J. Jumper, R. Evans, A. Pritzel, T. Green, M. Figurnov, O. Ronneberger, K. Tunyasuvunakool, R. Bates, A. ˇZ´ ıdek, A. Potapenko, A. Bridgland, C. Meyer, S. A. A. Kohl, A. Ballard, A. Cowie, B. Romera-Paredes, S. Nikolov, R. Jain, J. Adler, T. Back, S. Petersen, D. Reiman, E. Clancy, M. Zielinski, M. Steinegger, M. Pacholska, T. Berghammer, S. Bodenstein, ...

  40. [40]

    Discovering faster matrix multiplication algorithms with reinforcement learning,

    A. Fawzi, M. Balog, A. Huang, T. Hubert, B. Romera-Paredes, M. Barekatain, A. Novikov, F. J. R. Ruiz, J. Schrittwieser, G. Swirszcz, D. Silver, D. Hassabis, and P. Kohli, “Discovering faster matrix multiplication algorithms with reinforcement learning,”Nature610(2022) 47–53

  41. [41]

    Branes with Brains: Exploring String Vacua with Deep Reinforcement Learning

    J. Halverson, B. Nelson, and F. Ruehle, “Branes with brains: exploring string vacua with deep reinforcement learning,”JHEP06(2019) 003,arXiv:1903.11616 [hep-th]

  42. [42]

    Learning to unknot,

    S. Gukov, J. Halverson, F. Ruehle, and P. Sulkowski, “Learning to unknot,”Mach. Learn.: Sci. Technol.2(2021) 025035,arXiv:2010.16263

  43. [43]

    Navigating the Quantum Resource Landscape of Entropy Vector Space Using Machine Learning and Optimization,

    N. Khumalo, A. Mehta, W. Munizzi, and P. Narang, “Navigating the Quantum Resource Landscape of Entropy Vector Space Using Machine Learning and Optimization,”arXiv:2511.16724 [quant-ph]. – 37 –

  44. [44]

    Finite entropy sums in quantum field theory,

    M. Van Raamsdonk, “Finite entropy sums in quantum field theory,” arXiv:2508.21276 [hep-th]

  45. [45]

    Going Beyond Bell's Theorem

    D. M. Greenberger, M. A. Horne, and A. Zeilinger, “Going Beyond Bell’s Theorem,” Fundam. Theor. Phys.37(1989) 69–72,arXiv:0712.0921 [quant-ph]

  46. [46]

    On the construction of graph models realizing given entropy vectors,

    V. E. Hubeny and M. Rota, “On the construction of graph models realizing given entropy vectors,”arXiv:2512.18702 [hep-th]

  47. [47]

    Necessary and sufficient conditions for entropy vector realizability by holographic simple tree graph models,

    V. E. Hubeny and M. Rota, “Necessary and sufficient conditions for entropy vector realizability by holographic simple tree graph models,”arXiv:2512.24490 [hep-th]. – 38 –