Recognition: 2 theorem links
· Lean TheoremExploring the holographic entropy cone via reinforcement learning
Pith reviewed 2026-05-16 10:15 UTC · model grok-4.3
The pith
Reinforcement learning finds graph realizations for three of six mystery extreme rays in the N=6 holographic entropy cone.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Three of the six mystery extreme rays of the subadditivity cone for N=6 admit graph realizations and are therefore genuine extreme rays of the holographic entropy cone, while the other three do not; this implies that unknown holographic entropy inequalities exist for N=6.
What carries the argument
A reinforcement learning agent that proposes graphs and receives reward based on how closely the min-cut entropies of the graph match a supplied target vector.
If this is right
- Three previously unidentified rays are now confirmed as extreme rays of the N=6 holographic entropy cone.
- The N=6 holographic entropy cone is strictly smaller than the subadditivity cone along at least three directions.
- At least three new holographic inequalities are required to describe the full cone for six parties.
- The same search procedure can be used to test any candidate vector for membership in the cone.
Where Pith is reading between the lines
- The algorithm can be run at higher N to generate further candidate inequalities.
- The three non-realizable rays point to specific new inequalities that could now be derived by hand.
- Each realized graph supplies a concrete bulk geometry whose boundary entropies saturate the corresponding inequality.
Load-bearing premise
The reinforcement-learning search is exhaustive enough that repeated failure to find a graph means no such graph exists.
What would settle it
An explicit graph whose min-cut entropies exactly equal one of the three rays the algorithm could not realize would show that ray lies inside the cone.
read the original abstract
We develop a reinforcement learning algorithm to study the holographic entropy cone. Given a target entropy vector, our algorithm searches for a graph realization whose min-cut entropies match the target vector. If the target vector does not admit such a graph realization, it must lie outside the cone, in which case the algorithm finds a graph whose corresponding entropy vector most nearly approximates the target and allows us to probe the location of the facets. For the $\sf N=3$ cone, we confirm that our algorithm successfully rediscovers monogamy of mutual information beginning with a target vector outside the holographic entropy cone. We then apply the algorithm to the $\sf N=6$ cone, analyzing the 6 "mystery" extreme rays of the subadditivity cone from arXiv:2412.15364 that satisfy all known holographic entropy inequalities yet lacked graph realizations. We found realizations for 3 of them, proving they are genuine extreme rays of the holographic entropy cone, while providing evidence that the remaining 3 are not realizable, implying unknown holographic inequalities exist for $\sf N=6$.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. This manuscript introduces a reinforcement learning algorithm to search for graph realizations whose min-cut entropies match a target vector in the holographic entropy cone. For N=3 the method recovers the monogamy inequality when initialized outside the cone. For N=6 it is applied to the six mystery extreme rays of the subadditivity cone identified in arXiv:2412.15364; explicit realizations are reported for three rays (proving membership) while repeated failure to realize the remaining three is presented as evidence that they lie outside the cone and that unknown holographic inequalities exist.
Significance. The explicit graph constructions for three rays constitute a concrete advance, furnishing new extreme rays of the N=6 holographic entropy cone and illustrating the practical value of RL for discovery where analytic methods are unavailable. If the negative results can be placed on a firmer footing, they would establish the existence of additional facets and motivate further work on higher-N inequalities. The computational approach itself is a useful addition to the toolkit for exploring the cone.
major comments (2)
- [N=6 results] N=6 results section: the assertion that the three unrealized rays lie outside the holographic entropy cone (and therefore imply unknown inequalities) rests entirely on the RL search failing to discover a graph. No completeness guarantee, convergence bound, exhaustive-enumeration baseline for small graphs, or proof that the policy covers all candidate realizations within the chosen search space is supplied; consequently the negative claim is inconclusive and does not yet support the stated implication.
- [Methods] Methods section: the state representation, action space, and reward function used when the target vector is unrealizable are described at a level that prevents independent verification of whether the search could systematically miss entire families of graphs; this directly affects the reliability of both the positive and negative outcomes.
minor comments (2)
- [Abstract] The abstract and conclusion should more explicitly distinguish the self-certifying positive realizations from the provisional nature of the negative evidence.
- Figure captions and table legends would benefit from explicit statements of the graph size and edge-weight bounds employed in the search.
Simulated Author's Rebuttal
We thank the referee for their careful reading and constructive comments on our manuscript. We have revised the paper to expand the methods description and to qualify our claims on the negative N=6 results. Point-by-point responses follow.
read point-by-point responses
-
Referee: [N=6 results] N=6 results section: the assertion that the three unrealized rays lie outside the holographic entropy cone (and therefore imply unknown inequalities) rests entirely on the RL search failing to discover a graph. No completeness guarantee, convergence bound, exhaustive-enumeration baseline for small graphs, or proof that the policy covers all candidate realizations within the chosen search space is supplied; consequently the negative claim is inconclusive and does not yet support the stated implication.
Authors: We agree that the RL search provides no formal completeness guarantee and that the negative results are therefore not conclusive. In the revised manuscript we have added a new paragraph detailing the computational effort (multiple random seeds, varied hyperparameters, and >5000 episodes per ray) and have changed the language from 'providing evidence that the remaining 3 are not realizable' to 'suggesting that the remaining 3 may lie outside the cone, motivating further study of possible new inequalities'. We retain the positive realizations as proofs of membership while treating the negative outcomes as heuristic indications only. revision: partial
-
Referee: [Methods] Methods section: the state representation, action space, and reward function used when the target vector is unrealizable are described at a level that prevents independent verification of whether the search could systematically miss entire families of graphs; this directly affects the reliability of both the positive and negative outcomes.
Authors: We have substantially expanded the Methods section with explicit definitions: the state is the pair (current graph adjacency matrix, target entropy vector); the action space consists of adding or removing a single edge (with a hard cap on total edges to bound the search); and the reward is the negative L1 distance to the target, augmented by a fixed penalty when no exact match is found. We also supply pseudocode, the full list of hyperparameters, and the training schedule. These additions allow independent reproduction and assessment of coverage. revision: yes
- The absence of a theoretical completeness or convergence result for the RL search, which prevents a rigorous proof that the three rays are unrealizable.
Circularity Check
No circularity: direct RL search for graph realizations
full rationale
The paper develops a reinforcement learning algorithm that searches for graph realizations matching a target entropy vector via min-cuts. Positive results consist of explicit graph constructions that directly certify membership in the holographic entropy cone. Negative results are reported as repeated failure to locate realizations within the search space. Neither step reduces to a self-definition, a fitted parameter renamed as a prediction, or a load-bearing self-citation; the algorithm is an independent computational procedure whose outputs (found graphs or their absence) stand on their own without circular reduction to the inputs. The N=3 validation against monogamy of mutual information further confirms the method operates externally to the claims being tested.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Min-cut entropies on graphs correspond to entropy vectors realizable by holographic geometries
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
reinforcement learning algorithm ... searches for a graph realization whose min-cut entropies match the target vector ... reward ... cosine similarity
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
N=6 mystery extreme rays of the subadditivity cone
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
T. He, V. E. Hubeny, and M. Rota, “Algorithmic construction of SSA-compatible extreme rays of the subadditivity cone and the N = 6 solution,”JHEP06(2025) 055, arXiv:2412.15364 [quant-ph]
-
[2]
The inequalities of quantum information theory,
N. Pippenger, “The inequalities of quantum information theory,”IEEE Transactions on Information Theory49no. 4, (2003) 773–789
work page 2003
-
[3]
N. Bao, S. Nezami, H. Ooguri, B. Stoica, J. Sully, and M. Walter, “The Holographic Entropy Cone,”JHEP09(2015) 130,arXiv:1505.07839 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[4]
Holographic Derivation of Entanglement Entropy from AdS/CFT
S. Ryu and T. Takayanagi, “Holographic derivation of entanglement entropy from AdS/CFT,”Phys. Rev. Lett.96(2006) 181602,arXiv:hep-th/0603001 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[5]
Holographic Entropy Relations Repackaged,
T. He, M. Headrick, and V. E. Hubeny, “Holographic Entropy Relations Repackaged,” JHEP10(2019) 118,arXiv:1905.06985 [hep-th]
-
[6]
Superbalance of Holographic Entropy Inequalities,
T. He, V. E. Hubeny, and M. Rangamani, “Superbalance of Holographic Entropy Inequalities,”JHEP07(2020) 245,arXiv:2002.04558 [hep-th]
-
[7]
On the foundations and extremal structure of the holographic entropy cone,
D. Avis and S. Hern´ andez-Cuenca, “On the foundations and extremal structure of the holographic entropy cone,”Applications Math.328(2023) 16,arXiv:2102.07535 [math.CO]
-
[8]
Holographic Cone of Average Entropies,
B. Czech and S. Shuai, “Holographic Cone of Average Entropies,”Commun. Phys.5 (2022) 244,arXiv:2112.00763 [hep-th]
-
[9]
Symmetrized holographic entropy cone,
M. Fadel and S. Hern´ andez-Cuenca, “Symmetrized holographic entropy cone,”Phys. Rev. D105no. 8, (2022) 086008,arXiv:2112.03862 [quant-ph]
-
[10]
The holographic entropy cone from marginal independence,
S. Hern´ andez-Cuenca, V. E. Hubeny, and M. Rota, “The holographic entropy cone from marginal independence,”JHEP09(2022) 190,arXiv:2204.00075 [hep-th]
-
[11]
The quantum marginal independence problem,
S. Hern´ andez-Cuenca, V. E. Hubeny, M. Rangamani, and M. Rota, “The quantum marginal independence problem,”arXiv:1912.01041 [quant-ph]
-
[12]
A holographic inequality for N = 7 regions,
B. Czech and Y. Wang, “A holographic inequality for N = 7 regions,”JHEP01(2023) 101,arXiv:2209.10547 [hep-th]
-
[13]
Holographic Entropy Inequalities and Multipartite Entanglement,
S. Hern´ andez-Cuenca, V. E. Hubeny, and F. Jia, “Holographic Entropy Inequalities and Multipartite Entanglement,”arXiv:2309.06296 [hep-th]
-
[14]
Two infinite families of facets of the holographic entropy cone,
B. Czech, Y. Liu, and B. Yu, “Two infinite families of facets of the holographic entropy cone,”SciPost Phys.17(2024) 084,arXiv:2401.13029 [hep-th]
-
[15]
Properties of the contraction map for holographic entanglement entropy inequalities,
N. Bao and J. Naskar, “Properties of the contraction map for holographic entanglement entropy inequalities,”JHEP06(2024) 039,arXiv:2403.13283 [hep-th]
-
[16]
Towards a complete classification of holographic entropy inequalities,
N. Bao, K. Furuya, and J. Naskar, “Towards a complete classification of holographic entropy inequalities,”JHEP03(2025) 117,arXiv:2409.17317 [hep-th]. – 35 –
-
[17]
On the completeness of contraction map proof method for holographic entropy inequalities,
N. Bao, K. Furuya, and J. Naskar, “On the completeness of contraction map proof method for holographic entropy inequalities,”arXiv:2506.18086 [hep-th]
-
[18]
A new characterization of the holographic entropy cone
G. Grimaldi, M. Headrick, and V. E. Hubeny, “A new characterization of the holographic entropy cone,”arXiv:2508.21823 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv
-
[19]
Renormalization Group is the principle behind the Holographic Entropy Cone,
B. Czech and S. Shuai, “Renormalization Group is the principle behind the Holographic Entropy Cone,”arXiv:2601.02472 [hep-th]
-
[20]
Combinatorial properties of holographic entropy inequalities,
G. Grimaldi, M. Headrick, V. E. Hubeny, and P. Shteyner, “Combinatorial properties of holographic entropy inequalities,”arXiv:2601.09987 [hep-th]
-
[21]
Holographic entropy inequalities pass the majorization test,
B. Czech, Y. Feng, X. Wu, and M. Xie, “Holographic entropy inequalities pass the majorization test,”arXiv:2601.09989 [hep-th]
-
[22]
Bit threads and holographic entanglement
M. Freedman and M. Headrick, “Bit threads and holographic entanglement,” Commun. Math. Phys.352no. 1, (2017) 407–438,arXiv:1604.00354 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[23]
Riemannian and Lorentzian flow-cut theorems
M. Headrick and V. E. Hubeny, “Riemannian and Lorentzian flow-cut theorems,” Class. Quant. Grav.35no. 10, (2018) 10,arXiv:1710.09516 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[24]
Bit Threads and Holographic Monogamy,
S. X. Cui, P. Hayden, T. He, M. Headrick, B. Stoica, and M. Walter, “Bit Threads and Holographic Monogamy,”Commun. Math. Phys.376no. 1, (2019) 609–648, arXiv:1808.05234 [hep-th]
-
[25]
Bit threads and holographic entanglement of purification,
J. Harper and M. Headrick, “Bit threads and holographic entanglement of purification,”JHEP08(2019) 101,arXiv:1906.05970 [hep-th]
-
[26]
Crossing Versus Locking: Bit Threads and Continuum Multiflows,
M. Headrick, J. Held, and J. Herman, “Crossing Versus Locking: Bit Threads and Continuum Multiflows,”Commun. Math. Phys.396no. 1, (2022) 265–313, arXiv:2008.03197 [hep-th]
-
[27]
M. Headrick and V. E. Hubeny, “Covariant bit threads,”JHEP07(2023) 180, arXiv:2208.10507 [hep-th]
-
[28]
The Quantum Entropy Cone of Hypergraphs,
N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “The Quantum Entropy Cone of Hypergraphs,”SciPost Phys.9no. 5, (2020) 5,arXiv:2002.05317 [quant-ph]
-
[29]
Topological Link Models of Multipartite Entanglement,
N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “Topological Link Models of Multipartite Entanglement,”Quantum6(2022) 741,arXiv:2109.01150 [quant-ph]
-
[30]
Hypergraph min-cuts from quantum entropies,
M. Walter and F. Witteveen, “Hypergraph min-cuts from quantum entropies,”J. Math. Phys.62no. 9, (2021) 092203,arXiv:2002.12397 [quant-ph]
-
[31]
A Gap Between the Hypergraph and Stabilizer Entropy Cones,
N. Bao, N. Cheng, S. Hern´ andez-Cuenca, and V. P. Su, “A Gap Between the Hypergraph and Stabilizer Entropy Cones,”arXiv:2006.16292 [quant-ph]
-
[32]
A gap between holographic and quantum mechanical extreme rays of the subadditivity cone,
T. He, V. E. Hubeny, and M. Rota, “A gap between holographic and quantum mechanical extreme rays of the subadditivity cone,”arXiv:2307.10137 [hep-th]. – 36 –
-
[33]
Inner bounding the quantum entropy cone with subadditivity and subsystem coarse-grainings,
T. He, V. E. Hubeny, and M. Rota, “Inner bounding the quantum entropy cone with subadditivity and subsystem coarse-grainings,”arXiv:2312.04074 [quant-ph]
-
[34]
Beyond the Holographic Entropy Cone via Cycle Flows,
T. He, S. Hern´ andez-Cuenca, and C. Keeler, “Beyond the Holographic Entropy Cone via Cycle Flows,”Commun. Math. Phys.405no. 11, (2024) 252,arXiv:2312.10137 [hep-th]
-
[35]
Holographic Mutual Information is Monogamous
P. Hayden, M. Headrick, and A. Maloney, “Holographic Mutual Information is Monogamous,”Phys. Rev.D87no. 4, (2013) 046003,arXiv:1107.2940 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[36]
On the relation between the subadditivity cone and the quantum entropy cone,
T. He, V. E. Hubeny, and M. Rota, “On the relation between the subadditivity cone and the quantum entropy cone,”JHEP08(2023) 018,arXiv:2211.11858 [hep-th]
-
[37]
Correlation hypergraph: a new representation of a quantum marginal independence pattern,
V. E. Hubeny and M. Rota, “Correlation hypergraph: a new representation of a quantum marginal independence pattern,”JHEP09(2025) 080,arXiv:2412.18018 [hep-th]
-
[38]
Mastering the game of go with deep neural networks and tree search,
D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, “Mastering the game of go with deep neural networks and tree search,”Nature529(2016...
work page 2016
-
[39]
Highly accurate protein structure prediction with alphafold,
J. Jumper, R. Evans, A. Pritzel, T. Green, M. Figurnov, O. Ronneberger, K. Tunyasuvunakool, R. Bates, A. ˇZ´ ıdek, A. Potapenko, A. Bridgland, C. Meyer, S. A. A. Kohl, A. Ballard, A. Cowie, B. Romera-Paredes, S. Nikolov, R. Jain, J. Adler, T. Back, S. Petersen, D. Reiman, E. Clancy, M. Zielinski, M. Steinegger, M. Pacholska, T. Berghammer, S. Bodenstein, ...
work page 2021
-
[40]
Discovering faster matrix multiplication algorithms with reinforcement learning,
A. Fawzi, M. Balog, A. Huang, T. Hubert, B. Romera-Paredes, M. Barekatain, A. Novikov, F. J. R. Ruiz, J. Schrittwieser, G. Swirszcz, D. Silver, D. Hassabis, and P. Kohli, “Discovering faster matrix multiplication algorithms with reinforcement learning,”Nature610(2022) 47–53
work page 2022
-
[41]
Branes with Brains: Exploring String Vacua with Deep Reinforcement Learning
J. Halverson, B. Nelson, and F. Ruehle, “Branes with brains: exploring string vacua with deep reinforcement learning,”JHEP06(2019) 003,arXiv:1903.11616 [hep-th]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[42]
S. Gukov, J. Halverson, F. Ruehle, and P. Sulkowski, “Learning to unknot,”Mach. Learn.: Sci. Technol.2(2021) 025035,arXiv:2010.16263
-
[43]
N. Khumalo, A. Mehta, W. Munizzi, and P. Narang, “Navigating the Quantum Resource Landscape of Entropy Vector Space Using Machine Learning and Optimization,”arXiv:2511.16724 [quant-ph]. – 37 –
-
[44]
Finite entropy sums in quantum field theory,
M. Van Raamsdonk, “Finite entropy sums in quantum field theory,” arXiv:2508.21276 [hep-th]
-
[45]
D. M. Greenberger, M. A. Horne, and A. Zeilinger, “Going Beyond Bell’s Theorem,” Fundam. Theor. Phys.37(1989) 69–72,arXiv:0712.0921 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 1989
-
[46]
On the construction of graph models realizing given entropy vectors,
V. E. Hubeny and M. Rota, “On the construction of graph models realizing given entropy vectors,”arXiv:2512.18702 [hep-th]
-
[47]
V. E. Hubeny and M. Rota, “Necessary and sufficient conditions for entropy vector realizability by holographic simple tree graph models,”arXiv:2512.24490 [hep-th]. – 38 –
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.