Computable Fairness: Boltzmann-Softmax Control for AI Resource Allocation

Chae Un Kim; Ji-Won Park

arxiv: 2605.22827 · v1 · pith:QEUWY7VCnew · submitted 2026-04-12 · ⚛️ physics.app-ph · cs.AI· cs.MA· cs.PF

Computable Fairness: Boltzmann-Softmax Control for AI Resource Allocation

Ji-Won Park , Chae Un Kim This is my paper

Pith reviewed 2026-05-25 00:32 UTC · model grok-4.3

classification ⚛️ physics.app-ph cs.AIcs.MAcs.PF

keywords fair divisionresource allocationBoltzmann-Softmaxadaptive controlAI systemsdominance concentrationefficiency-fairness tradeoff

0 comments

The pith

Redefining the inverse temperature beta in Boltzmann-Softmax turns it into a control variable for balancing efficiency and fairness in AI resource allocation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Computable Fair Division, a framework that treats the Boltzmann-Softmax function as a way to allocate resources probabilistically among AI agents. It shows how adjusting the parameter beta can manage the tradeoff between overall efficiency and preventing any single agent from dominating. In dynamic conditions, an adaptive controller called AHC++ uses real-time feedback on dominance levels to keep the system on target. Simulations indicate this approach handles sudden changes without big losses in total performance. This matters because unchecked dominance in large systems can reduce diversity and stability.

Core claim

The central claim is that the Boltzmann-Softmax function can be reinterpreted as a probabilistic resource allocation mechanism, with its inverse temperature parameter beta serving as a computable control variable that governs the efficiency-fairness balance. Static analysis identifies a Pareto frontier containing a near-optimal Stability Corridor where total loss stays approximately constant. In the dynamic setting, the AHC++ controller updates beta in real time based on the error between observed dominance concentration and a specified target, and simulations confirm it suppresses extreme dominance under exogenous shocks while tracking fairness targets without substantial throughput degr

What carries the argument

The AHC++ (Adaptive Hard-Cap Controller++) which updates the inverse temperature beta in real time using dominance error feedback to maintain fairness targets.

Load-bearing premise

Dominance concentration can be measured accurately and in real time to provide reliable error feedback for adjusting beta.

What would settle it

A simulation or deployment where dominance measurement has realistic noise or delay, and the system either fails to track targets or becomes unstable.

Figures

Figures reproduced from arXiv: 2605.22827 by Chae Un Kim, Ji-Won Park.

**Figure 1.** Figure 1: CFD framework: Boltzmann–Softmax feedback control architecture. Static model (left) derives optimal β and target dominance. Dynamic model (right) forms a closed-loop with AHC++ updating β in real time. 2.1. Variable Mapping: From Economics to AI Engineering The core elements of traditional income distribution models—individual, income, and capability—are redefined in multi-agent system (MAS) and federated … view at source ↗

**Figure 2.** Figure 2: Variation of Boltzmann–Softmax allocation probabilities with inverse temperature β. Concentration is continuously regulated by β; the dashed line is the Zipf-slope reference [PITH_FULL_IMAGE:figures/full_fig_p023_2.png] view at source ↗

**Figure 3.** Figure 3: Static loss components and λ-weighted total loss vs. β. The minimum-loss β shifts with policy weight λ [PITH_FULL_IMAGE:figures/full_fig_p024_3.png] view at source ↗

**Figure 4.** Figure 4: Total loss heatmap and Stability Corridor in the (β, λ) plane. Black dashed: optimal path β*(λ). Yellow lines: corridor boundaries [PITH_FULL_IMAGE:figures/full_fig_p025_4.png] view at source ↗

**Figure 5.** Figure 5: Pareto frontier, near-optimal cloud, and post-shock trajectories. (a) ★ = β*(λ = 0.60). (b) AHC++ recovers to stability zone; fixed policy collapses (×) [PITH_FULL_IMAGE:figures/full_fig_p026_5.png] view at source ↗

**Figure 6.** Figure 6: Efficiency–fairness frontier generated by the Boltzmann–Softmax rule. As β varies, attainable states trace a continuous trade-off curve; the dashed line shows top-1 dominance. 3.2.2 Dynamic Model Results (Structure Only) [PITH_FULL_IMAGE:figures/full_fig_p027_6.png] view at source ↗

**Figure 7.** Figure 7: Smoothed top-1 dominance under time-varying events. Bold dashed: policy target. Dotted: effective target (AHC++). Red dashed (right axis): K(t) [PITH_FULL_IMAGE:figures/full_fig_p028_7.png] view at source ↗

**Figure 8.** Figure 8: directly compares FixedBetaSoftmax and AHC++. Both start with low dominance, but after burst and policy change, the fixed-β policy cannot adapt. During abuse, FixedBetaSoftmax sharply exceeds the target; AHC++ suppresses both magnitude and duration of violations. After capacity reduction, AHC++ readjusts and reconverges below the target [PITH_FULL_IMAGE:figures/full_fig_p028_8.png] view at source ↗

**Figure 9.** Figure 9: β(t) trajectory of AHC++ under time-varying λ(t). Vertical lines: event onsets [PITH_FULL_IMAGE:figures/full_fig_p029_9.png] view at source ↗

**Figure 10.** Figure 10: Backlog dynamics across policies under time-varying events [PITH_FULL_IMAGE:figures/full_fig_p030_10.png] view at source ↗

**Figure 11.** Figure 11: Cumulative dominance constraint exceedance (AUC) vs. burst factor in the stable interval. AHC++ nearly eliminates violations; FixedBetaSoftmax remains constant (error bars: mean ± 95% CI). Taken together, the results from Figures 7–11 show that under dynamic disturbances, each policy exhibits a different balance between robustness of dominance constraint compliance and service quality (throughput and late… view at source ↗

**Figure 12.** Figure 12: Computational scalability of a single allocation step (N = 102 to 104 ). Proposed mechanism: ~5.5× for 100× agents. Pairwise dummy: O(N2 ). 4. Discussion This work reformulated the resource allocation problem in AI systems from the perspective of statistical physics, and demonstrated through simulation that the Boltzmann– Softmax allocation rule combined with the AHC++ control algorithm can suppress domin… view at source ↗

read the original abstract

In large-scale AI systems, allocating scarce resources such as GPU compute time and bandwidth among multiple agents is a critical challenge. Conventional policies focus on efficiency metrics, potentially leading to dominance concentration that undermines system diversity and stability. We propose Computable Fair Division (CFD), a framework that reinterprets the Boltzmann-Softmax function not as a selection tool but as a probabilistic resource allocation mechanism, redefining the inverse temperature parameter $\beta$ as a computable control variable governing the efficiency-fairness balance. Static analysis reveals a Pareto frontier with a near-optimal Stability Corridor where total loss remains approximately constant across policy weights. In the dynamic setting, AHC++ (Adaptive Hard-Cap Controller++) updates $\beta$ in real time using the error between observed dominance and a policy-specified target as feedback. Simulations show that AHC++ suppresses extreme dominance concentration under exogenous shocks while tracking fairness targets without substantial throughput degradation. Scalability analysis confirms that a 100x increase in agents yields only approximately 5.5x increase in execution time. Code: https://github.com/entrofy-ai/computable-fairness

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Boltzmann-Softmax gets recast as a tunable resource allocator with a feedback controller on top, but the dynamic claims rest on thin simulation evidence.

read the letter

The paper's core move is to treat the inverse temperature beta in Boltzmann-Softmax as an explicit control variable for balancing efficiency and fairness in multi-agent resource allocation, then wrap it in AHC++ to adjust beta from observed dominance error. That reinterpretation plus the controller is the actual new piece. The static analysis shows a Pareto frontier and a Stability Corridor where total loss stays roughly flat, which is a clean observation. The scalability test with 100x agents and only 5.5x runtime is also useful for anyone thinking about deployment. The linked code is a plus for reproducibility checks later.

Referee Report

3 major / 2 minor

Summary. The paper proposes Computable Fair Division (CFD), reinterpreting the Boltzmann-Softmax function as a probabilistic resource allocation mechanism for AI systems (e.g., GPU time) where the inverse temperature β serves as a tunable control variable for the efficiency-fairness tradeoff. Static analysis identifies a Pareto frontier with a near-optimal Stability Corridor of approximately constant total loss. In the dynamic case, AHC++ (Adaptive Hard-Cap Controller++) adjusts β in real time via feedback from the error between observed dominance concentration and a target level. Simulations are claimed to show that AHC++ suppresses extreme dominance under exogenous shocks while tracking fairness targets with negligible throughput degradation; a scalability test reports that a 100× increase in agents produces only a ~5.5× increase in execution time. Reproducible code is provided via GitHub.

Significance. If the dynamic control result holds under characterized conditions, the framework supplies a practical, feedback-based method for maintaining diversity in large-scale AI resource allocation without large efficiency penalties. The open code is a clear strength supporting reproducibility, and the reported near-linear scaling with agent count is a concrete, falsifiable claim that could be tested in deployed systems.

major comments (3)

[Abstract / Simulations] Abstract and Simulations section: the headline claim that AHC++ 'suppresses extreme dominance concentration under exogenous shocks while tracking fairness targets without substantial throughput degradation' is presented without any reported simulation setup details, number of trials, statistical measures, error bars, or exact metrics (e.g., dominance concentration definition, throughput units), preventing verification that the data support the stated outcomes.
[Dynamic setting (AHC++)] Dynamic setting / AHC++ description: the controller updates β using the error between externally observed dominance and the target, yet no derivation, stability bounds, or analysis of measurement latency, noise, or agent response dynamics is supplied; this feedback loop is load-bearing for the central dynamic claim but remains unexamined beyond simulation outcomes.
[Scalability analysis] Scalability analysis: the statement that a 100× increase in agents yields only ~5.5× execution time requires an explicit baseline implementation, complexity derivation, and scaling law to substantiate the claim; without these the result cannot be assessed as general.

minor comments (2)

[Introduction] The terms 'Computable Fair Division (CFD)' and 'AHC++' are introduced without explicit comparison to prior fair-division or control-theoretic allocation literature, which would clarify novelty.
[Dynamic setting] Notation for the dominance error signal and the precise functional form of the β-update rule should be given as an equation rather than described in prose only.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which highlight important areas for improving clarity and rigor. We address each major comment point by point below, with honest indications of where the manuscript will be revised.

read point-by-point responses

Referee: [Abstract / Simulations] Abstract and Simulations section: the headline claim that AHC++ 'suppresses extreme dominance concentration under exogenous shocks while tracking fairness targets without substantial throughput degradation' is presented without any reported simulation setup details, number of trials, statistical measures, error bars, or exact metrics (e.g., dominance concentration definition, throughput units), preventing verification that the data support the stated outcomes.

Authors: We agree that the current presentation lacks sufficient experimental details to allow independent verification. In the revised manuscript we will expand both the abstract (if space permits) and the Simulations section to report: 100 independent trials per condition with distinct random seeds; means and standard deviations for all key metrics; error bars on all figures; the precise definition of dominance concentration (Gini coefficient over per-agent resource shares); and throughput (normalized aggregate allocation efficiency relative to the unconstrained optimum). These additions will directly support the headline claims. revision: yes
Referee: [Dynamic setting (AHC++)] Dynamic setting / AHC++ description: the controller updates β using the error between externally observed dominance and the target, yet no derivation, stability bounds, or analysis of measurement latency, noise, or agent response dynamics is supplied; this feedback loop is load-bearing for the central dynamic claim but remains unexamined beyond simulation outcomes.

Authors: The referee is correct that the manuscript supplies no formal derivation or stability analysis of the AHC++ feedback loop. The current work treats AHC++ as an empirically validated practical controller. In revision we will add a short derivation of the proportional update rule and a qualitative discussion of stability under the assumption of linear agent response. A full treatment of measurement latency, sensor noise, and closed-loop agent dynamics lies beyond the scope of the present study; we will explicitly note this limitation and flag it as future work while retaining the simulation evidence as the primary support for the dynamic claim. revision: partial
Referee: [Scalability analysis] Scalability analysis: the statement that a 100× increase in agents yields only ~5.5× execution time requires an explicit baseline implementation, complexity derivation, and scaling law to substantiate the claim; without these the result cannot be assessed as general.

Authors: We accept that the scalability statement requires additional substantiation. The reported factor was measured on the open-source implementation (baseline: unoptimized Python loop). In the revised manuscript we will state the algorithmic complexity (O(n log n) dominated by the sorted cumulative-share step for dominance tracking), include a log-log plot of wall-clock time versus agent count, and report the fitted scaling exponent. The GitHub repository already contains the exact scripts used for these measurements, enabling direct reproduction. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper introduces CFD by reinterpreting the Boltzmann-Softmax function as a resource allocation mechanism with β as an explicit control variable. Static Pareto analysis and the Stability Corridor are derived from the allocation equations without fitting to target outcomes. The AHC++ controller uses an external observed dominance error signal as feedback for β updates, which is independent of the allocation equations themselves rather than a fitted or self-defined quantity. No self-citations, uniqueness theorems, or ansatzes from prior author work are invoked as load-bearing. The simulation claims rest on external dynamics and measurements, not on any reduction of predictions to inputs by construction.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 2 invented entities

The central claim rests on the reinterpretation of beta as a real-time control variable and the introduction of AHC++ feedback; these are the primary additions beyond standard mathematics.

free parameters (2)

target dominance level
Policy-specified target used as reference for the error signal in AHC++ feedback loop.
beta
Inverse temperature treated as dynamically adjustable control variable rather than fixed hyperparameter.

axioms (1)

domain assumption Boltzmann-Softmax function can be repurposed from action selection to probabilistic resource distribution among agents.
This reinterpretation is the foundational step enabling the CFD framework.

invented entities (2)

Computable Fair Division (CFD) no independent evidence
purpose: Framework that treats Boltzmann-Softmax as resource allocator with beta as fairness-efficiency control.
Newly defined framework in the paper.
AHC++ (Adaptive Hard-Cap Controller++) no independent evidence
purpose: Real-time controller that updates beta using dominance error feedback.
New adaptive controller proposed for the dynamic setting.

pith-pipeline@v0.9.0 · 5731 in / 1379 out tokens · 37712 ms · 2026-05-25T00:32:21.363453+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel; dAlembert_to_ODE_general echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

We reinterpret the Boltzmann–Softmax function … as a probabilistic resource allocation mechanism, redefining the inverse temperature parameter β as a computable control variable … efficiency loss … fairness loss … entropy-based … unified loss function L_tot(β,λ) = λ L_eff(β) + (1−λ) L_ineq(β)
IndisputableMonolith/Foundation/BranchSelection.lean; IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean branch_selection; alpha_pin_under_high_calibration echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

AHC++ … updates β in real time using the error between observed dominance and a policy-specified target as feedback … Stability Corridor … near-optimal operating region where total loss remains approximately constant

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages · 1 internal anchor

[1]

Arrow, K. J. (1951). Social choice and individual values. Yale University Press. Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2(3), 244–263. Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press. Bommasani, R., Hudson, D. A., Adeli, E., Altman, R., A...

work page internal anchor Pith review Pith/arXiv arXiv 1951
[2]

Park, J.-W., & Kim, C. U. (2021). Getting to a feasible income equality. PLOS ONE, 16(3), e0249204. Park, J.-W., Kim, C. U., & Isard, W. (2012). Permit allocation in emissions trading using the Boltzmann distribution. Physica A: Statistical Mechanics and Its Applications, 391(20), 4883–

work page 2021
[3]

U., Ghim, C

Park, J.-W., Kim, J. U., Ghim, C. -M., & Kim, C. U. (2022). The Boltzmann fair division for distributive justice. Scientific Reports, 12, 16179. Samuelson, P. A. (1947). Foundations of economic analysis. Harvard University Press. Shapley, L. S. (1953). A value for n -person games. In H. W. Kuhn & A. W. Tucker (Eds.), Contributions to the theory of games I...

work page arXiv 2022

[1] [1]

Arrow, K. J. (1951). Social choice and individual values. Yale University Press. Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2(3), 244–263. Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press. Bommasani, R., Hudson, D. A., Adeli, E., Altman, R., A...

work page internal anchor Pith review Pith/arXiv arXiv 1951

[2] [2]

Park, J.-W., & Kim, C. U. (2021). Getting to a feasible income equality. PLOS ONE, 16(3), e0249204. Park, J.-W., Kim, C. U., & Isard, W. (2012). Permit allocation in emissions trading using the Boltzmann distribution. Physica A: Statistical Mechanics and Its Applications, 391(20), 4883–

work page 2021

[3] [3]

U., Ghim, C

Park, J.-W., Kim, J. U., Ghim, C. -M., & Kim, C. U. (2022). The Boltzmann fair division for distributive justice. Scientific Reports, 12, 16179. Samuelson, P. A. (1947). Foundations of economic analysis. Harvard University Press. Shapley, L. S. (1953). A value for n -person games. In H. W. Kuhn & A. W. Tucker (Eds.), Contributions to the theory of games I...

work page arXiv 2022