hub

arXiv preprint arXiv:1704.00805 , year=

On the properties of the softmax function with application in game theory, reinforcement learning , author= · 2017 · math.OC · arXiv 1704.00805

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

open full Pith review browse 13 citing papers arXiv PDF

abstract

In this paper, we utilize results from convex analysis and monotone operator theory to derive additional properties of the softmax function that have not yet been covered in the existing literature. In particular, we show that the softmax function is the monotone gradient map of the log-sum-exp function. By exploiting this connection, we show that the inverse temperature parameter determines the Lipschitz and co-coercivity properties of the softmax function. We then demonstrate the usefulness of these properties through an application in game-theoretic reinforcement learning.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Sharp Spectral Thresholds for Logit Fixed Points

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

For finite-dimensional affine logit systems the sharp dimension-free stability threshold is β‖ΠWΠ‖_{T→T}<2, extending the certified regime beyond classical conservative bounds.

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

cs.LG · 2025-10-17 · unverdicted · novelty 7.0

Establishes last-iterate convergence rates for on-policy Q-learning under minimal irreducibility assumptions, with sample complexity O(1/ξ²) matching off-policy up to exploration factors.

Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate

cs.LG · 2025-05-26 · unverdicted · novelty 7.0

ConfSMoE adds expert-opinion imputation and detaches softmax routing scores to ground-truth task confidence to relieve expert collapse in SMoE without extra load-balance losses, evaluated on four real-world datasets.

On Bayesian Softmax-Gated Mixture-of-Experts Models

stat.ML · 2026-04-22 · unverdicted · novelty 7.0

Bayesian softmax-gated mixture-of-experts models achieve posterior contraction for density estimation and parameter recovery using Voronoi losses, plus two strategies for choosing the number of experts.

Neural Policy Composition from Free Energy Minimization

math.OC · 2025-12-04 · unverdicted · novelty 6.0

Policy composition emerges from variational free energy minimization through a convergent gradient flow with a soft-competitive recurrent neural implementation.

Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets

cs.LG · 2019-06-26 · unverdicted · novelty 6.0

Derives algorithm-dependent generalization bounds for neural nets using multilevel entropic regularization and proposes a Metropolis-simulated multi-scale Gibbs training procedure tested on a two-layer net for MNIST.

Optimizing Server Placement for Vertical Federated Learning in Dynamic Edge/Fog Networks

cs.NI · 2026-05-10 · unverdicted · novelty 6.0

SC-DN establishes a global first-order stationary point per round and solves a mixed-integer signomial program to optimize four control variables for VFL, yielding better classification performance and lower resource use than greedy baselines on image and multi-modal data.

Rethinking Intrinsic Dimension Estimation in Neural Representations

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

Common ID estimators fail to track the true intrinsic dimension of neural representations and are instead driven by other factors.

Learning Empirical Evidence Equilibria under Weak Environmental Coupling

cs.GT · 2026-05-18 · unverdicted · novelty 5.0 · 2 refs

Decentralized Q-learning agents reach an Empirical Evidence Equilibrium in weakly coupled dynamic environments.

Informative Graph Structure Learning

cs.LG · 2026-05-16 · unverdicted · novelty 5.0

InGSL reduces edge redundancy in existing graph structure learning methods by adding a mutual-information-guided diversity term, delivering better results with fewer edges across six tested frameworks.

Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms

cs.LG · 2024-04-20 · unverdicted · novelty 5.0

Unified ODE convergence analysis for smooth Q-learning variants via p-norm Lyapunov functions, valid even when the Bellman operator is not a contraction.

Structure-Centric Graph Foundation Model via Geometric Bases

cs.LG · 2026-05-09 · unverdicted · novelty 5.0

SCGFM creates transferable graph representations by aligning heterogeneous topologies to shared learnable geometric bases via Gromov-Wasserstein distances and re-encoding features accordingly.

Learning Cut Distributions with Quantum Optimization

quant-ph · 2026-04-15 · unverdicted · novelty 5.0

QAOA ansatz with finite layers can capture any bitstring distribution and solves the Fair Cut Cover problem with provable and empirical advantages over classical approximations on certain graphs.

citing papers explorer

Showing 13 of 13 citing papers.

Sharp Spectral Thresholds for Logit Fixed Points cs.LG · 2026-05-15 · unverdicted · none · ref 4 · internal anchor
For finite-dimensional affine logit systems the sharp dimension-free stability threshold is β‖ΠWΠ‖_{T→T}<2, extending the certified regime beyond classical conservative bounds.
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies cs.LG · 2025-10-17 · unverdicted · none · ref 55 · internal anchor
Establishes last-iterate convergence rates for on-policy Q-learning under minimal irreducibility assumptions, with sample complexity O(1/ξ²) matching off-policy up to exploration factors.
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate cs.LG · 2025-05-26 · unverdicted · none · ref 10 · internal anchor
ConfSMoE adds expert-opinion imputation and detaches softmax routing scores to ground-truth task confidence to relieve expert collapse in SMoE without extra load-balance losses, evaluated on four real-world datasets.
On Bayesian Softmax-Gated Mixture-of-Experts Models stat.ML · 2026-04-22 · unverdicted · none · ref 112
Bayesian softmax-gated mixture-of-experts models achieve posterior contraction for density estimation and parameter recovery using Voronoi losses, plus two strategies for choosing the number of experts.
Neural Policy Composition from Free Energy Minimization math.OC · 2025-12-04 · unverdicted · none · ref 19 · internal anchor
Policy composition emerges from variational free energy minimization through a convergent gradient flow with a soft-competitive recurrent neural implementation.
Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets cs.LG · 2019-06-26 · unverdicted · none · ref 49 · internal anchor
Derives algorithm-dependent generalization bounds for neural nets using multilevel entropic regularization and proposes a Metropolis-simulated multi-scale Gibbs training procedure tested on a two-layer net for MNIST.
Optimizing Server Placement for Vertical Federated Learning in Dynamic Edge/Fog Networks cs.NI · 2026-05-10 · unverdicted · none · ref 57
SC-DN establishes a global first-order stationary point per round and solves a mixed-integer signomial program to optimize four control variables for VFL, yielding better classification performance and lower resource use than greedy baselines on image and multi-modal data.
Rethinking Intrinsic Dimension Estimation in Neural Representations cs.LG · 2026-04-22 · unverdicted · none · ref 48
Common ID estimators fail to track the true intrinsic dimension of neural representations and are instead driven by other factors.
Learning Empirical Evidence Equilibria under Weak Environmental Coupling cs.GT · 2026-05-18 · unverdicted · none · ref 15 · 2 links · internal anchor
Decentralized Q-learning agents reach an Empirical Evidence Equilibrium in weakly coupled dynamic environments.
Informative Graph Structure Learning cs.LG · 2026-05-16 · unverdicted · none · ref 54 · internal anchor
InGSL reduces edge redundancy in existing graph structure learning methods by adding a mutual-information-guided diversity term, delivering better results with fewer edges across six tested frameworks.
Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms cs.LG · 2024-04-20 · unverdicted · none · ref 33 · internal anchor
Unified ODE convergence analysis for smooth Q-learning variants via p-norm Lyapunov functions, valid even when the Bellman operator is not a contraction.
Structure-Centric Graph Foundation Model via Geometric Bases cs.LG · 2026-05-09 · unverdicted · none · ref 43
SCGFM creates transferable graph representations by aligning heterogeneous topologies to shared learnable geometric bases via Gromov-Wasserstein distances and re-encoding features accordingly.
Learning Cut Distributions with Quantum Optimization quant-ph · 2026-04-15 · unverdicted · none · ref 51
QAOA ansatz with finite layers can capture any bitstring distribution and solves the Fair Cut Cover problem with provable and empirical advantages over classical approximations on certain graphs.

arXiv preprint arXiv:1704.00805 , year=

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer