pith. sign in

The variational formulation of the

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

method 1

citation-polarity summary

years

2026 5

verdicts

UNVERDICTED 5

roles

method 1

polarities

use method 1

representative citing papers

Scaling Limits of Long-Context Transformers

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

For uniform keys on the d-dimensional sphere, softmax attention becomes selective at inverse temperature scaling β_n* ≍ n^{2/(d-1)}, with explicit limiting laws for attention weights and outputs in each regime.

Sobolev Regularized MMD Gradient Flow

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Sobolev regularization on the witness function enables global convergence of MMD gradient flows for both sampling and generative modeling without isoperimetric assumptions.

Learning Normalized Energy Models for Linear Inverse Problems

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Energy-based model with covariance regularization computes normalized posteriors for linear inverse problems without retraining, enabling adaptive sampling and blind estimation on image datasets.

citing papers explorer

Showing 5 of 5 citing papers.

  • Scaling Limits of Long-Context Transformers cs.LG · 2026-05-08 · unverdicted · none · ref 125

    For uniform keys on the d-dimensional sphere, softmax attention becomes selective at inverse temperature scaling β_n* ≍ n^{2/(d-1)}, with explicit limiting laws for attention weights and outputs in each regime.

  • Sobolev Regularized MMD Gradient Flow cs.LG · 2026-05-12 · unverdicted · none · ref 96

    Sobolev regularization on the witness function enables global convergence of MMD gradient flows for both sampling and generative modeling without isoperimetric assumptions.

  • Discrete Optimal Transport: Rapid Convergence of Simulated Annealing Algorithms cs.DS · 2026-05-07 · unverdicted · none · ref 26

    Using a new discrete Wasserstein distance and action functional, the paper proves polynomial convergence rates for annealed Glauber dynamics in mean-field Ising and Potts models.

  • Learning Normalized Energy Models for Linear Inverse Problems cs.LG · 2026-05-15 · unverdicted · none · ref 7

    Energy-based model with covariance regularization computes normalized posteriors for linear inverse problems without retraining, enabling adaptive sampling and blind estimation on image datasets.

  • Properties and limitations of geometric tempering for gradient flow dynamics stat.ML · 2026-04-22 · unverdicted · none · ref 96

    Geometric tempering yields exponential convergence bounds for both Wasserstein and Fisher-Rao flows but produces no speedup in the Fisher-Rao metric, with new adaptive schedules derived from the tempered dynamics.