Title resolution pending

· 2014 · DOI 10.1007/978-3-319-00227-9

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open at publisher browse 6 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Sinkhorn Treatment Effects: A Causal Optimal Transport Measure

stat.ML · 2026-05-08 · unverdicted · novelty 7.0

The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asymptotically valid tests for distributional treatment effects.

Diffusion Operator Geometry of Feedforward Representations

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

A Gaussian-kernel diffusion operator on feature clouds yields closed-form class affinities and spectra in Gaussian models, with provably smooth observables under perturbations.

Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond

stat.ML · 2025-02-07 · unverdicted · novelty 7.0

Derives Õ(d β² A² / ε⁴) oracle complexity for AIS estimating normalizing constant Z to relative error ε and introduces reverse diffusion sampler for geometric paths with large action.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Training-Induced Escape from Token Clustering in a Mean-Field Formulation of Transformers

cs.LG · 2026-05-08 · unverdicted · novelty 5.0

Training a mean-field Transformer under L2 regularization induces an escape from attention-driven token clustering in later layers after initial clustering.

Convergence rate of the occupation measure of classes of ergodic processes toward their invariant distribution in mean Wasserstein distance

math.PR · 2026-05-07 · unverdicted · novelty 5.0

General criteria extend L^p-mean Wasserstein convergence rates of occupation measures to non-stationary or non-Markovian ergodic processes under conditional convergence to equilibrium, with applications to Brownian diffusions and fractional Brownian driven SDEs.

citing papers explorer

Showing 6 of 6 citing papers.

Sinkhorn Treatment Effects: A Causal Optimal Transport Measure stat.ML · 2026-05-08 · unverdicted · none · ref 68
The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asymptotically valid tests for distributional treatment effects.
Diffusion Operator Geometry of Feedforward Representations cs.LG · 2026-05-01 · unverdicted · none · ref 17
A Gaussian-kernel diffusion operator on feature clouds yields closed-form class affinities and spectra in Gaussian models, with provably smooth observables under perturbations.
Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond stat.ML · 2025-02-07 · unverdicted · none · ref 9
Derives Õ(d β² A² / ε⁴) oracle complexity for AIS estimating normalizing constant Z to relative error ε and introduces reverse diffusion sampler for geometric paths with large action.
Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing cs.LG · 2026-05-15 · unverdicted · none · ref 16
Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.
Training-Induced Escape from Token Clustering in a Mean-Field Formulation of Transformers cs.LG · 2026-05-08 · unverdicted · none · ref 74
Training a mean-field Transformer under L2 regularization induces an escape from attention-driven token clustering in later layers after initial clustering.
Convergence rate of the occupation measure of classes of ergodic processes toward their invariant distribution in mean Wasserstein distance math.PR · 2026-05-07 · unverdicted · none · ref 47
General criteria extend L^p-mean Wasserstein convergence rates of occupation measures to non-stationary or non-Markovian ergodic processes under conditional convergence to equilibrium, with applications to Brownian diffusions and fractional Brownian driven SDEs.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer