Journal of Machine Learning Research

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method 2

citation-polarity summary

years

2026 6

verdicts

UNVERDICTED 6

polarities

use method 2

representative citing papers

Delightful Gradients Accelerate Corner Escape

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Delightful Policy Gradient removes exponential corner trapping in softmax policy optimization for bandits and tabular MDPs, achieving logarithmic escape times and global O(1/t) convergence.

Actor-Critic Algorithm for Dynamic Expectile and CVaR

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

A model-free, off-policy actor-critic algorithm for dynamic expectile and CVaR objectives, built from a surrogate policy gradient that requires no transition perturbation and from elicitability-based value learning. The summary reports empirical outperformance in risk-averse domains.
