arXiv preprint arXiv:2101.10895

4 Pith papers cite this work. Polarity classification is still indexing.

years

2026: 2 · 2024: 2

verdicts

UNVERDICTED 4

representative citing papers

Primal-Dual Policy Optimization for Linear CMDPs with Adversarial Losses

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

A new primal-dual algorithm for adversarial linear CMDPs achieves the first sublinear regret and constraint violation bounds of order K to the 3/4 using weighted LogSumExp softmax policies with periodic mixing and regularized dual updates.

Inpatient Overflow Management with Proximal Policy Optimization

math.OC · 2024-10-17 · unverdicted · novelty 6.0

A PPO reinforcement learning method using atomic actions, partially-shared policies, and queueing-informed value approximation scales inpatient overflow optimization to hospital systems with 20 patient classes and wards, matching or beating benchmarks where prior methods fail.
