Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587):484–489

5 Pith papers cite this work. Polarity classification is still indexing.

fields

cs.AI: 3 · cs.CL: 2

years

2026: 3 · 2025: 2

representative citing papers

ANO: A Principled Approach to Robust Policy Optimization

cs.AI · 2026-05-04 · unverdicted · novelty 6.0

ANO derives a robust policy optimizer from geometric principles that replaces clipping with a smooth redescending gradient, showing better performance and stability than PPO, SPO, and GRPO in MuJoCo, Atari, and RLHF experiments.
