pith. machine review for the scientific record. sign in

Intrinsic motivation and automatic curricula via asymmetric self-play

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.LG 2

years

2026 1 2019 1

roles

background 1

polarities

background 1

representative citing papers

Scaling Self-Play with Self-Guidance

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

SGS adds self-guidance to LLM self-play for Lean4 theorem proving, surpassing RL baselines and enabling a 7B model to outperform a 671B model after 200 rounds.

citing papers explorer

Showing 2 of 2 citing papers.

  • Dota 2 with Large Scale Deep Reinforcement Learning cs.LG · 2019-12-13 · accept · none · ref 36

    OpenAI Five achieved superhuman performance in Dota 2 by defeating the world champions using scaled self-play reinforcement learning.

  • Scaling Self-Play with Self-Guidance cs.LG · 2026-04-22 · unverdicted · none · ref 34

    SGS adds self-guidance to LLM self-play for Lean4 theorem proving, surpassing RL baselines and enabling a 7B model to outperform a 671B model after 200 rounds.