pith. sign in

Mixed citations

Text2reward: Automated dense reward function generation for reinforcement learning

Mixed citation behavior. Most common role is background (67%).

8 Pith papers citing it
Background 67% of classified citations

citation-role summary

background 4 method 2

citation-polarity summary

clear filters

representative citing papers

Automatic Generation of High-Performance RL Environments

cs.LG · 2026-03-12 · conditional · novelty 7.0

Closed-loop prompt-based translation with hierarchical verification and iterative repair produces equivalent high-performance RL environments across five cases including new TCGJax.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Automatic Generation of High-Performance RL Environments cs.LG · 2026-03-12 · conditional · none · ref 24

    Closed-loop prompt-based translation with hierarchical verification and iterative repair produces equivalent high-performance RL environments across five cases including new TCGJax.