John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov

Eva Maxfield Brown, Lindsey Schwartz, Richard Lewei Huang, Nicholas Weber · 2024 · arXiv 7899.2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym

cs.AI · 2026-04-10 · unverdicted · novelty 7.0

Spatial-Gym benchmark shows the best tested model solves only 16% of pathfinding tasks versus 98% for humans, with step-by-step and backtracking formats producing mixed effects across model strengths.

Operationalizing Research Software for Supply Chain Security

cs.SE · 2026-01-28 · accept · novelty 6.0

A harmonized RSSC-oriented taxonomy standardizes research software definitions, maps prior studies, and demonstrates that security signals from OpenSSF Scorecard vary meaningfully across taxonomy clusters on the RSE corpus.

citing papers explorer

Showing 2 of 2 citing papers.

Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym cs.AI · 2026-04-10 · unverdicted · none · ref 3
Spatial-Gym benchmark shows the best tested model solves only 16% of pathfinding tasks versus 98% for humans, with step-by-step and backtracking formats producing mixed effects across model strengths.
Operationalizing Research Software for Supply Chain Security cs.SE · 2026-01-28 · accept · none · ref 2
A harmonized RSSC-oriented taxonomy standardizes research software definitions, maps prior studies, and demonstrates that security signals from OpenSSF Scorecard vary meaningfully across taxonomy clusters on the RSE corpus.

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov

fields

years

verdicts

representative citing papers

citing papers explorer