pith. sign in

arXiv preprint arXiv:2210.01021 , year=

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

cs.LG · 2026-05-29 · unverdicted · novelty 5.0

PVPO is a sample-efficient RL method that improves semantic, geometric, and physical quality in LLM LEGO assembly generation by mitigating the PhysHack failure mode where validity alone fails to ensure fidelity.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning cs.LG · 2026-05-29 · unverdicted · none · ref 12

    PVPO is a sample-efficient RL method that improves semantic, geometric, and physical quality in LLM LEGO assembly generation by mitigating the PhysHack failure mode where validity alone fails to ensure fidelity.