pith. sign in

Improving generalization in meta reinforcement learning using learned objectives

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

years

2025 2 2024 1

verdicts

UNVERDICTED 3

representative citing papers

ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

cs.CL · 2025-09-17 · unverdicted · novelty 6.0

ShinkaEvolve improves sample efficiency in LLM-driven program evolution via parent sampling, code novelty rejection-sampling, and bandit LLM ensemble selection, achieving new SOTA circle packing with 150 samples and gains on math reasoning and competitive programming tasks.

citing papers explorer

Showing 3 of 3 citing papers.