Title resolution pending

12 Pith papers cite this work. Polarity classification is still indexing.

citation summary

years: 2026 (12)
roles: background (1)
polarities: background (1)

representative citing papers

AEL: Agent Evolving Learning for Open-Ended Environments

cs.CL · 2026-04-23 · conditional · novelty 7.0

AEL uses a fast-timescale bandit for memory policy selection and slow-timescale LLM reflection for causal insights, achieving a Sharpe ratio of 2.13 on a 208-episode portfolio benchmark and showing that adding further mechanisms degrades performance.

Learning to Cut: Reinforcement Learning for Benders Decomposition

math.OC · 2026-05-07 · unverdicted · novelty 6.0

RLBD trains a neural policy with REINFORCE to select cuts adaptively in Benders decomposition, yielding faster convergence and better generalization than standard BD or SVM-based LearnBD on an EV charging problem.
