pith. sign in

Rlprompt: Optimizing discrete text prompts with reinforcement learning,

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

dataset 1

citation-polarity summary

roles

dataset 1

polarities

use dataset 1

representative citing papers

Learning, Fast and Slow: Towards LLMs That Adapt Continually

cs.LG · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

Fast-Slow Training uses context optimization as fast weights alongside parameter updates as slow weights to achieve up to 3x better sample efficiency, higher performance, and less catastrophic forgetting than standard RL in continual LLM learning.

Large Language Models as Optimizers

cs.LG · 2023-09-07 · unverdicted · novelty 7.0

Large language models can optimize by being prompted with histories of past solutions and scores to propose better ones, producing prompts that raise accuracy up to 8% on GSM8K and 50% on Big-Bench Hard over human-designed baselines.

Robust Adaptation of Foundation Models with Black-Box Visual Prompting

cs.CV · 2024-07-04 · unverdicted · novelty 6.0

BlackVIP adapts foundation models via a Coordinator for input-dependent visual prompts and SPSA-GC for gradient estimation, enabling robust transfer on 19 datasets with low memory use and a link to randomized smoothing robustness.

citing papers explorer

Showing 6 of 6 citing papers.