pith. sign in

Title resolution pending

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 10

roles

background 1

polarities

background 1

clear filters

representative citing papers

Learning, Fast and Slow: Towards LLMs That Adapt Continually

cs.LG · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

Fast-Slow Training uses context optimization as fast weights alongside parameter updates as slow weights to achieve up to 3x better sample efficiency, higher performance, and less catastrophic forgetting than standard RL in continual LLM learning.

Contrastive Reflection for Iterative Prompt Optimization

cs.AI · 2026-06-29 · unverdicted · novelty 5.0

Contrastive Reflection identifies error-anchored slices in agent traces, adds contrastive successes, and uses a Teacher LLM to generate prompt edits that are accepted only if they improve validation performance, raising HotpotQA exact-match from 51.4% to 60.4%.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • Contrastive Reflection for Iterative Prompt Optimization cs.AI · 2026-06-29 · unverdicted · none · ref 9

    Contrastive Reflection identifies error-anchored slices in agent traces, adds contrastive successes, and uses a Teacher LLM to generate prompt edits that are accepted only if they improve validation performance, raising HotpotQA exact-match from 51.4% to 60.4%.

  • Trace2Policy: From Expert Behavior Traces to Self-Evolving Decision Agents cs.AI · 2026-06-09 · unverdicted · none · ref 45

    Trace2Policy's EISR iteratively refines expert-derived rules into compiled Python code reaching 79.6% accuracy on skewed compliance tasks, outperforming one-shot LLM distillation and a deployed LLM baseline.