Title resolution pending

Krista Opsahl-Ong, Michael J · 2024 · arXiv 2406.11695

10 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Learning, Fast and Slow: Towards LLMs That Adapt Continually

cs.LG · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

Fast-Slow Training uses context optimization as fast weights alongside parameter updates as slow weights to achieve up to 3x better sample efficiency, higher performance, and less catastrophic forgetting than standard RL in continual LLM learning.

FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients

cs.CL · 2026-04-29 · unverdicted · novelty 7.0

FlowBot automatically induces LLM workflows through bilevel optimization with textual gradients, achieving competitive performance against human-crafted baselines.

Evolving and Detecting Multi-Turn Deception using Geometric Signatures

stat.ML · 2026-05-26 · unverdicted · novelty 6.0

Multi-objective genetic prompt optimization creates human-validated multi-turn deception datasets; the deception is then detected with 0.89 recall using angular coverage, distance ratio, and linearity features in embeddings.

LLMs Are Already Good Tutors: Training-Free Prompt Optimization for Pedagogical Math Tutoring

cs.CL · 2026-05-26 · unverdicted · novelty 6.0

Training-free prompt optimization methods, including five new education-focused ones, surpass the strongest RL-trained baseline across five conditions on two OOD suites while showing distinct teaching behavior patterns.

optimize_anything: A Universal API for Optimizing any Text Parameter

cs.CL · 2026-05-19 · unverdicted · novelty 6.0

A universal LLM optimizer for text artifacts achieves SOTA results on six tasks including tripling ARC-AGI accuracy and cutting cloud costs by 40% via cross-task transfer and side information.

Contexting as Recommendation: Evolutionary Collaborative Filtering for Context Engineering

cs.CL · 2026-05-15 · conditional · novelty 6.0

NCCE reframes context engineering as instance-level recommendation via bootstrapped anchor contexts and a co-evolving neural collaborative filtering router that assigns specialized contexts per input.

EditFlow: Benchmarking and Optimizing Code Edit Recommendation Systems via Reconstruction of Developer Flows

cs.SE · 2026-02-25 · unverdicted · novelty 6.0

EditFlow reconstructs temporal developer editing flows from code changes to benchmark and optimize AI code edit recommenders so they align with natural incremental reasoning rather than static snapshots.

Contrastive Reflection for Iterative Prompt Optimization

cs.AI · 2026-06-29 · unverdicted · novelty 5.0

Contrastive Reflection identifies error-anchored slices in agent traces, adds contrastive successes, and uses a Teacher LLM to generate prompt edits that are accepted only if they improve validation performance, raising HotpotQA exact-match from 51.4% to 60.4%.

Trace2Policy: From Expert Behavior Traces to Self-Evolving Decision Agents

cs.AI · 2026-06-09 · unverdicted · novelty 5.0

Trace2Policy's EISR iteratively refines expert-derived rules into compiled Python code reaching 79.6% accuracy on skewed compliance tasks, outperforming one-shot LLM distillation and a deployed LLM baseline.

Automated Instruction Revision (AIR): A Structured Comparison of Task Adaptation Strategies for LLM

cs.CL · 2026-04-10 · unverdicted · novelty 5.0

AIR excels on label-remapping classification tasks while KNN retrieval leads on closed-book QA and fine-tuning leads on structured extraction and event-order reasoning, showing task-dependent adaptation performance.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Contrastive Reflection for Iterative Prompt Optimization

cs.AI · 2026-06-29 · unverdicted · none · ref 9

Trace2Policy: From Expert Behavior Traces to Self-Evolving Decision Agents

cs.AI · 2026-06-09 · unverdicted · none · ref 45
