What Makes Good In-Context Examples for

Liu, Jiachang, Shen, Dinghan, Zhang, Yizhe, Dolan, Bill, Carin, Lawrence, Chen, Weizhu , booktitle = · 2022 · DOI 10.18653/v1/2022.deelio-1.10

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

open at publisher browse 12 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Self-Improving In-Context Learning

cs.CL · 2026-05-22 · unverdicted · novelty 7.0

A test-time zeroth-order optimization of prompt embeddings using a bounded self-supervised proxy from demonstration log-probabilities improves ICL accuracy and correlates with gains across tasks.

Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning

cs.CL · 2026-04-13 · unverdicted · novelty 7.0

Legal2LogicICL improves accuracy and generalization when mapping legal cases to logical formulas by retrieving balanced diverse exemplars at semantic and structural levels, backed by the new Legal2Proleg dataset.

LC-ICL: Label-Guided Contrastive In-Context Learning for Robust Information Extraction

cs.CL · 2026-06-28 · unverdicted · novelty 6.0

LC-ICL improves few-shot NER and RE by using label-guided contrastive demonstrations that pair positive samples with error-annotated negative samples.

Activation-Based Active Learning for In-Context Learning: Challenges and Insights

cs.CL · 2026-06-03 · unverdicted · novelty 6.0

MLP activations measured as massive activations or first four moments correlate weakly (max |Spearman| = 0.33) with in-context example quality across Llama-3.2-3B, Qwen2.5-3B, and multiple classification/generative tasks, so activation-based active learning should not be used for ICL.

CRAFT: Cost-aware Refinement And Front-aware Tuning of Prompts

cs.CL · 2026-06-03 · unverdicted · novelty 6.0

CRAFT is a Pareto-front prompt optimizer that allocates scarce LLM validation calls to candidates near the current front using accuracy- and cost-oriented generators plus NSGA-II retention.

Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

cs.CL · 2026-05-13 · unverdicted · novelty 6.0 · 2 refs

Many-shot CoT-ICL improves when demonstrations are ordered for smooth conceptual progression, with CDS delivering up to 5.42 percentage-point gains on math tasks using 64 examples.

Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

METIS internalizes curriculum judgment in LLM reinforcement fine-tuning by predicting within-prompt reward variance via in-context learning and jointly optimizing with a self-judgment reward, yielding superior performance and up to 67% faster convergence across math, code, and agent benchmarks.

cs.SE · 2026-05-08 · unverdicted · novelty 6.0

SPARK improves LLM-based test code fault localization by retrieving similar past faults and selectively annotating suspicious lines in new failing tests.

GRaSp: Automatic Example Optimization for In-Context Learning in Low-Data Tasks

cs.CL · 2026-05-08 · unverdicted · novelty 6.0

GRaSp optimizes in-context examples for LLMs via synthetic generation, clustering, dimensionality reduction, and genetic algorithms with diversity-adaptive mutation, reaching 45.84% micro-F1 on financial NER with real data and outperforming zero-shot and random few-shot baselines.

Understanding the Prompt Sensitivity

cs.CL · 2026-04-20 · unverdicted · novelty 5.0

LLMs disperse meaning-preserving prompts internally instead of clustering them, which produces an excessively high upper bound on output log-probability differences via Taylor expansion and Cauchy-Schwarz.

Querying an astronomical database using large language models: the ALeRCE text-to-SQL system

astro-ph.IM · 2026-06-16 · unverdicted · novelty 4.0

Presents a four-module LLM framework for text-to-SQL on the ALeRCE astro database, evaluated on 110 NL/SQL pairs across 13 models with perfect-match metrics.

When Reranking Hurts: Uncertainty-Based Gating for Few-Shot Reranking

cs.CL · 2026-06-30

citing papers explorer

Showing 12 of 12 citing papers.

Self-Improving In-Context Learning cs.CL · 2026-05-22 · unverdicted · none · ref 25
A test-time zeroth-order optimization of prompt embeddings using a bounded self-supervised proxy from demonstration log-probabilities improves ICL accuracy and correlates with gains across tasks.
Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning cs.CL · 2026-04-13 · unverdicted · none · ref 11
Legal2LogicICL improves accuracy and generalization when mapping legal cases to logical formulas by retrieving balanced diverse exemplars at semantic and structural levels, backed by the new Legal2Proleg dataset.
LC-ICL: Label-Guided Contrastive In-Context Learning for Robust Information Extraction cs.CL · 2026-06-28 · unverdicted · none · ref 20
LC-ICL improves few-shot NER and RE by using label-guided contrastive demonstrations that pair positive samples with error-annotated negative samples.
Activation-Based Active Learning for In-Context Learning: Challenges and Insights cs.CL · 2026-06-03 · unverdicted · none · ref 40
MLP activations measured as massive activations or first four moments correlate weakly (max |Spearman| = 0.33) with in-context example quality across Llama-3.2-3B, Qwen2.5-3B, and multiple classification/generative tasks, so activation-based active learning should not be used for ICL.
CRAFT: Cost-aware Refinement And Front-aware Tuning of Prompts cs.CL · 2026-06-03 · unverdicted · none · ref 157
CRAFT is a Pareto-front prompt optimizer that allocates scarce LLM validation calls to candidates near the current front using accuracy- and cost-oriented generators plus NSGA-II retention.
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn cs.CL · 2026-05-13 · unverdicted · none · ref 23 · 2 links
Many-shot CoT-ICL improves when demonstrations are ordered for smooth conceptual progression, with CDS delivering up to 5.42 percentage-point gains on math tasks using 64 examples.
Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning cs.LG · 2026-05-11 · unverdicted · none · ref 28
METIS internalizes curriculum judgment in LLM reinforcement fine-tuning by predicting within-prompt reward variance via in-context learning and jointly optimizing with a self-judgment reward, yielding superior performance and up to 67% faster convergence across math, code, and agent benchmarks.
Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization cs.SE · 2026-05-08 · unverdicted · none · ref 40
SPARK improves LLM-based test code fault localization by retrieving similar past faults and selectively annotating suspicious lines in new failing tests.
GRaSp: Automatic Example Optimization for In-Context Learning in Low-Data Tasks cs.CL · 2026-05-08 · unverdicted · none · ref 7
GRaSp optimizes in-context examples for LLMs via synthetic generation, clustering, dimensionality reduction, and genetic algorithms with diversity-adaptive mutation, reaching 45.84% micro-F1 on financial NER with real data and outperforming zero-shot and random few-shot baselines.
Understanding the Prompt Sensitivity cs.CL · 2026-04-20 · unverdicted · none · ref 55
LLMs disperse meaning-preserving prompts internally instead of clustering them, which produces an excessively high upper bound on output log-probability differences via Taylor expansion and Cauchy-Schwarz.
Querying an astronomical database using large language models: the ALeRCE text-to-SQL system astro-ph.IM · 2026-06-16 · unverdicted · none · ref 51
Presents a four-module LLM framework for text-to-SQL on the ALeRCE astro database, evaluated on 110 NL/SQL pairs across 13 models with perfect-match metrics.
When Reranking Hurts: Uncertainty-Based Gating for Few-Shot Reranking cs.CL · 2026-06-30 · unreviewed · ref 36

What Makes Good In-Context Examples for

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer