Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Cheng, Jiale, Liu, Xiao, Zheng, Kehan, Ke, Pei, Wang, Hongning, Dong, Yuxiao · 2024 · DOI 10.18653/v1/2024.acl-long.176

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Wait, am I Being Fair? Characterizing Deductive Stereotyping and Mitigating It with Fair-GCG

cs.CL · 2026-06-30 · unverdicted · novelty 6.0

The paper characterizes deductive stereotyping in LLMs and introduces Fair-GCG to discover injection phrases that improve fairness across benchmarks, reasoning, and real-world tasks.

CRAFT: Cost-aware Refinement And Front-aware Tuning of Prompts

cs.CL · 2026-06-03 · unverdicted · novelty 6.0

CRAFT is a Pareto-front prompt optimizer that allocates scarce LLM validation calls to candidates near the current front using accuracy- and cost-oriented generators plus NSGA-II retention.

iPOE: Interpretable Prompt Optimization via Explanations

cs.CL · 2026-05-18 · unverdicted · novelty 6.0

iPOE generates and optimizes annotation guidelines from explanations to produce interpretable prompts, reporting up to 39% gains over baselines on four datasets with LLM explanations substituting for human ones.

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

cs.CL · 2026-03-16

citing papers explorer

Showing 3 of 3 citing papers after filters.

Wait, am I Being Fair? Characterizing Deductive Stereotyping and Mitigating It with Fair-GCG

cs.CL · 2026-06-30 · unverdicted · none · ref 55

CRAFT: Cost-aware Refinement And Front-aware Tuning of Prompts

cs.CL · 2026-06-03 · unverdicted · none · ref 16

iPOE: Interpretable Prompt Optimization via Explanations

cs.CL · 2026-05-18 · unverdicted · none · ref 35
