How predictable are large language model capabilities? a case study on big-bench

Ye, Q · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Predicting Performance of Symbolic and Prompt Programs with Examples

cs.LG · 2026-05-15 · unverdicted · novelty 5.0

Proposes RAP, a retrieval-based approximate prior method, to predict performance of symbolic programs and LLM prompts on new tasks using a Bernoulli model and corpus-derived performance distributions.

citing papers explorer

Showing 1 of 1 citing paper.

Predicting Performance of Symbolic and Prompt Programs with Examples cs.LG · 2026-05-15 · unverdicted · none · ref 9
Proposes RAP, a retrieval-based approximate prior method, to predict performance of symbolic programs and LLM prompts on new tasks using a Bernoulli model and corpus-derived performance distributions.

How predictable are large language model capabilities? a case study on big-bench

fields

years

verdicts

representative citing papers

citing papers explorer