It includes paired comparison data from base and iterated models, as well as red teaming transcripts designed to expose model vulnerabilities.

hh-rlhf • Publisher: Anthropic • Size: 14M instances • License: MIT • Link: https://github

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Large Language Model Prompt Datasets: An In-depth Analysis and Insights

cs.LG · 2025-10-10 · accept · novelty 7.0

Compilation and linguistic analysis of 129 LLM prompt datasets identifies distinguishing features, with syntactic distributions enabling high-accuracy lightweight routing and quality prediction in three downstream tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Large Language Model Prompt Datasets: An In-depth Analysis and Insights

cs.LG · 2025-10-10 · accept · none · ref 54
