Make sure the output is less than 100 words

The output should be an appropriate response to the instruction, the input · 2023

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Understanding the Effects of RLHF on LLM Generalisation and Diversity

cs.LG · 2023-10-10 · unverdicted · novelty 6.0

RLHF improves OOD generalization over SFT especially under larger distribution shifts but reduces output diversity, revealing a tradeoff in LLM fine-tuning methods.

citing papers explorer

Showing 1 of 1 citing paper.

Understanding the Effects of RLHF on LLM Generalisation and Diversity

cs.LG · 2023-10-10 · unverdicted · none · ref 14

RLHF improves OOD generalization over SFT especially under larger distribution shifts but reduces output diversity, revealing a tradeoff in LLM fine-tuning methods.
