Boyd and Kate G

Ryan L · 2020 · DOI 10.1126/sciadv.aba2196

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

POLARIS: Guiding Small Models to Write Long Stories

cs.CL · 2026-06-02 · unverdicted · novelty 5.0

POLARIS trains Qwen3.5-9B via GRPO with LLM-as-judge rewards and human-reference injection, yielding a model competitive with larger open-weight models on length adherence and quality, including generalization to 3x training length.

citing papers explorer

Showing 1 of 1 citing paper after filters.

POLARIS: Guiding Small Models to Write Long Stories cs.CL · 2026-06-02 · unverdicted · none · ref 52
POLARIS trains Qwen3.5-9B via GRPO with LLM-as-judge rewards and human-reference injection, yielding a model competitive with larger open-weight models on length adherence and quality, including generalization to 3x training length.

Boyd and Kate G

fields

years

verdicts

representative citing papers

citing papers explorer