pith. sign in

Title resolution pending

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.AI 1 cs.CL 1

years

2026 2

verdicts

UNVERDICTED 2

clear filters

representative citing papers

POLARIS: Guiding Small Models to Write Long Stories

cs.CL · 2026-06-02 · unverdicted · novelty 5.0

POLARIS trains Qwen3.5-9B via GRPO with LLM-as-judge rewards and human-reference injection, yielding a model competitive with larger open-weight models on length adherence and quality, including generalization to 3x training length.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • POLARIS: Guiding Small Models to Write Long Stories cs.CL · 2026-06-02 · unverdicted · none · ref 41

    POLARIS trains Qwen3.5-9B via GRPO with LLM-as-judge rewards and human-reference injection, yielding a model competitive with larger open-weight models on length adherence and quality, including generalization to 3x training length.

  • Position: AI Safety Requires Effective Controllability cs.AI · 2026-05-26 · unverdicted · none · ref 36

    Position paper claiming that AI safety requires explicit runtime controllability and introducing ControlBench to demonstrate gaps in existing alignment methods.