Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning
GRPO fine-tuning of an 8B model with an authorship-verification-calibrated style judge yields an average style score of 0.893 across Mark Twain, Jane Austen, Charles Dickens, and Thomas Hardy, exceeding open-weight baselines.