Constraint back-translation improves complex instruction following of large language models.arXiv preprint arXiv:2410.24175

Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li · arXiv 2410.24175

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

StoryAlign: Evaluating and Training Reward Models for Story Generation

cs.CL · 2026-05-06 · unverdicted · novelty 7.0

StoryReward, trained on a new 100k story preference dataset, sets state-of-the-art performance on the introduced StoryRMB benchmark for aligning LLM stories with human preferences.

citing papers explorer

Showing 1 of 1 citing paper.

StoryAlign: Evaluating and Training Reward Models for Story Generation cs.CL · 2026-05-06 · unverdicted · none · ref 22
StoryReward, trained on a new 100k story preference dataset, sets state-of-the-art performance on the introduced StoryRMB benchmark for aligning LLM stories with human preferences.

Constraint back-translation improves complex instruction following of large language models.arXiv preprint arXiv:2410.24175

fields

years

verdicts

representative citing papers

citing papers explorer