Pref-CTRL trains a multi-objective value function on preferences to guide representation editing for LLM alignment, outperforming RE-Control on benchmarks with better out-of-domain generalization.
Towards Scalable Automated Alignment of LLMs: A Survey
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
ReAlignFit uses chemical induced fit bias and subgraph information bottleneck to dynamically align molecular substructure representations and improve stability on rule-shifted and scaffold-shifted data.
Qwen2.5 LLMs scale pre-training data to 18 trillion tokens and apply multistage reinforcement learning, achieving competitive performance on benchmarks with models up to 5 times larger.
citing papers explorer
-
Pref-CTRL: Preference Driven LLM Alignment using Representation Editing
Pref-CTRL trains a multi-objective value function on preferences to guide representation editing for LLM alignment, outperforming RE-Control on benchmarks with better out-of-domain generalization.
-
Representational Alignment with Chemical Induced Fit for Molecular Relational Learning
ReAlignFit uses chemical induced fit bias and subgraph information bottleneck to dynamically align molecular substructure representations and improve stability on rule-shifted and scaffold-shifted data.
-
Qwen2.5 Technical Report
Qwen2.5 LLMs scale pre-training data to 18 trillion tokens and apply multistage reinforcement learning, achieving competitive performance on benchmarks with models up to 5 times larger.