Towards Scalable Automated Alignment of LLMs: A Survey

2024 · arXiv 2406.01252

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Pref-CTRL: Preference Driven LLM Alignment using Representation Editing

cs.CL · 2026-04-26 · unverdicted · novelty 6.0

Pref-CTRL trains a multi-objective value function on preferences to guide representation editing for LLM alignment, outperforming RE-Control on benchmarks with better out-of-domain generalization.

Representational Alignment with Chemical Induced Fit for Molecular Relational Learning

cs.LG · 2025-02-07 · unverdicted · novelty 5.0 · 2 refs

ReAlignFit uses chemical induced fit bias and subgraph information bottleneck to dynamically align molecular substructure representations and improve stability on rule-shifted and scaffold-shifted data.

Qwen2.5 Technical Report

cs.CL · 2024-12-19 · unverdicted · novelty 3.0

Qwen2.5 LLMs scale pre-training data to 18 trillion tokens and apply multistage reinforcement learning, achieving benchmark performance competitive with models up to 5 times larger.

citing papers explorer

Pref-CTRL: Preference Driven LLM Alignment using Representation Editing · cs.CL · 2026-04-26 · unverdicted · none · ref 3
Representational Alignment with Chemical Induced Fit for Molecular Relational Learning · cs.LG · 2025-02-07 · unverdicted · none · ref 28 · 2 links
Qwen2.5 Technical Report · cs.CL · 2024-12-19 · unverdicted · none · ref 8
