pith. sign in

emnlp-main.307/

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CY 1 cs.LG 1

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Discovering Implicit Large Language Model Alignment Objectives

cs.LG · 2026-02-17 · unverdicted · novelty 6.0

Obj-Disco decomposes LLM alignment reward signals into sparse weighted combinations of interpretable natural language objectives via iterative analysis of behavioral changes across checkpoints, capturing over 90% of observed reward behavior.

citing papers explorer

Showing 2 of 2 citing papers.