pith. sign in

hub

ISSN: 2640-3498

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

hub tools

citation-role summary

background 4

citation-polarity summary

years

2026 11 2025 2

roles

background 4

polarities

background 3 support 1

clear filters

representative citing papers

Discovering Implicit Large Language Model Alignment Objectives

cs.LG · 2026-02-17 · unverdicted · novelty 6.0

Obj-Disco decomposes LLM alignment reward signals into sparse weighted combinations of interpretable natural language objectives via iterative analysis of behavioral changes across checkpoints, capturing over 90% of observed reward behavior.

Order Is Not Control

cs.LG · 2026-06-11 · unverdicted · novelty 5.0

Order is distinct from control, where control is defined as a local receiver-gated response law demonstrated across biological circuits and LLM response panels with reported prediction accuracies of 72-84%.

citing papers explorer

Showing 4 of 4 citing papers after filters.